Software Engineer — GPU Systems

4 days ago


Toronto, Ontario, Canada Arc Compute Full time

Location: Toronto — Onsite

About Arc Compute

Arc Compute operates high-performance GPU clusters and is focused on improving efficiency, throughput, and reliability at scale. We're looking for an Embedded Software Engineer to help build the software that makes our GPU infrastructure faster and more efficient.

What You'll Be Doing

  • Build and improve GPU performance telemetry using CUDA, DCGM and low-level profiling data.
  • Explore scheduling and optimization strategies to make multi-GPU workloads run more efficiently.
  • Optimize, analyze, and tune the performance of DL models in domains such as LLMs, multimodal, and generative AI.
  • Scale performance of DL models across different architectures and types of NVIDIA accelerators.
  • Collaborate with team members and other partners.

What We're Looking For

  • Completed Bachelor's or Master's degree in Computer Engineering or Electrical Engineering, or equivalent experience in a relevant field.
  • 4+ years of work experience in software development, design patterns and software engineering principles.
  • At least 1 year of experience in CUDA development and GPU performance concepts.
  • C/C++ programming and software design skills. Python experience is a plus.
  • Experience with modeling, profiling, debugging, and code optimization, or architectural knowledge of CPUs and GPUs, is a plus.
  • Familiarity with Linux environments and debugging on real hardware.
  • Comfortable working onsite with GPU servers and real workloads.
  • Experience with Git.

Nice to Have

  • Experience deploying or operating systems in Kubernetes, Docker-based environments, or other job orchestration frameworks.
  • Understanding of AI model serving backends, ML runtimes, or AI compilers (e.g., TensorRT, TVM, XLA).
  • Basic experience building or extending backend web services (e.g., REST APIs, data ingestion pipelines, or simple dashboards).

Job Type: Full-time

Application question(s):

  • Are you able to work onsite in Toronto with GPU servers and real workloads?
  • Which of the following have you used professionally? C, C++, Python
  • Have you worked in Linux environments and used Git for version control in production projects?
  • Have you worked with multi-GPU workloads?
  • Have you used profiling tools (Nsight, nvprof, DCGM, or similar)?
  • Have you collaborated with ML researchers or infrastructure teams? How did you translate requirements into performant systems?
  • Have you used Docker, Kubernetes, or job schedulers?
  • Have you worked with AI runtimes or compilers such as TensorRT, TVM, or XLA?

Experience:

  • software engineering: 5 years (required)
  • CUDA or GPU-focused development: 5 years (required)

Work Location: In person


