Senior Software Engineer — GPU Systems

1 day ago


Toronto, Ontario, Canada Arc Compute Full time

Location: Toronto — Onsite

About Arc Compute

Arc Compute operates high-performance GPU clusters and is focused on improving efficiency, throughput, and reliability at scale. We're looking for a Senior Embedded Software Engineer to help build the software that makes our GPU infrastructure faster and more efficient.

What You'll Be Doing

  • Build and improve GPU performance telemetry using CUDA, DCGM and low-level profiling data.
  • Explore scheduling and optimization strategies to make multi-GPU workloads run more efficiently.
  • Optimize, analyze, and tune the performance of DL models in domains such as LLMs, multimodal, and generative AI.
  • Scale performance of DL models across different architectures and types of NVIDIA accelerators.
  • Collaborate with team members and other partners.

What We're Looking For

  • 7+ years of professional software development experience, including design patterns and software engineering principles.
  • 3+ years of experience in CUDA development and GPU performance concepts.
  • Proven experience owning and architecting performance-critical systems or telemetry pipelines.
  • Mentorship experience or demonstrated ability to guide junior and mid-level engineers.
  • C/C++ programming and software design skills. Python experience is a plus.
  • Experience with modeling, profiling, debugging, and code optimization, or architectural knowledge of CPUs and GPUs.
  • Familiarity with Linux environments and debugging on real hardware.
  • Comfortable working onsite with GPU servers and real workloads.
  • Experience with Git.

Nice to Have

  • Experience deploying or operating systems in Kubernetes, Docker-based environments, or other job orchestration frameworks.
  • Understanding of AI model serving backends, ML runtimes, or AI compilers (e.g., TensorRT, TVM, XLA).
  • Experience with performance profiling tools such as Nsight Systems/Compute.
  • Experience leading performance investigations or driving cross-team initiatives.

Job Type: Full-time

Application question(s):

  • Are you able to work onsite in Toronto with GPU servers and real workloads?
  • Which of the following have you used professionally? Please briefly describe your primary language.

☐ C

☐ C++

☐ Python

  • Have you worked in Linux environments and used Git for version control in production projects?
  • Have you owned or architected a GPU performance or telemetry system end-to-end?
  • Have you used CUDA-based profiling or telemetry tools (e.g., DCGM, Nsight Systems, Nsight Compute)?
  • Have you led investigations to identify and resolve GPU performance bottlenecks?
  • Have you optimized performance for LLMs, multimodal, or generative models? What techniques did you apply (e.g., kernel fusion, memory optimization, batching)?
  • Have you used Git in a collaborative, production codebase?
  • Have you deployed or operated systems using Docker or Kubernetes?
  • Have you led or driven cross-team performance initiatives?

Experience:

  • Software engineering: 7 years (required)
  • CUDA or GPU-focused development: 3 years (required)

Work Location: In person


