Senior Software Engineer — GPU Systems
1 day ago
Location: Toronto — Onsite
About Arc Compute
Arc Compute operates high-performance GPU clusters and is focused on improving efficiency, throughput, and reliability at scale. We're looking for a Senior Embedded Software Engineer to help build the software that makes our GPU infrastructure faster and more efficient.
What You'll Be Doing
- Build and improve GPU performance telemetry using CUDA, DCGM and low-level profiling data.
- Explore scheduling and optimization strategies to make multi-GPU workloads run more efficiently.
- Analyze, optimize, and tune the performance of DL models in domains such as LLMs, multimodal, and generative AI.
- Scale performance of DL models across different architectures and types of NVIDIA accelerators.
- Collaborate with team members and other partners.
What We're Looking For
- 7+ years of professional software development experience, with a solid grasp of design patterns and software engineering principles.
- 3+ years of experience in CUDA development and GPU performance concepts.
- Proven experience owning and architecting performance-critical systems or telemetry pipelines.
- Mentorship experience or demonstrated ability to guide junior and mid-level engineers.
- C/C++ programming and software design skills. Python experience is a plus.
- Experience with modeling, profiling, debugging, and code optimization, or architectural knowledge of CPUs and GPUs.
- Familiarity with Linux environments and debugging on real hardware.
- Comfortable working onsite with GPU servers and real workloads.
- Experience with Git.
Nice to Have
- Experience deploying or operating systems in Kubernetes, Docker-based environments, or other job orchestration frameworks.
- Understanding of AI model serving backends, ML runtimes, or AI compilers (e.g., TensorRT, TVM, XLA).
- Experience with performance profiling tools such as Nsight Systems/Compute.
- Experience leading performance investigations or driving cross-team initiatives.
Job Type: Full-time
Application question(s):
- Are you able to work onsite in Toronto with GPU servers and real workloads?
- Which of the following have you used professionally? Please briefly describe your primary language.
☐ C
☐ C++
☐ Python
- Have you worked in Linux environments and used Git for version control in production projects?
- Have you owned or architected a GPU performance or telemetry system end-to-end?
- Have you used CUDA-based profiling or telemetry tools (e.g., DCGM, Nsight Systems, Nsight Compute)?
- Have you led investigations to identify and resolve GPU performance bottlenecks?
- Have you optimized performance for LLMs, multimodal, or generative models? What techniques did you apply (e.g., kernel fusion, memory optimization, batching)?
- Have you used Git in a collaborative, production codebase?
- Have you deployed or operated systems using Docker or Kubernetes?
- Have you led or driven cross-team performance initiatives?
Experience:
- Software engineering: 7 years (required)
- CUDA or GPU-focused development: 3 years (required)
Work Location: In person