ML Compiler Engineer

4 weeks ago

Toronto, Canada Amazon Full time

Overview The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit that accelerates deep learning and GenAI workloads on AWS’s custom machine learning accelerators Inferentia and Trainium. AWS Neuron is trusted by customers such as Snap, Autodesk, and Amazon Alexa. Annapurna Labs, acquired by AWS in 2015, is now fully integrated and covers silicon engineering, hardware design, software, and operations. The Neuron Compiler team develops a deep‑learning compiler stack that enables state‑of‑the‑art LLM, vision, and multimodal models to efficiently on our accelerators. This role is part of the performance team in Toronto, focused on analysis and optimization of system‑level performance for machine learning models on AWS ML accelerators. You will work across the stack—from frameworks and compilers to runtime and collectives—to deliver performance improvements and automate them in the SDK. For more information: Key Job Responsibilities models across the entire technology stack, from frameworks to runtime. Conduct detailed performance analysis and profiling of ML workloads, identifying and resolving bottlenecks in large‑scale ML systems. Work directly with customers to enable and optimize their ML models on AWS accelerators, understanding their specific requirements and use cases. Design and implement compiler optimizations, transforming manual performance improvements into automated compiler passes. Collaborate across teams to develop innovative optimization techniques that enhance AWS Neuron SDK’s performance capabilities. Work in a startup‑like development environment, focusing on the most important problems. Basic Qualifications 3+ years of non‑internship professional software development experience. 2+ years of non‑internship design or architecture experience (design patterns, reliability, and scaling) of new and existing systems. Experience programming with at least one software programming language. Preferred Qualifications 3+ years of full software development life cycle experience, including coding standards, code reviews, source control management, build processes, testing, and operations. Bachelor’s degree in computer science or equivalent. Experience in compiler design for CPU/GPU/Vector engines or ML accelerators. Experience with system‑level performance analysis and optimization. Experience with LLVM and/or MLIR. Experience with PyTorch, OpenXLA, StableHLO, JAX, TVM, deep learning models, and algorithms. Equal Opportunity and Accommodations Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, please visit for more information. #J-18808-Ljbffr

ML Compiler Engineer

2 weeks ago

Toronto, Ontario, Canada Amazon Web Services (AWS) Full time US$140,000 - US$200,000 per year

DescriptionThe Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium.The Product: The AWS Machine Learning accelerators (Inferentia/Trainium) offer unparalleled ML inference and training...
ML Compiler Engineer

7 days ago

Toronto, Ontario, CAN, Canada Amazon Full time $120,000 - $180,000 per year

The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. The Product: The AWS Machine Learning accelerators (Inferentia/Trainium) offer unparalleled ML inference and training performances....
Ml Compiler Intern

6 days ago

Toronto, Canada d-Matrix Full time

**About us** d-Matrix is developing a novel hardware system and a full-stack software solution to accelerate large-scale modern deep neural network compute workloads for the cloud. Leveraging a combination of unique in-memory compute array design, digital signal processing system design, and on-chip and chip-to-chip interconnect fabric, d-Matrix's AI...
ML Compiler Engineer

1 week ago

Toronto, Canada Amazon Web Services (AWS) Full time

OverviewJoin to apply for the ML Compiler Engineer, AWS Neuron, Annapurna Labs role at Amazon Web Services (AWS).At AWS our vision is to make deep learning pervasive for everyday developers and to democratize access to innovative infrastructure. In order to deliver on that vision, we’ve created innovative software and hardware solutions that make it...
Senior ML Compiler Engineer for AI Accelerators

3 weeks ago

Toronto, Canada Amazon Full time

A leading technology company in Toronto is seeking a Senior Deep Learning Compiler Engineer to develop compilers targeting AWS Inferentia and Trainium. You will work at the intersection of machine learning and distributed architectures, mentoring a team while driving innovative solutions for large ML workloads. The ideal candidate has extensive software...
ML Compiler Engineer

5 days ago

Toronto, Canada Amazon Web Services (AWS) Full time

Overview Join to apply for the ML Compiler Engineer, AWS Neuron, Annapurna Labs role at Amazon Web Services (AWS). At AWS our vision is to make deep learning pervasive for everyday developers and to democratize access to innovative infrastructure. In order to deliver on that vision, we’ve created innovative software and hardware solutions that make it...
ML Compiler Engineer, AWS Neuron, Annapurna Labs

2 weeks ago

Toronto, Canada Amazon Development Centre Canada ULC Full time

The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon’s custom machine learning accelerators, Inferentia and Trainium. The Product: The AWS Machine Learning accelerators (Inferentia/Trainium) offer unparalleled ML inference and training...
ML Compiler Engineer

4 weeks ago

Toronto, Canada Amazon Full time

Overview The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit that accelerates deep learning and GenAI workloads on AWS’s custom machine learning accelerators Inferentia and Trainium. AWS Neuron is trusted by customers such as Snap, Autodesk, and Amazon Alexa. Annapurna Labs, acquired by AWS in 2015, is now...
ML Compiler Engineer

4 weeks ago

Toronto, Canada Amazon Full time

Overview The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit that accelerates deep learning and GenAI workloads on AWS’s custom machine learning accelerators Inferentia and Trainium. AWS Neuron is trusted by customers such as Snap, Autodesk, and Amazon Alexa. Annapurna Labs, acquired by AWS in 2015, is now...
ML Compiler Engineer

6 days ago

Toronto, Canada Amazon Full time

The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon’s custom machine learning accelerators, Inferentia and Trainium. The Product: The AWS Machine Learning accelerators (Inferentia/Trainium) offer unparalleled ML inference and training...

Americas

Europe

Asia / Oceania

Africa

ML Compiler Engineer