ML Compiler Engineer

6 days ago


Toronto, Canada Amazon Web Services (AWS) Full time

Overview Join to apply for the ML Compiler Engineer, AWS Neuron, Annapurna Labs role at Amazon Web Services (AWS). At AWS our vision is to make deep learning pervasive for everyday developers and to democratize access to innovative infrastructure. In order to deliver on that vision, we’ve created innovative software and hardware solutions that make it possible. AWS Neuron is the SDK that optimizes the performance of complex neural net models executed on AWS Inferentia and Trainium, our custom chips designed to accelerate deep-learning workloads. The Neuron SDK consists of a compiler, run-time, and debugger, integrated with Tensorflow, PyTorch, and MXNet. It’s preinstalled in AWS Deep Learning AMIs and Deep Learning Containers for customers to quickly get started with running high performance and cost-effective inference. Role The Neuron team is hiring senior compiler engineers to solve our customers toughest problems. This is an opportunity to work on innovative products at the intersection of machine-learning, high-performance computing, and distributed architectures. You will architect and implement business-critical features, publish innovative research, and mentor a brilliant team of experienced engineers. We operate in spaces that are very large, yet our teams remain small and agile. There is no blueprint. We're inventing. We're experimenting. It is a very unique learning culture. Responsibilities As a senior deep learning compiler engineer on the Neuron team, you will be a thought leader supporting the development of a compiler targeting AWS Inferentia and Trainum. You will be developing and scaling the compiler to handle the world\'s largest ML workloads. You will leverage your technical communications skill as a hands-on partner to AWS ML services teams and you will be involved in pre-silicon design, bringing new products/features to market, and many other exciting projects. Qualifications Basic Qualifications 3+ years of non-internship professional software development experience 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience Experience programming with at least one software programming language Preferred Qualifications 3+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience Bachelor\'s degree in computer science or equivalent Experience in compiler design for CPU/GPU/Vector engines/ML-accelerators Experience with System Level performance analysis and optimization Experience with LLVM and/or MLIR Experience with the following technologies: PyTorch, OpenXLA, StableHLO, JAX, TVM, deep learning models, and algorithms Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner. Company - Amazon Canada Fulfillment Services, ULC Job ID: A #J-18808-Ljbffr


  • ML Compiler Engineer

    2 weeks ago


    Toronto, Ontario, Canada Amazon Web Services (AWS) Full time US$140,000 - US$200,000 per year

    DescriptionThe Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium.The Product: The AWS Machine Learning accelerators (Inferentia/Trainium) offer unparalleled ML inference and training...


  • Toronto, Ontario, CAN, Canada Amazon Full time $120,000 - $180,000 per year

    The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. The Product: The AWS Machine Learning accelerators (Inferentia/Trainium) offer unparalleled ML inference and training performances....

  • Ml Compiler Intern

    7 days ago


    Toronto, Canada d-Matrix Full time

    **About us** d-Matrix is developing a novel hardware system and a full-stack software solution to accelerate large-scale modern deep neural network compute workloads for the cloud. Leveraging a combination of unique in-memory compute array design, digital signal processing system design, and on-chip and chip-to-chip interconnect fabric, d-Matrix's AI...

  • ML Compiler Engineer

    2 weeks ago


    Toronto, Canada Amazon Web Services (AWS) Full time

    OverviewJoin to apply for the ML Compiler Engineer, AWS Neuron, Annapurna Labs role at Amazon Web Services (AWS).At AWS our vision is to make deep learning pervasive for everyday developers and to democratize access to innovative infrastructure. In order to deliver on that vision, we’ve created innovative software and hardware solutions that make it...


  • Toronto, Canada Amazon Full time

    A leading technology company in Toronto is seeking a Senior Deep Learning Compiler Engineer to develop compilers targeting AWS Inferentia and Trainium. You will work at the intersection of machine learning and distributed architectures, mentoring a team while driving innovative solutions for large ML workloads. The ideal candidate has extensive software...

  • ML Compiler Engineer

    4 weeks ago


    Toronto, Canada Amazon Full time

    Overview The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit that accelerates deep learning and GenAI workloads on AWS’s custom machine learning accelerators Inferentia and Trainium. AWS Neuron is trusted by customers such as Snap, Autodesk, and Amazon Alexa. Annapurna Labs, acquired by AWS in 2015, is now...


  • Toronto, Canada Amazon Development Centre Canada ULC Full time

    The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon’s custom machine learning accelerators, Inferentia and Trainium. The Product: The AWS Machine Learning accelerators (Inferentia/Trainium) offer unparalleled ML inference and training...

  • ML Compiler Engineer

    4 weeks ago


    Toronto, Canada Amazon Full time

    Overview The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit that accelerates deep learning and GenAI workloads on AWS’s custom machine learning accelerators Inferentia and Trainium. AWS Neuron is trusted by customers such as Snap, Autodesk, and Amazon Alexa. Annapurna Labs, acquired by AWS in 2015, is now...

  • ML Compiler Engineer

    4 weeks ago


    Toronto, Canada Amazon Full time

    Overview The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit that accelerates deep learning and GenAI workloads on AWS’s custom machine learning accelerators Inferentia and Trainium. AWS Neuron is trusted by customers such as Snap, Autodesk, and Amazon Alexa. Annapurna Labs, acquired by AWS in 2015, is now...


  • Toronto, Canada Amazon Full time

    The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon’s custom machine learning accelerators, Inferentia and Trainium. The Product: The AWS Machine Learning accelerators (Inferentia/Trainium) offer unparalleled ML inference and training...