Sr. AI/ML Software System Design Engineer

11 hours ago


Markham, Ontario, Canada AMD Full time

WHAT YOU DO AT AMD CHANGES EVERYTHING
At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond.
Together, we advance your career.
The Role
As a Sr. AI/ML Engineer, you will lead the design and implementation of advanced AI/ML architectures across AMD's GPU and data center platforms. This global technical leadership role focuses on defining strategies for AI-driven validation methodologies, ensuring robust system performance, scalability, and reliability. You will collaborate across silicon, firmware, hardware, and software teams to deliver optimized AI solutions for next-generation computing experiences.

The Person
You are passionate about AI/ML technologies and system architecture, with a strong ability to innovate and solve complex technical challenges. You thrive in a collaborative environment, influencing cross-functional teams and driving architectural decisions that shape the future of AI computing. Your curiosity and leadership will enable continuous improvement and excellence in

AMD's AI solutions.

Key Responsibilities

  • Define and drive AI architecture strategies for GPU-based platforms and distributed systems.
  • Collaborate with engineering teams to design and optimize AI/ML workloads for performance, scalability, and efficiency, while architecting AI/ML solutions that integrate into innovative validation methodologies for driver code and hardware.
  • Develop AI-driven frameworks for automated testing, predictive analytics, and intelligent bug triage to accelerate validation cycles.
  • Lead architectural reviews and provide guidance on design decisions for AI frameworks, drivers, and system integration.
  • Create reference designs and benchmarks for AI workloads, ensuring alignment with industry standards.
  • Drive automation and validation strategies for AI solutions, including cluster-scale deployments.
  • Partner with customers and internal teams to deliver end-to-end AI solutions for data centers and edge platforms.
  • Mentor junior engineers and foster technical innovation across teams.
  • Provide regular updates on architectural progress and influence roadmap decisions.

Preferred Experience

  • Strong background in AI/ML frameworks such as PyTorch, TensorFlow, ONNX Runtime, and familiarity with Hugging Face for model fine-tuning and deployment.
  • Experience with GPU computing and ROCm software stack, including libraries like MIGraphX, rocBLAS, and MIOpen.
  • Knowledge of distributed systems and performance optimization for AI workloads.
  • Proficiency in C/C++, Python, and Linux environments; experience with HIP for GPU programming.
  • Familiarity with networking technologies such as RDMA and RoCE for high-performance data transfer in cluster environments.
  • Excellent communication, leadership, and problem-solving skills.
  • Proven track record of delivering complex, multi-functional AI solutions in fast-paced environments.

Academic Credentials

  • Bachelor's or Master's degree in Computer or Electrical Engineering or equivalent

Benefits offered are described:
AMD benefits at a glance.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
AMD may use Artificial Intelligence to help screen, assess or select applicants for this position. AMD's "Responsible AI Policy" is available here.
This posting is for an existing vacancy.



  • Markham, Ontario, Canada AMD Full time

    WHAT YOU DO AT AMD CHANGES EVERYTHINGAt AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create...


  • Markham, Ontario, Canada GE Vernova Full time

    Job Description SummaryGE Vernova is accelerating the path to more reliable, affordable, and sustainable energy, while helping our customers power economies and deliver the electricity that is vital to health, safety, security, and improved quality of life. Are you excited at the opportunity to electrify and decarbonize the world?We are seeking a highly...


  • Markham, Ontario, Canada GE Vernova Full time

    Job Description SummaryGE Vernova is accelerating the path to more reliable, affordable, and sustainable energy, while helping our customers power economies and deliver the electricity that is vital to health, safety, security, and improved quality of life. Are you excited at the opportunity to electrify and decarbonize the world?We are seeking a highly...


  • Markham, Ontario, Canada Advanced Micro Devices, Inc Full time

    WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create...


  • Markham, Ontario, Canada AMD Full time

    WHAT YOU DO AT AMD CHANGES EVERYTHINGAt AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create...


  • Markham, Ontario, Canada Qualcomm Full time

    CompanyQualcomm Canada ULCJob AreaEngineering Group, Engineering Group > Machine Learning EngineeringGeneral SummaryAs a leading technology innovator, Qualcomm pushes the boundaries of what's possible to enable next-generation experiences and drives digital transformation to help create a smarter, connected future for all. As a Qualcomm Machine Learning...


  • Markham, Ontario, Canada Advanced Micro Devices, Inc Full time

    WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create...


  • Markham, Ontario, Canada Qualcomm Full time

    CompanyQualcomm Canada ULCJob AreaEngineering Group, Engineering Group > Software EngineeringGeneral SummaryWe are seeking a highly skilled and self drivenCybersecurity engineerto join our AI software development organization. This role is critical in ensuring the security and integrity of our AI platforms, SDKs, and data pipelines. You will guide the work...


  • Markham, Ontario, Canada AMD Full time

    WHAT YOU DO AT AMD CHANGES EVERYTHINGAt AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create...


  • Markham, Ontario, Canada AMD Full time

    WHAT YOU DO AT AMD CHANGES EVERYTHINGAt AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create...