Infrastructure Architect – GPU Test Automation Farm

3 days ago


Markham, Canada Advanced Micro Devices Full time

WHAT YOU DO AT AMD CHANGES EVERYTHING

At AMD, our mission is to build great products that accelerate next-generation computing experiences, from AI and data centers to PCs, gaming, and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity, and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges, striving for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.

THE ROLE

AMD is looking for a highly skilled and experienced systems deployment architect to design, plan, and lead the deployment of a large-scale GPU test automation farm in a datacenter-style environment. This individual will translate AMD's test and validation vision into a robust, modular, and scalable infrastructure capable of supporting continuous integration and validation for next-generation products.

THE PERSON

The ideal candidate combines deep technical expertise in infrastructure design with hands-on experience building large compute farms and automation systems, and has a strong understanding of datacenter operational constraints. They demonstrate strong architectural judgment, operational discipline, and a practical understanding of the technologies that enable scalable infrastructure.

KEY RESPONSIBILITIES

  • Architect and design a distributed, large-scale GPU test automation farm optimized for performance, scalability, and reliability.
  • Lead the deployment and operation of infrastructure in datacenter-like environments, ensuring compliance with standards for power, cooling, networking, and management systems.
  • Define and enforce best practices for system configuration, monitoring, and fault tolerance to ensure high availability and performance.
  • Collaborate with cross-functional teams (QA, IT, software, datacenter ops, and engineering) to deliver seamless test workflows and system integration.
  • Evaluate and implement technologies that improve deployment efficiency, system observability, and scalability (containerization, virtualization, orchestration, MaaS, etc.).
  • Mentor engineers in infrastructure design principles and contribute to the overall architectural vision of AMD's GPU validation environment.

PREFERRED EXPERIENCE

  • Proven expertise in GPU or HPC cluster environments, including system provisioning, scheduling, and performance tuning.
  • Expert background in Windows and Linux administration, including automation tools and scripting.
  • Experience with automation frameworks (Ansible, Terraform, etc.) and CI/CD pipelines for infrastructure deployment.
  • Hands-on experience with MaaS (Metal-as-a-Service) platforms for large-scale bare-metal provisioning.
  • Knowledge of network boot (PXE, iPXE, UEFI) configurations and automation.
  • Experience building or integrating inventory health management systems, including real-time monitoring of servers, network devices, and supporting services.
  • Skilled in space allocation and racking strategies in datacenter or lab environments.
  • Deep understanding of power planning for dense compute environments.
  • Experience with network design and topology optimization for high-throughput data paths.

ACADEMIC CREDENTIALS

Bachelor's or Master's degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent.

LOCATION

Markham, Ontario, Canada

Benefits offered are described: AMD benefits at a glance.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.



  • Markham, Canada Advanced Micro Devices Full time

    WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next‑generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create...


  • Markham, Ontario, Canada AMD Full time

    WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create...


  • Markham, Canada Advanced Micro Devices Full time

    A leading semiconductor company is seeking a skilled Systems Deployment Architect in Markham, Ontario, to design and lead a large-scale GPU test automation farm. The ideal candidate will have deep technical expertise in infrastructure design, extensive hands-on experience, and knowledge of datacenter operations. Responsibilities include defining best...


  • Markham, Canada AMD Full time

    Staff Software Development Engineer – Test Content Architect Join AMD as a Staff Software Development Engineer, focusing on test content architecture for GPU validation systems. Base Pay Range $148,720.00/yr - $223,080.00/yr Overview At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data...


  • Markham, Canada Advanced Micro Devices Full time

    A leading technology company is seeking a Test Content Architect based in Markham, Ontario. You will drive the definition and execution of the test content strategy for large-scale GPU validation. The ideal candidate will have a strong technical background in test automation and system architecture, combined with leadership skills to guide diverse...


  • Markham, Canada Advanced Micro Devices Full time

    WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create...

