AI Performance Engineer

2 weeks ago


Markham, Canada Qualcomm Full time

AI Performance Engineer (Cloud AI Engineering) – Senior – Staff – Senior Staff role at Qualcomm. Engineering Group, Machine Learning Engineering.

Job Summary

Qualcomm is utilizing its traditional strengths in digital wireless technologies to play a central role in the evolution of Cloud AI. We are investing in several supporting technologies including Deep Learning. The Qualcomm Cloud AI team is developing hardware and software solutions for Inference Acceleration.

Responsibilities

  • Convert, optimize and deploy models for efficient inference using PyTorch, ONNX.
  • Work at the forefront of GenAI by understanding advanced algorithms e.g. attention mechanisms, MoEs and numerics to identify new optimization opportunities.
  • Performance analysis and optimization of LLM, VLM, and diffusion models for inference.
  • Scale performance for throughput and latency constraints.
  • Mapping the next generation AI workloads on top of current and future hardware designs.
  • Work closely with customers to drive solutions by collaborating with internal compiler, firmware and platform teams.
  • Analyze complex performance or stability issues to work towards final root cause of underlying problems.
  • Create engineering solutions to deliver continuous insights into performance of AI workloads guiding the improvements over time.
  • Design and implement high-level kernels, e.g. in Triton, with a focus on generating efficient, low-level code.

Qualifications

  • Hands-on experience in building and optimizing language models, notably in PyTorch, ONNX, preferably in production-grade environments.
  • Deep understanding of transformer architectures, attention mechanisms and performance trade-offs.
  • Experience in workload mapping strategies exhibiting sharding or various parallelisms.
  • Strong Python programming skills.
  • Proactive learning about the latest inference optimization techniques.
  • Understanding of computer architecture, ML accelerators, in-memory processing and distributed systems.
  • Strong communication, problem-solving skills and ability to learn and work effectively in a fast-paced and collaborative environment.
  • MS in Computer Science, Machine Learning, Computer Engineering or Electrical Engineering.

Bonus Skills

  • Background in neural network operators and mathematical operations, including linear algebra and math libraries.
  • Understanding of machine learning compilers.
  • Experience in converging accuracy and its evaluation methods.
  • Knowledge of torch.compile or torchDynamo.
  • PhD in Computer Science, Computer Engineering or Machine Learning.

Minimum Qualifications

  • Bachelor's degree in Computer Science, Engineering, Information Systems, or related field and 6+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.
  • Master's degree in Computer Science, Engineering, Information Systems, or related field and 5+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.
  • PhD in Computer Science, Engineering, Information Systems, or related field and 4+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.

Pay Range And Other Compensation & Benefits

$178,400.00 - $267,600.00

Qualcomm offers competitive annual discretionary bonus program and RSU grants. Contact Qualcomm Careers for more details.

Equal Opportunity and Accessibility Statement

Qualcomm is an equal opportunity employer. If you are an individual with a disability and need an accommodation during the application/hiring process, rest assured that Qualcomm is committed to providing an accessible process. Contact disability-accomodations@qualcomm.com or call Qualcomm's toll‑free number for reasonable accommodations.

Qualcomm is an equal opportunity employer; all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or any other protected classification.



  • Markham, Canada Advanced Micro Devices inc. Full time

    WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create...

  • Senior Engineer AI

    2 weeks ago


    Markham, Canada Honda Canada Inc. Full time

    Join to apply for the Senior Engineer AI role at Honda Canada Inc. The primary purpose of this role is to leverage advanced data engineering and AI system architecture skills to design, develop, and deploy innovative AI solutions that address complex business challenges. This role is crucial in transforming raw data into actionable insights and intelligent...


  • Markham, Ontario, Canada Advanced Micro Devices, Inc Full time US$60,000 - US$200,000 per year

    WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create...

  • Senior Engineer AI

    2 weeks ago


    Markham, Ontario, Canada Honda Canada Inc. Full time $120,000 - $200,000 per year

    The primary purpose of the Senior AI Engineer role is to leverage advanced data engineering and AI system architecture skills to design, develop, and deploy innovative AI solutions that address complex business challenges. This role is crucial in transforming raw data into actionable insights and intelligent applications that drive strategic decision-making...


  • Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada AI Jobs Full time

    A leading technology research firm in Canada is seeking experienced engineers to work on an innovative AI research project. The role entails developing high‑rigor engineering problems and evaluating AI solutions for accuracy and feasibility. Candidates should have an advanced degree in engineering and be skilled in LaTeX for documentation. This is a...


  • Markham, Canada Huawei Canada Full time

    About the team Huawei Canada has an immediate 12-month contract opening for an Engineer. Established in 2014, the Distributed Scheduling and Data Engine Lab is Huawei Cloud's technical innovation center in Canada. The lab focuses on researching and developing advanced cloud technologies, supporting the productization and iterative optimization of its...


  • Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Surrey, Victoria, London, Halton Hills, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Sepal AI Full time

    A tech research firm in Canada is seeking experienced database administrators to influence AI systems evaluation standards. This fully remote short-term project offers hourly compensation ranging from $38 to $87 based on experience. Participants will review AI-generated database queries and provide expert insights into database administration for safety and...