Senior AI Inference Performance Engineer

1 month ago


Waterloo, Ontario, Canada Huawei Technologies Canada Co., Ltd. Full time
Job Summary

We are seeking a highly skilled Principal Engineer to lead our AI Inference Performance team. The successful candidate will be responsible for developing and maintaining real-time and historical performance monitoring tools for AI inference workloads, including profiling tools for various AI model types.

Key Responsibilities
  • Develop and maintain performance monitoring tools for AI inference workloads, including profiling tools for various AI model types.
  • Analyze and classify inference workloads based on characteristics like profile, decode, pre/post-processing overheads, and computational complexity.
  • Develop performance models that consider the systematic factors of AI inference, including model size, architecture, and compute resource characteristics.
  • Optimize inference workloads across various hardware resources by reducing latency, minimizing memory overhead, and improving throughput.
  • Lead efforts in creating benchmarks for different types of inference tasks and conduct benchmarking and performance comparisons across various hardware platforms.
  • Work closely with AI research, software engineering, and DevOps teams to improve the end-to-end AI inference pipeline.
Requirements
  • Ph.D. or Master's degree in Computer Science, Electrical Engineering, Machine Learning, or related field.
  • Minimum 5+ years of experience in AI/ML engineering with a focus on inference performance, workload analysis, and system optimization.
  • Extensive experience with AI frameworks and model optimization techniques.
  • Proficient with profiling tools and workload analysis for diverse AI models and applications.
  • Expertise in optimizing small models, large language models, VLMs, and multimodal models for inference.
  • Strong programming skills in Python, C++, CUDA, and experience with low-level hardware performance tuning.
  • Familiarity with performance modeling methodologies and frameworks for predicting inference workload performance under varying conditions.


  • Waterloo, Ontario, Canada Huawei Technologies Canada Co., Ltd. Full time

    About the RoleAt Huawei Technologies Canada Co., Ltd., we are seeking an exceptional American English-speaking Senior AI Inference Performance Optimization Engineer to join our team. As a key member of our team, you will be responsible for developing and maintaining performance monitoring tools, supporting profiling and analyzing inference workloads, and...


  • Waterloo, Ontario, Canada Huawei Technologies Canada Co., Ltd. Full time

    Job Title: AI Inference Performance EngineerAbout the Role:We are seeking a highly skilled AI Inference Performance Engineer to join our team at Huawei Technologies Canada Co., Ltd. The successful candidate will be responsible for developing and maintaining real-time and historical performance monitoring tools for AI inference workloads.Key...


  • Waterloo, Ontario, Canada Huawei Technologies Canada Co., Ltd. Full time

    Job DescriptionOur team at Huawei Technologies Canada Co., Ltd. has an exciting opportunity for an Assistant Engineer to join our AI Inference Performance team.Responsibilities:Collaborate with senior engineers to develop and maintain performance monitoring tools for AI inference workloads.Support the analysis of inference workloads to identify performance...

  • Senior AI Engineer

    7 days ago


    Waterloo, Ontario, Canada Huawei Technologies Canada Co., Ltd. Full time

    About the RoleHuawei Technologies Canada Co., Ltd. seeks a highly skilled Principal Engineer - Machine Learning Systems to join our team. The ideal candidate will have a strong background in AI/ML engineering, with a focus on inference performance, workload analysis, and system optimization.Key ResponsibilitiesDevelop and maintain real-time and historical...

  • Senior AI Engineer

    1 month ago


    Waterloo, Ontario, Canada Huawei Technologies Canada Co., Ltd. Full time

    Senior AI EngineerWe are seeking a highly skilled Senior AI Engineer to join our team at Huawei Technologies Canada Co., Ltd. As a key member of our AI engineering team, you will be responsible for developing and maintaining real-time and historical performance monitoring tools for AI inference workloads.Key Responsibilities:Develop and maintain performance...


  • Waterloo, Ontario, Canada Borealis AI Full time

    RBC Borealis is seeking a highly skilled Senior AI Research Lead to spearhead the development of cutting-edge AI-based products for the financial services industry. This key role will provide strategic leadership and direction to a team of machine learning researchers and engineers, driving the effectiveness of the team to deliver high-value business...


  • Waterloo, Ontario, Canada Huawei Technologies Canada Co., Ltd. Full time

    Huawei Technologies Canada Co., Ltd. has an immediate permanent opening for a High-Level AI Data Security Specialist.Job DescriptionWe are seeking a seasoned expert in AI data security to join our team. The successful candidate will be responsible for researching and analyzing state-of-the-art AI data security technologies applicable to various scenarios,...


  • Waterloo, Ontario, Canada Huawei Technologies Canada Co., Ltd. Full time

    Our team at Huawei Technologies Canada Co., Ltd. is seeking a highly skilled Technical Expert to lead our AI data security efforts.Responsibilities:Conduct in-depth research and analysis of cutting-edge AI data security technologies, focusing on consumer applications and cloud environments, including traditional AI and large language models (LLMs).Ensure the...


  • Waterloo, Ontario, Canada Borealis AI Full time

    RBC Borealis seeks an accomplished AI Research Director to spearhead a team of researchers and engineers in crafting innovative AI-based products for the financial services industry. As a key member of our organization, you will provide strategic leadership and direction to drive the effectiveness of your team, enabling them to deliver high-value business...


  • Waterloo, Ontario, Canada Huawei Technologies Canada Co., Ltd. Full time

    Job Description:We are seeking a highly skilled Technical Expert to join our team at Huawei Technologies Canada Co., Ltd. as a Technical Expert AI Data Security. In this role, you will be responsible for researching and analyzing state-of-the-art AI data security technologies applicable to consumer applications and cloud environments, covering all stages of...

  • Data Scientist

    3 weeks ago


    Waterloo, Ontario, Canada Dental Corp Full time

    Job Title: AI and Machine Learning EngineerAbout the Role:As an AI and Machine Learning Engineer, you will be responsible for designing, developing, and deploying artificial intelligence and machine learning models. You will work closely with cross-functional teams to integrate AI and ML solutions into our products and services.Key Responsibilities:Design...


  • Waterloo, Ontario, Canada Huawei Technologies Canada Co., Ltd. Full time

    Job OverviewHuawei Technologies Canada Co., Ltd. is seeking a highly skilled Data Science Engineer to join our team.About the RoleWe are looking for an experienced Data Science Engineer to design, implement, and benchmark AI models and algorithms for proof-of-concept development.In this role, you will work closely with senior researchers on various...

  • Senior VP Engineering

    1 month ago


    Waterloo, Ontario, Canada iGUIDE Full time

    At iGUIDE, we're shaping the future of property data and virtual tours. As our Senior VP of Engineering, you'll lead a diverse team of 20 professionals across AI, R&D, hardware, and UI/UX. You'll report directly to the CEO and be an integral member of the executive team, driving our vision for growth and technical excellence.Key Responsibilities:Lead the...


  • Waterloo, Ontario, Canada Google Inc. Full time

    Software Engineer III, AI/Machine Learning ExpertAt Google Cloud, we're looking for a highly skilled Software Engineer III, AI/Machine Learning Expert to join our team. As a key member of our software development team, you will be responsible for designing, developing, and deploying cutting-edge AI and machine learning solutions that drive business growth...


  • Waterloo, Ontario, Canada Huawei Technologies Canada Co., Ltd. Full time

    About the RoleAt Huawei Technologies Canada Co., Ltd., we are seeking an exceptional individual to fill a 6-month internship position as an AI Systems Engineer. This opportunity will enable you to contribute significantly to our team while expanding your knowledge and skills in Requirements Engineering (RE) for AI systems.Key Responsibilities:Conduct...

  • Software Engineer III

    1 month ago


    Waterloo, Ontario, Canada Google Inc. Full time

    Software Engineer III - AI/MLAt Google Cloud, we're looking for talented software engineers to join our team and help us develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another.About the RoleWe're seeking software engineers with expertise in AI/ML to work on critical projects...


  • Waterloo, Ontario, Canada Huawei Technologies Canada Co., Ltd. Full time

    Job Title: AII Engineer Internship Opportunity for Research and DevelopmentAbout the Job:This is an exciting opportunity to work with our team at Huawei Technologies Canada Co., Ltd. as an AI Engineer Intern. The role involves conducting literature reviews on existing Requirements Engineering practices and challenges in AI systems, assisting in identifying...


  • Waterloo, Ontario, Canada Goiguide Full time

    Job Title: Senior VP EngineeringAbout Goiguide: Goiguide is the maker of iGUIDE, a proprietary camera and software platform for capturing and delivering accurate floorplans, immersive 3D virtual tours, and extensive property data. iGUIDE is the most efficient system to map interior spaces and features accurate floor plans, measurements, and reliable property...


  • Waterloo, Ontario, Canada Huawei Technologies Canada Co., Ltd. Full time

    Job Title: Data Engineer for AI Value AlignmentAbout the Role:We are seeking a highly skilled Data Engineer to join our team at Huawei Technologies Canada Co., Ltd. in the role of Data Engineer for AI Value Alignment. The successful candidate will be responsible for developing a robust value framework and benchmarks to assess the alignment of Large Language...


  • Waterloo, Ontario, Canada Huawei Technologies Canada Co., Ltd. Full time

    Company OverviewHuawei Technologies Canada Co., Ltd. is a leading global provider of information and communications technology infrastructure and smart devices.SalaryThe estimated salary range for this position is between $140,000 and $200,000 per year, depending on experience.Job DescriptionWe are seeking an experienced Digital Engineering Architect to join...