Researcher - Reinforcement Learning

1 week ago


Edmonton, Alberta, Canada | Huawei Technologies Canada Co., Ltd. | Full time | $60,000 - $90,000 per year

Huawei Canada has an immediate 12-month contract opening for a Reinforcement Learning Researcher.


About the team:

Founded in 2012, the Noah's Ark Lab has evolved into a prominent research organization with notable achievements in academia and industry. The lab's mission is to advance artificial intelligence and related fields to benefit the company and society. Driven by impactful, long-term projects, the lab aims to advance state-of-the-art research while integrating innovations into the company's products and services, including LLMs, RL, NLP, computer vision, AI theory, and autonomous driving.

About the job:

  • Enabling Large Language Models (LLMs) to learn from experience, interaction, and environment feedback, moving beyond static fine-tuning toward continual, agentic self-improvement.

  • LLM post-training paradigms (e.g., RLHF, GRPO, reward-free methods).

  • Agentic reinforcement learning for tool-using and browsing-based LLMs trained in interactive environments.

  • Agentic evaluation and benchmarking, including design of multi-turn, verifiable reasoning tasks.

  • Your work will involve implementing and evaluating new training and evaluation pipelines for reasoning-enhanced LLMs and tool-using agents, scaling experiments on large GPU clusters, and contributing to scientific insights and publications in this emerging area.


About the ideal candidate:

  • PhD in Computer Science or a related field, or a master's degree with comparable experience.

  • Strong foundation in deep learning, including architectures such as Transformers and optimization techniques for large models.

  • Practical or research experience in reinforcement learning, self-supervised learning, or language model fine-tuning.

  • Proven research record in AI, with at least one first-author paper at a top-tier venue such as NeurIPS, ICML, ICLR, CVPR, ICCV, ECCV, or ICRA.

  • Solid proficiency in Python and experience with PyTorch and distributed training frameworks such as DeepSpeed and Megatron.

  • Familiarity with LLM post-training pipelines (RLHF, GRPO/PPO, SFT, LoRA, MoE, etc.) is a strong asset.

  • Experience with multi-agent RL or with tool-use, browser, or coding agents is a strong asset.

  • Strong communication and writing skills; enthusiasm for open research and collaborative problem-solving.


