Reinforcement Learning Engineer

3 days ago


Cambridge, Ontario, Canada | Axibo | Full time | $120,000 - $180,000 per year
About AXIBO

AXIBO is a robotics company that designs, prototypes, and manufactures advanced robotic systems, all under one roof. We take pride in delivering robust, reliable products that power automation across industries. Our fast-paced environment demands high levels of precision, organization, and execution, not just in engineering but across all functions.

Position Overview

As a Reinforcement Learning Engineer, you will develop and deploy machine learning systems that enable intelligent behaviors in our humanoid and legged robots. You'll work at the intersection of control theory, deep learning, and robotics—helping close the loop between simulation and reality to bring adaptive behaviors into real-world machines.

Key Responsibilities
  • Develop reinforcement learning agents for robotic control tasks such as locomotion, manipulation, and dynamic balance

  • Implement learning architectures using policy gradient methods, actor-critic frameworks, and off-policy algorithms (e.g., PPO, SAC, TD3)

  • Build reward functions, curriculum learning strategies, and simulation environments tailored for real-world transfer

  • Design multi-agent training pipelines, including distributed rollouts, experience replay, and adaptive difficulty scaling

  • Interface with Isaac Gym, MuJoCo, Brax, and custom physics simulators to run large-scale experiments

  • Work with hardware and firmware teams to deploy trained policies to embedded or real-time environments

  • Design diagnostic tools and visualization dashboards to monitor training progress and system behavior

  • Apply domain randomization, sim2real techniques, and sensor noise modeling to enhance policy robustness

  • Maintain code quality through version control, testing, and modular design

  • Stay current with academic literature and integrate novel RL methods as appropriate

Required Skills and Qualifications
  • Bachelor's or Master's degree in Computer Science, Engineering, Robotics, or a related field

  • 2+ years of hands-on experience applying deep reinforcement learning to simulation or robotic control tasks

  • Strong grasp of machine learning fundamentals and control theory

  • Proficiency with PyTorch, JAX, or TensorFlow

  • Programming experience in Python and C++

  • Deep understanding of policy optimization, generalization, and environment design

  • Experience working in Linux development environments and with GPU-based training pipelines

  • Excellent debugging skills across ML, software, and hardware stacks

  • Ability to independently manage experiments and rapidly iterate on model architectures

Preferred Experience (Bonus)
  • Deployment of RL systems to real-world robots, especially legged or humanoid platforms

  • Contributions to open-source RL frameworks or robotics middleware (e.g., ROS, Isaac ROS)

  • Experience with imitation learning, behavior cloning, or inverse reinforcement learning

  • Prior research/publications in reinforcement learning, multi-agent systems, or robotic control

  • Familiarity with low-level robot interfaces, sensor fusion, or control loop tuning

  • Knowledge of real-time systems, embedded software, or custom actuator control

Job Details
  • Location: Cambridge, Ontario

  • Work Environment: In-person (on-site at our Waterloo facility)

  • Type: Full-time

  • Compensation: Competitive salary (based on experience)

  • Health Insurance: Provided

  • Growth: Regular performance evaluations with potential for salary increases and stock option participation


