ML Systems Integration Engineer

3 weeks ago


Old Toronto, Ontario, Canada Cerebras Systems Full time
About the Role

Cerebras Systems is revolutionizing the field of artificial intelligence with its cutting-edge technology. As an ML Integration and Ops Engineer, you will play a crucial role in bringing together software and hardware components to make large-scale LLM model training simple and easy to use.

Key Responsibilities
  • Drive technical projects involving multiple teams and software/hardware components to simplify large-scale LLM model training.
  • Develop and implement effective integration methodologies, strong debugging skills, and excellent communication techniques.
  • Break down complex tasks into manageable parts and identify creative solutions to problems.
  • Automate workflows, testbed setups, and build tools to monitor and debug systems.
  • Contribute to developing software specifications with a focus on ML products.
Requirements
  • Master's degree in computer science or EE with 0-6 years of industry experience.
  • Experience in product validation for compute/machine learning/networking/storage systems within a large-scale enterprise environment.
  • Strong automation and programming skills using Python, C++, or Go.
  • Knowledge of software system design and ML workflows and frameworks like TensorFlow or PyTorch.
Preferred Qualifications
  • Hands-on experience with training LLMs and working with container, Kubernetes.
  • Experience in driving projects across multiple teams.
Why Cerebras Systems

Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer. We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies. We try every day to build a work environment that empowers people to do their best work through continuous learning, growth, and support of those around them.



  • Old Toronto, Ontario, Canada Cerebras Systems Full time

    About The RoleCerebras Systems is revolutionizing the field of machine learning with its cutting-edge technology. As an MTS (ML Integration and Ops Engineer), you will play a crucial role in bringing together software and hardware components to make large-scale LLM model training simple and easy to use.You will be part of the MIQ (ML Integration and Quality)...


  • Old Toronto, Ontario, Canada Albin Engineering Services, Inc. Full time

    Job SummaryAlbin Engineering Services, Inc. is seeking a highly experienced Senior Principal Engineer System Architect - AI/ML to lead the development of complex AI/ML infrastructure platform solutions. This is a 100% remote position that requires international travel, approximately 4-6 trips per year.Key ResponsibilitiesDefine and Implement AI/ML...


  • Old Toronto, Ontario, Canada Albin Engineering Services, Inc. Full time

    Job SummaryAlbin Engineering Services, Inc. is seeking a highly experienced Senior Principal Engineer System Architect - AI/ML to lead the development of complex AI/ML infrastructure platform solutions. This is a 100% remote position that requires international travel, approximately 4-6 trips per year.Key ResponsibilitiesDefine and Implement AI/ML...


  • Old Toronto, Ontario, Canada Albin Engineering Services, Inc. Full time

    Job Title: Senior Principal Engineer - AI/ML System ArchitectAlbin Engineering Services, Inc. is seeking a highly experienced Senior Principal Engineer - AI/ML System Architect to lead the design and development of complex AI/ML infrastructure platform solutions. This is a 100% remote position with opportunities for international travel.Key...


  • Old Toronto, Ontario, Canada Albin Engineering Services, Inc. Full time

    Job Title: Sr Principal Engineer - AI/ML System ArchitectAlbin Engineering Services, Inc. is seeking a highly experienced Sr Principal Engineer - AI/ML System Architect to join our team. This is a 100% remote position with opportunities for international travel.Job Responsibilities:Develop and lead the design of complex AI/ML infrastructure platform...


  • Old Toronto, Ontario, Canada Albin Engineering Services, Inc. Full time

    Job Title: Sr Principal Engineer - AI/ML System ArchitectAlbin Engineering Services, Inc. is seeking a highly experienced Sr Principal Engineer - AI/ML System Architect to join our team. This is a 100% remote position with opportunities for international travel.Job Responsibilities:Develop and lead the design of complex AI/ML infrastructure platform...


  • Old Toronto, Ontario, Canada Albin Engineering Services, Inc. Full time

    Job Title: Sr Principal Engineer - AI/ML System ArchitectAlbin Engineering Services, Inc. is seeking a highly experienced Sr Principal Engineer - AI/ML System Architect to join our team. This is a 100% remote position with opportunities for international travel.Job Responsibilities:Develop and lead the design of complex AI/ML infrastructure platforms,...


  • Old Toronto, Ontario, Canada Albin Engineering Services, Inc. Full time

    Job Title: Sr Principal Engineer - AI/ML System ArchitectAlbin Engineering Services, Inc. is seeking a highly experienced Sr Principal Engineer - AI/ML System Architect to join our team. This is a 100% remote position with opportunities for international travel.Job Responsibilities:Develop and lead the design of complex AI/ML infrastructure platform...


  • Old Toronto, Ontario, Canada Albin Engineering Services, Inc. Full time

    Job Title: Sr Principal Engineer - AI/ML System ArchitectAlbin Engineering Services, Inc. is seeking a highly experienced Sr Principal Engineer - AI/ML System Architect to join our team. This is a 100% remote position with opportunities for international travel.Job Responsibilities:Develop and lead the design of complex AI/ML infrastructure platform...


  • Old Toronto, Ontario, Canada Albin Engineering Services, Inc. Full time

    Job Title: Sr Principal Engineer - AI/ML System ArchitectAlbin Engineering Services, Inc. is seeking a highly experienced Sr Principal Engineer - AI/ML System Architect to join our team. This is a 100% remote position with opportunities for international travel.Job Responsibilities:Develop and lead the design of complex AI/ML infrastructure platforms,...


  • Old Toronto, Ontario, Canada Albin Engineering Services, Inc. Full time

    Job Title: Sr Principal Engineer - AI/ML System ArchitectAlbin Engineering Services, Inc. is seeking a highly experienced Sr Principal Engineer - AI/ML System Architect to join our team. This is a 100% remote position with opportunities for international travel.Job Responsibilities:Develop and lead the design of complex AI/ML infrastructure platform...


  • Old Toronto, Ontario, Canada Albin Engineering Services, Inc. Full time

    Job Title: Sr Principal Engineer - AI/ML System ArchitectAlbin Engineering Services, Inc. is seeking a highly experienced Sr Principal Engineer - AI/ML System Architect to join our team. This is a 100% remote position with opportunities for international travel.Job Responsibilities:Develop and lead the design of complex AI/ML infrastructure platform...


  • Old Toronto, Ontario, Canada Albin Engineering Services, Inc. Full time

    Job SummaryWe are seeking a highly experienced Senior Principal Engineer to lead our AI/ML infrastructure platform solutions team. As a key member of our engineering team, you will be responsible for defining complex AI/ML infrastructure platform solutions and leading a multi-disciplined team to bring them to implementation.Key ResponsibilitiesDefine AI/ML...


  • Old Toronto, Ontario, Canada Albin Engineering Services, Inc. Full time

    Job SummaryWe are seeking a highly experienced Senior Principal Engineer to lead our AI/ML infrastructure platform solutions team. As a key member of our engineering team, you will be responsible for defining complex AI/ML infrastructure platform solutions and leading a multi-disciplined team to bring them to implementation.Key ResponsibilitiesDefine AI/ML...


  • Old Toronto, Ontario, Canada Albin Engineering Services, Inc. Full time

    Job Title: Senior Principal Engineer - AI/ML System ArchitectAlbin Engineering Services, Inc. is seeking a highly experienced Senior Principal Engineer - AI/ML System Architect to join our team. This is a 100% remote position, and as an employee, you will qualify for our full benefit package.Job Responsibilities:Develop and lead the design of complex AI/ML...


  • Old Toronto, Ontario, Canada Albin Engineering Services, Inc. Full time

    Job Title: Sr Principal Engineer - AI/ML System ArchitectAlbin Engineering Services, Inc. is seeking a highly experienced Sr Principal Engineer - AI/ML System Architect to join our team. This is a 100% remote position with opportunities for international travel.Job Responsibilities:Develop and lead the design of complex AI/ML infrastructure platforms and...


  • Old Toronto, Ontario, Canada Albin Engineering Services, Inc. Full time

    Job Title: Sr Principal Engineer - AI/ML System ArchitectAlbin Engineering Services, Inc. is seeking a highly experienced Sr Principal Engineer - AI/ML System Architect to join our team. This is a 100% remote position with opportunities for international travel.Job Responsibilities:Develop and lead the design of complex AI/ML infrastructure platforms and...


  • Old Toronto, Ontario, Canada Albin Engineering Services, Inc. Full time

    Job Title: Sr Principal EngineerAlbin Engineering Services, Inc. is seeking a highly experienced Sr Principal Engineer to lead the development of AI/ML infrastructure platform solutions. This is a 100% remote position with opportunities for international travel.Key Responsibilities:Develop and implement complex AI/ML infrastructure platform solutions,...


  • Old Toronto, Ontario, Canada Albin Engineering Services, Inc. Full time

    Job Title: Sr Principal EngineerAlbin Engineering Services, Inc. is seeking a highly experienced Sr Principal Engineer to lead the development of AI/ML infrastructure platform solutions. This is a 100% remote position with opportunities for international travel.Key Responsibilities:Develop and implement complex AI/ML infrastructure platform solutions,...


  • Old Toronto, Ontario, Canada Nexus Systems Group Inc. Full time

    Job DescriptionNexus Systems Group Inc. is seeking a highly skilled AI/ML Solution Architect to lead the development of cutting-edge AI and machine learning solutions.Key ResponsibilitiesCollaborate with cross-functional teams to design and implement AI and ML solutions that meet business requirements.Develop and maintain technical standards and guidelines...