Senior ML Quality Engineer

12 hours ago


Old Toronto, Canada Cerebras Systems Inc. Full time

Cerebras has developed a radically new chip and system to dramatically accelerate deep learning applications. Our system runs training and inference workloads orders of magnitude faster than contemporary machines, fundamentally changing the way ML researchers work and pursue AI innovation.

We are innovating at every level of the stack – from chip, to microcode, to power delivery and cooling, to new algorithms and network architectures at the cutting edge of ML research. Our fully-integrated system delivers unprecedented performance because it is built from the ground up for deep learning workloads.

About The Role

As Senior ML Quality Engineer, you ensure quality of Cerebras SW across all supported ML workloads and workflows. You will be part of MIQ (ML Integration and Quality) team that will focus on SW components feature testing, ML training accuracy and performance, pre deployment/production validation, validating customer workloads and workflows.

As part of this role, you will influence the best testing practice, good debugging methodology, effective cross team communication and advocate for world-class products.

Responsibilities
  • Drive quality of various software and hardware components of Cerebras solution to ensure accuracy, performance and usability of model trainings.
  • Bring good testing methodology, effective communication and strong debugging skills to the team.
  • Demand the highest quality from all components within the Cerebras environment.
  • Ability to automate workflows, setup testbeds and build tools to effectively monitor and debug issues.
  • Implement creative ways to break Cerebras software and identify potential problems.
  • Break down complex tasks into smaller tasks. Be a problem solver. Be a thought leader.
  • Ability to work in a fast-paced environment and make the necessary prioritizations and judgements which affects productivity at a company level.
Minimum Qualifications
  • 3-7 years of relevant industry experience in Software quality and testing areas.
  • Experience testing AI/ML models and evaluation of the model quality.
  • Stong automation and programming skills using one or more programming languages like Python, C++ or go.
  • Experience in testing compute/machine learning/networking/storage systems within a large-scale enterprise environment.
  • Experience in debugging issues across scale out deployment.
  • Experience in putting together thorough test-plans.
  • Experience working effectively across teams, including product development, product management, customer operations, and field teams.
Preferred Skills
  • Knowledge of ML workflows and frameworks.
  • Knowledge of basic storage and networking protocols.
  • Hands-on experience with training LLMs.
  • Hands-on experience working with containers, Kubernetes.
Why Join Cerebras

People who are serious about software make their own hardware. At Cerebras we have built a breakthrough architecture that is unlocking new opportunities for the AI industry. With dozens of model releases and rapid growth, we’ve reached an inflection point in our business. Members of our team tell us there are five main reasons they joined Cerebras:

  • Build a breakthrough AI platform beyond the constraints of the GPU
  • Publish and open source their cutting-edge AI research
  • Work on one of the fastest AI supercomputers in the world
  • Enjoy job stability with startup vitality
  • Our simple, non-corporate work culture that respects individual beliefs
Apply today and become part of the forefront of groundbreaking advancements in AI.

Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer. We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies. We try every day to build a work environment that empowers people to do their best work through continuous learning, growth and support of those around them.

This website or its third-party tools process personal data. For more details, click here to review our CCPA disclosure notice.

Apply for this Job

* Required

Candidate saved successfully

Functional Functional Always active The technical storage or access is strictly necessary for the legitimate purpose of enabling the use of a specific service explicitly requested by the subscriber or user, or for the sole purpose of carrying out the transmission of a communication over an electronic communications network. Preferences Preferences The technical storage or access is necessary for the legitimate purpose of storing preferences that are not requested by the subscriber or user. Statistics Statistics The technical storage or access that is used exclusively for statistical purposes. The technical storage or access that is used exclusively for anonymous statistical purposes. Without a subpoena, voluntary compliance on the part of your Internet Service Provider, or additional records from a third party, information stored or retrieved for this purpose alone cannot usually be used to identify you. Marketing Marketing The technical storage or access is required to create user profiles to send advertising, or to track the user on a website or across several websites for similar marketing purposes.

#J-18808-Ljbffr

  • Old Toronto, Ontario, Canada Cerebras Systems Inc. Full time

    About the RoleCerebras Systems Inc. is revolutionizing the field of artificial intelligence with its cutting-edge technology. As a Senior ML Quality Engineer, you will play a crucial role in ensuring the quality of our software across various machine learning workloads and workflows.ResponsibilitiesDrive quality across software and hardware components to...


  • Old Toronto, Ontario, Canada Cerebras Systems Inc. Full time

    About the RoleCerebras Systems Inc. is revolutionizing the field of artificial intelligence with its cutting-edge technology. As a Senior ML Quality Engineer, you will play a crucial role in ensuring the quality of our software across various machine learning workloads and workflows.ResponsibilitiesDrive quality across software and hardware components to...


  • Old Toronto, Canada Zs Associates Full time

    As a management consulting and technology firm focused on transforming global healthcare and beyond, our most valuable asset is our people. We partner collaboratively with our clients to develop products that create value and deliver company results across critical areas of their business including portfolio strategy, customer insights, research and...


  • Old Toronto, Ontario, Canada S I Systems Full time

    Senior Machine Learning Engineer (Python) - Edge ServicesS.i. Systems is seeking a highly skilled Senior Machine Learning Engineer (Python) to lead the development of edge services using TensorFlow, PyTorch, or onnx.Full-time, permanent roleKey Responsibilities:Collaborate with development and product teams to design, develop, and maintain scalable,...


  • Old Toronto, Ontario, Canada S I Systems Full time

    Senior Machine Learning Engineer (Python) - Edge ServicesS.i. Systems is seeking a highly skilled Senior Machine Learning Engineer (Python) to lead the development of edge services using TensorFlow, PyTorch, or onnx.Full-time, permanent roleKey Responsibilities:Collaborate with development and product teams to design, develop, and maintain scalable,...


  • Old Toronto, Ontario, Canada Albin Engineering Services, Inc. Full time

    Job SummaryAlbin Engineering Services, Inc. is seeking a highly experienced Senior Principal Engineer System Architect - AI/ML to lead the development of complex AI/ML infrastructure platform solutions. This is a 100% remote position that requires international travel, approximately 4-6 trips per year.Key ResponsibilitiesDefine and Implement AI/ML...


  • Old Toronto, Ontario, Canada Albin Engineering Services, Inc. Full time

    Job SummaryAlbin Engineering Services, Inc. is seeking a highly experienced Senior Principal Engineer System Architect - AI/ML to lead the development of complex AI/ML infrastructure platform solutions. This is a 100% remote position that requires international travel, approximately 4-6 trips per year.Key ResponsibilitiesDefine and Implement AI/ML...


  • Toronto, Ontario, Canada Voxel Full time $200,000 - $250,000

    About VoxelVoxel is a pioneering technology company that is revolutionizing workplace safety and operations with cutting-edge AI and computer vision solutions. Our mission is to protect essential workers and prevent workplace incidents by providing innovative, data-driven insights to safety and operations leaders.Job SummaryWe are seeking a highly skilled...


  • Toronto, Ontario, Canada Voxel Full time $200,000 - $250,000

    About VoxelVoxel is a pioneering technology company that is revolutionizing workplace safety and operations with cutting-edge AI and computer vision solutions. Our mission is to protect essential workers and prevent workplace incidents by providing innovative, data-driven insights to safety and operations leaders.Job SummaryWe are seeking a highly skilled...

  • Lead Product Engineer

    1 month ago


    Old Toronto, Canada Fusemachines Full time

    ```html About Fusemachines Fusemachines is a leading AI strategy, talent, and education services and products provider. Founded by Sameer Maskey Ph.D., Adjunct Associate Professor at Columbia University, Fusemachines has a core mission of democratizing AI. With a presence in 4 countries (Nepal, United States, Canada, and Dominican Republic) and more than 400...

  • Lead Product Engineer

    2 months ago


    Old Toronto, Canada Fusemachines Full time

    ```html About Fusemachines Fusemachines is a leading AI strategy, talent, and education services and products provider. Founded by Sameer Maskey Ph.D., Adjunct Associate Professor at Columbia University, Fusemachines has a core mission of democratizing AI. With a presence in 4 countries (Nepal, United States, Canada, and Dominican Republic) and more than 400...

  • Lead Product Engineer

    2 months ago


    Old Toronto, Canada Fusemachines Full time

    ```html About Fusemachines Fusemachines is a leading AI strategy, talent, and education services and products provider. Founded by Sameer Maskey Ph.D., Adjunct Associate Professor at Columbia University, Fusemachines has a core mission of democratizing AI. With a presence in 4 countries (Nepal, United States, Canada, and Dominican Republic) and more than 400...


  • Toronto, Ontario, Canada Thomson Reuters Full time

    About the RoleWe are seeking a passionate Senior Research Engineer who will bring expertise in AI and ML and is interested in building data-driven capabilities that drive transformation.Key ResponsibilitiesDevelop and implement innovative AI and ML solutions to drive business growth and improvement.Collaborate with cross-functional teams to design and...


  • Toronto, Ontario, Canada Thomson Reuters Full time

    About the RoleWe are seeking a passionate Senior Research Engineer who will bring expertise in AI and ML and is interested in building data-driven capabilities that drive transformation.Key ResponsibilitiesDevelop and implement innovative AI and ML solutions to drive business growth and improvement.Collaborate with cross-functional teams to design and...


  • Old Toronto, Canada TD Full time

    Lieu de travail:CanadaHoraire:37.5Secteur d’activité:Données et AnalysesDétails de la rémunération :Nous avons à cœur d’offrir une rémunération juste et équitable à tous nos collègues. En votre qualité de candidat ou de candidate, nous vous encourageons à avoir une conversation franche avec votre recruteur et à poser des questions sur la...


  • Old Toronto, Canada TD Full time

    Lieu de travail:CanadaHoraire:37.5Secteur d’activité:Données et AnalysesDétails de la rémunération :Nous avons à cœur d’offrir une rémunération juste et équitable à tous nos collègues. En votre qualité de candidat ou de candidate, nous vous encourageons à avoir une conversation franche avec votre recruteur et à poser des questions sur la...


  • Old Toronto, Ontario, Canada Albin Engineering Services, Inc. Full time

    Job SummaryWe are seeking a highly experienced Senior Principal Engineer to lead our AI/ML infrastructure platform solutions team. As a key member of our engineering team, you will be responsible for defining complex AI/ML infrastructure platform solutions and leading a multi-disciplined team to bring them to implementation.Key ResponsibilitiesDefine AI/ML...


  • Old Toronto, Ontario, Canada Albin Engineering Services, Inc. Full time

    Job SummaryWe are seeking a highly experienced Senior Principal Engineer to lead our AI/ML infrastructure platform solutions team. As a key member of our engineering team, you will be responsible for defining complex AI/ML infrastructure platform solutions and leading a multi-disciplined team to bring them to implementation.Key ResponsibilitiesDefine AI/ML...


  • Toronto, Ontario, Canada Thomson Reuters Full time

    About the RoleWe are seeking a highly skilled Senior Research Engineer to join our team at Thomson Reuters Labs in Toronto, Canada. As a member of our interdisciplinary team, you will have the opportunity to work on cutting-edge AI and ML projects that drive transformation in our company.Key ResponsibilitiesDevelop and deliver high-quality software solutions...


  • Toronto, Ontario, Canada Thomson Reuters Full time

    About the RoleWe are seeking a highly skilled Senior Research Engineer to join our team at Thomson Reuters Labs in Toronto, Canada. As a member of our interdisciplinary team, you will have the opportunity to work on cutting-edge AI and ML projects that drive transformation in our company.Key ResponsibilitiesDevelop and deliver high-quality software solutions...