Senior Machine Learning Engineer, Post Training

1 week ago


Toronto, Ontario, Canada Groq Full time
Senior Machine Learning Engineer, Post Training & Speculative Decoding
Mission: We are seeking a highly skilled Machine Learning Engineer to join our advanced model development team. This role focuses on pre-training, continued training, and post-training of models, with a particular emphasis on draft model optimization for speculative decoding and quantization-aware training (QAT). The ideal candidate has deep experience with training methodologies, open-weight models, and performance-tuning for inference.
Responsibilities & outcomes:
  • Lead pre-training and post-training efforts for draft models tailored to speculative decoding architectures.
  • Conduct continued training and post-training of open-weight models for non-draft (standard) inference scenarios.
  • Implement and optimize quantization-aware training pipelines to enable low-precision inference with minimal accuracy loss.
  • Collaborate with model architecture, inference, and systems teams to evaluate model readiness across training and deployment stages.
  • Develop tooling and evaluation metrics for training effectiveness, draft model fidelity, and speculative hit-rate optimization.
  • Contribute to experimental designs for novel training regimes and speculative decoding strategies.
Ideal candidates have/are:
  • 5+ years of experience in machine learning, with a strong focus on model training.
  • Proven experience with transformer-based architectures (e.g., LLaMA, Mistral, Gemma).
  • Deep understanding of speculative decoding and draft model usage.
  • Hands-on experience with quantization-aware training, including PyTorch QAT workflows or similar frameworks.
  • Familiarity with open-weight foundation models and continued/pre-training techniques.
  • Proficient in Python and ML frameworks such as PyTorch, JAX, or TensorFlow.
Preferred Qualifications:
  • Experience optimizing models for fast inference and sampling in production environments.
  • Exposure to distributed training, low-level kernel optimizations, and inference-time system constraints.
  • Publications or contributions to open-source ML projects.
Attributes of a Groqster:
  • Humility - Egos are checked at the door
  • Collaborative & Team Savvy - We make up the smartest person in the room, together
  • Growth & Giver Mindset - Learn it all versus know it all, we share knowledge generously
  • Curious & Innovative - Take a creative approach to projects, problems, and design
  • Passion, Grit, & Boldness - no limit thinking, fueling informed risk taking
If this sounds like you, we'd love to hear from you.
Compensation: At Groq, a competitive base salary is part of our comprehensive compensation package, which includes equity and benefits. For this role, salary range is determined by your location, skills, qualifications, experience and internal benchmarks. Compensation for candidates outside the USA will be dependent on the local market.

  • Toronto, Ontario, Canada Bloomberg Full time

    Location: Toronto. Business Area: Engineering and CTO. Ref #. Description & Requirements: Bloomberg's Engineering AI department comprises over 350 AI experts dedicated to building cutting edge, market-leading products. Leveraging advanced technologies including transformers, large language models, and dense vector databases, we are transforming search, discovery, and...


  • Toronto, Ontario, Canada Deep Genomics Full time

    About Us: Deep Genomics is at the forefront of using artificial intelligence to transform drug discovery. Our proprietary AI platform decodes the complexity of genome biology to identify novel drug targets, mechanisms, and genetic medicines inaccessible through traditional methods. With expertise spanning machine learning, bioinformatics, data science,...


  • Toronto, Ontario, Canada Workday Full time $180,000 - $270,000

    Your work days are brighter here. We're obsessed with making hard work pay off, for our people, our customers, and the world around us. As a Fortune 500 company and a leading AI platform for managing people, money, and agents, we're shaping the future of work so teams can reach their potential and focus on what matters most. The minute you join, you'll feel...


  • Toronto, Ontario, Canada RBC Full time

    Job Description: What's the opportunity? At RBC Borealis, you'll be joining a team of leading researchers and software engineers specializing in machine learning. You will have access to rich and massive datasets, and to computational resources to support novel product development touching machine learning areas such as generative AI, natural language...


  • Toronto, Ontario, Canada EvenUp Full time

    EvenUp is on a mission to close the justice gap using technology and AI. We empower personal injury lawyers and victims to get the justice they deserve. Our products enable law firms to secure faster settlements, higher payouts, and better outcomes for victims injured through no fault of their own in vehicle collisions, accidents, natural disasters, and...


  • Toronto, Ontario, Canada Aviva Full time $110,000 - $145,000

    Individually we are people, but together we are Aviva. Individually these are just words, but together they are our Values – Care, Commitment, Community, and Confidence. We are seeking a highly skilled and experienced Senior Machine Learning Engineer to join our AI/ML Platform team. The ideal candidate will have a strong background in designing, building,...


  • Toronto, Ontario, Canada Spait Infotech Private Limited Full time

    Job Summary: We are seeking a dynamic and innovative Machine Learning Engineer to join our cutting-edge data science team. In this role, you will harness the power of artificial intelligence and machine learning frameworks to develop, train, and deploy sophisticated models that drive strategic decision-making and enhance product offerings. Your expertise in...

