Machine Learning Engineer

4 weeks ago


Toronto Montreal Calgary Vancouver Edmonton Old Toronto Ottawa Mississauga Quebec Winnipeg Halifax Saskatoon Burnaby Hamilton Victoria Halton Hills Surrey London Regina Markham Brampton Vaughan Kelowna Laval Southwestern Ontario W, Canada Bagel Labs Full time

We are Bagel Labs - a distributed machine learning research lab working towards open-source superintelligence. We ignore years of experience and pedigree. If you have high agency - meaning your default assumption is that you can control the outcome of whatever situation you are in - we want to hear from you. Every requirement below is flexible for a candidate with high enough agency and tolerance for ambiguity.OverviewYou will design and optimize a distributed diffusion model training and serving system. Your focus is on building scalable, fault-tolerant infrastructure that can serve open-source diffusion models across multiple nodes and regions, with efficient support for adaptation techniques.Key ResponsibilitiesDesign and implement distributed diffusion model inference systems for image, video, and multimodal generation across multiple nodes and regions.Architect high-availability clusters for diffusion model serving with automatic failover, load balancing, and dynamic batching for variable-resolution outputs.Build monitoring and observability systems for distributed diffusion inference (denoising steps, memory usage, generation latency, CLIP score tracking).Integrate with open-source diffusion frameworks (Diffusers, ComfyUI, Invoke AI) and optimize for production-scale serving.Implement and optimize cutting-edge techniques: rectified flow models, consistency distillation, and progressive distillation for few-step generation.Design distributed systems for ControlNet, IP-Adapter, and multi-modal conditioning at scale.Build infrastructure for efficient LoRA/LyCORIS adaptation serving with hot-swapping and memory-efficient merging.Optimize VAE decoding pipelines and implement tiled/windowed generation for ultra-high-resolution outputs.Document architectural decisions, review code, and publish technical deep-dives on blog.bagel.com.Who You Might BeYou have a deep understanding of distributed systems and diffusion model architectures. You\'re excited about the rapid evolution from DDPM to flow matching and consistency models. You enjoy architecting scalable infrastructure that can handle the unique challenges of diffusion models - from variable compute requirements per timestep to efficient caching of intermediate states.Desired SkillsAt least 5 years of experience with distributed systems and production ML serving.Hands-on experience with diffusion model frameworks (Diffusers, ComfyUI, or similar) in production environments.Deep understanding of diffusion model architectures (U-Net, DiT, rectified flows, consistency models).Experience with distributed GPU orchestration for high-memory workloads.Proven record of optimizing generation latency (classifier-free guidance, DDIM/DPM solvers, distillation techniques).Experience with attention optimization techniques (Flash Attention, xFormers, memory-efficient attention).Strong understanding of adaptation techniques (LoRA, LyCORIS, textual inversion, DreamBooth).Expertise in handling variable-resolution generation and dynamic batching strategies.What We OfferTop of the market compensation.A deeply technical culture where bold, frontier ideas are debated, stress-tested, and built.Full remote flexibility within North American time zones.Ownership of work that can set the direction for decentralized AI.Paid travel opportunities to the top ML conferences around the world.Please apply via our careers page. Note: we do not share application links here. #J-18808-Ljbffr



  • Oakville, Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern, Canada Skip Full time

    Join to apply for the Machine Learning Engineer role at Skip Ready for a challenge? Then Just Eat Takeaway.com might be the place for you. We’re a leading global online food delivery platform, and our vision is to empower everyday convenience. Whether it’s a Friday-night feast, a post-gym poke bowl, or grabbing some groceries, our tech platform connects...


  • Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Surrey, Victoria, London, Halton Hills, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Mercor Full time

    Machine Learning Engineer 2 days ago Be among the first 25 applicants This range is provided by Mercor. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Base pay range $14.00/hr - $14.00/hr Direct message the job poster from Mercor About The Job Mercor connects elite creative and technical talent with...


  • Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada The Aviation Agency Full time

    Join to apply for the Machine Learning Engineer role at The Aviation Agency About The Role You’ll build, train, and optimize predictive models that power smarter, faster advertising campaigns. Your work will help us anticipate audience behavior and deliver the right message at the right time. Responsibilities Develop machine learning models for ad...


  • Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Mercor Full time

    Machine Learning Engineer 2 days ago Be among the first 25 applicants Get AI-powered advice on this job and more exclusive features. This range is provided by Mercor. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Base pay range $14.00/hr - $14.00/hr About The Job Mercor connects elite creative and...


  • Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Surrey, Victoria, London, Halton Hills, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada theScore Full time

    is North America’s leading provider of integrated entertainment, sports content, and casino gaming experiences. From casinos and racetracks to online gaming, sports betting and entertainment content, we deliver the experiences people want, how and where they want them. We’re always on the lookout for those who are passionate about creating and delivering...


  • Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Surrey, Victoria, London, Halton Hills, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Mercor Full time

    1 day ago Be among the first 25 applicants This range is provided by Mercor. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Base pay range $14.00/hr - $14.00/hr Direct message the job poster from Mercor About The Job Mercor connects elite creative and technical talent with leading AI research labs....


  • Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Surrey, Victoria, London, Halton Hills, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Calix Full time

    Calix provides the cloud, software platforms, systems and services required for communications service providers to simplify their businesses, excite their subscribers and grow their value. This is a remote-based position that can be located anywhere in the United States or Canada. Our Products Team is growing and we're looking for a highly skilled Senior...


  • Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada RAD Intel Full time

    Join to apply for the Machine Learning Engineer role at RAD Intel About RAD RAD Intel is building the future of AI-powered growth. We’re a fast-scaling company backed by 10,000+ investors and $50M+ raised, with a mission to reinvent how businesses grow through our AIBO (Artificial Intelligence Buy-Out) strategy. RAD acquires and partners with agencies,...


  • Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Mercor Full time

    Machine Learning Engineer Position: Machine Learning Engineer Type: Hourly contractor Compensation: $14/hour Location: Remote Commitment: 20–40 hours/week Base pay range: $14.00/hr - $14.00/hr About the company: Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include...


  • Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Thoughtworks Full time

    Overview Senior Machine Learning Engineers at Thoughtworks build, maintain and test the architecture and infrastructure for managing machine learning applications. They support and contribute to the design of end-to-end applications and products. They are responsible for building core capabilities including technical and functional machine learning systems...