Senior ML Inference Engineer — Model Efficiency

3 weeks ago


Montreal Toronto Calgary Vancouver Edmonton Old Toronto Ottawa Mississauga Quebec Winnipeg Halifax Saskatoon Burnaby Hamilton Victoria Surrey Halton Hills London Regina Markham Brampton Vaughan Kelowna Laval Southwestern Ontario R, Canada Cohere Full time

A leading AI technology company is seeking a Member of Technical Staff to enhance model efficiency. This role involves improving performance metrics, optimizing bottlenecks, and collaborating with various teams. The ideal candidate has 5+ years in high-performance coding, strong skills in C++ or Python, and familiarity with large language models. Competitive perks include a flexible work environment, health benefits, and generous vacation time.
#J-18808-Ljbffr



  • Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Cohere Full time

    A leading AI research company in Toronto is looking for a Staff Research Engineer, Model Efficiency to enhance the performance of large language models. This full-time position requires a PhD in Machine Learning and strong software engineering skills. The ideal candidate will develop and deploy techniques to improve model inference efficiency while enjoying...


  • Montreal, Toronto, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, London, Halton Hills, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Cohere Full time

    A leading AI firm is seeking an Audio Inference Engineer to optimize machine learning audio systems. This role involves advancing audio model metrics like latency and throughput, ensuring seamless integration between model development and deployment. Ideal candidates have experience with audio inference systems, programming in C++ and Python, and a strong...


  • Montreal, Toronto, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Cohere Full time

    Member of Technical Staff, Model Efficiency 1 day ago Be among the first 25 applicants Get AI-powered advice on this job and more exclusive features. Who are we? Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like...


  • Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Surrey, Victoria, London, Halton Hills, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada PlanHub Full time

    Senior Full Stack Engineer - AI/ML Productization Join to apply for the Senior Full Stack Engineer - AI/ML Productization role at PlanHub PlanHub is the leading pre‑construction SaaS platform and marketplace helping general contractors, subcontractors, and suppliers connect and grow their businesses. Built with tradespeople in mind, PlanHub is designed...


  • Montreal, Canada Cohere Full time

    Join to apply for the Audio Inference Engineer, Model Efficiency role at Cohere. Who are we? Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that...

  • Senior ML Engineer

    4 weeks ago


    Mississauga, Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada vaga para Senior ML Engineer, Recommendation Systems na Launch Potato Full time

    A profitable digital media company is looking for a Senior Machine Learning Engineer specializing in Recommendation Systems. In this role, you will design and optimize ML systems that deliver real-time recommendations across millions of users. Candidates should have a strong background in ranking algorithms, experience with large-scale ML deployments, and...


  • Toronto, Canada Cohere Full time

    Join to apply for the Audio Inference Engineer, Model Efficiency role at Cohere. Who are we? Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences such as content generation, semantic search, RAG, and agents. We believe...


  • Toronto, Canada Cohere Full time

    Join to apply for the Audio Inference Engineer, Model Efficiency role at Cohere. Who are we? Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences such as content generation, semantic search, RAG, and agents. We believe...


  • Toronto, Canada Cohere Full time

    Join to apply for the Audio Inference Engineer, Model Efficiency role at Cohere . Who are we? Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences such as content generation, semantic search, RAG, and agents. We believe...


  • Montreal, Toronto, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Tether Operations Limited Full time

    Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting‑edge solutions empower businesses—from exchanges and wallets to payment processors and ATMs—to seamlessly integrate reserve‑backed tokens across blockchains. By harnessing the power of...