Staff Research Engineer: Accelerate LLM Inference

3 weeks ago


Montreal, Toronto, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Cohere Full time

A leading AI research firm in Montreal is seeking a Staff Research Engineer to enhance model efficiency and optimize inference for large language models. In this full-time role, you will develop techniques to improve performance while maintaining model quality. The ideal candidate will hold a PhD in Machine Learning, with strong software engineering skills and experience in AI research. The position offers a collaborative remote-friendly environment along with numerous benefits.



  • Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Cohere Full time

    A leading AI research company in Toronto is looking for a Staff Research Engineer, Model Efficiency to enhance the performance of large language models. This full-time position requires a PhD in Machine Learning and strong software engineering skills. The ideal candidate will develop and deploy techniques to improve model inference efficiency while enjoying...


  • Montreal, Toronto, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Tether Operations Limited Full time

    Join Tether and Shape the Future of Digital Finance. At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses, from exchanges and wallets to payment processors and ATMs, to seamlessly integrate reserve-backed tokens across blockchains. By harnessing the power of...


  • Markham, Canada Qualcomm Full time

    Role Overview: LLM Serving Engineer (Cloud AI Engineering) – Senior / Staff Engineer at Qualcomm Technologies, Inc. We are building a scalable LLM inference platform that spans from research to commercial deployment. The role spans the full product lifecycle and requires strategic thinking, strong execution, and excellent communication skills....


  • Toronto, Canada Cohere Full time

    Staff Research Engineer, Model Efficiency. Who are we? Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences such as content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the...


  • Senior Researcher

    3 weeks ago


    Montreal, Canada Huawei Full time

    About the team: Huawei Canada has an immediate permanent opening for a Senior Researcher. Noah’s Ark lab has evolved into a prominent research organization with notable achievements in academia and industry. The lab’s mission focuses on advancing artificial intelligence and related fields to benefit the company and society. Driven by impactful, long-term...


  • Toronto, Canada Cohere Full time

    A leading AI research company in Toronto is looking for a Staff Research Engineer, Model Efficiency to enhance the performance of large language models. This full-time position requires a PhD in Machine Learning and strong software engineering skills. The ideal candidate will develop and deploy techniques to improve model inference efficiency while enjoying...