Staff Research Engineer: Accelerate LLM Inference

3 weeks ago

Montreal, Toronto, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Cohere Full time

A leading AI research firm in Montreal is seeking a Staff Research Engineer to enhance model efficiency and optimize inference for large language models. In this full-time role, you will develop techniques to improve performance while maintaining model quality. The ideal candidate will hold a PhD in Machine Learning and have strong software engineering skills and experience in AI research. The position offers a collaborative, remote-friendly environment along with numerous benefits.

Staff Research Engineer — LLM Inference

3 weeks ago

Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Cohere Full time

A leading AI research company in Toronto is looking for a Staff Research Engineer, Model Efficiency to enhance the performance of large language models. This full-time position requires a PhD in Machine Learning and strong software engineering skills. The ideal candidate will develop and deploy techniques to improve model inference efficiency while enjoying...
Senior AI Research Engineer, Model Inference

3 weeks ago

Montreal, Toronto, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Tether Operations Limited Full time

Join Tether and Shape the Future of Digital Finance. At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting‑edge solutions empower businesses, from exchanges and wallets to payment processors and ATMs, to seamlessly integrate reserve‑backed tokens across blockchains. By harnessing the power of...
LLM Serving Engineer

1 week ago

Markham, Canada Qualcomm Full time

Role Overview: LLM Serving Engineer (Cloud AI Engineering) – Senior / Staff Engineer at Qualcomm Technologies, Inc. We are building a scalable LLM inference platform that spans from research to commercial deployment. The role covers the full product lifecycle and requires strategic thinking, strong execution, and excellent communication skills....
Staff Research Engineer, Model Efficiency

3 weeks ago

Toronto, Canada Cohere Full time

Who are we? Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences such as content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the...
Senior Researcher

3 weeks ago

Montreal, Canada Huawei Full time

About the team: Huawei Canada has an immediate permanent opening for a Senior Researcher. Noah’s Ark Lab has evolved into a prominent research organization with notable achievements in academia and industry. The lab’s mission focuses on advancing artificial intelligence and related fields to benefit the company and society. Driven by impactful, long-term...
Staff Research Engineer — LLM Inference

3 weeks ago

Toronto, Canada Cohere Full time

A leading AI research company in Toronto is looking for a Staff Research Engineer, Model Efficiency to enhance the performance of large language models. This full-time position requires a PhD in Machine Learning and strong software engineering skills. The ideal candidate will develop and deploy techniques to improve model inference efficiency while enjoying...
