Staff Research Engineer: Accelerate LLM Inference
3 weeks ago
A leading AI research firm in Montreal is seeking a Staff Research Engineer to enhance model efficiency and optimize inference for large language models. In this full-time role, you will develop techniques to improve performance while maintaining model quality. The ideal candidate will hold a PhD in Machine Learning, with strong software engineering skills and experience in AI research. The position offers a collaborative remote-friendly environment along with numerous benefits.
#J-18808-Ljbffr
-
Staff Research Engineer — LLM Inference
3 weeks ago
Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Cohere Full timeA leading AI research company in Toronto is looking for a Staff Research Engineer, Model Efficiency to enhance the performance of large language models. This full-time position requires a PhD in Machine Learning and strong software engineering skills. The ideal candidate will develop and deploy techniques to improve model inference efficiency while enjoying...
-
Senior AI Research Engineer, Model Inference
3 weeks ago
Montreal, Toronto, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Tether Operations Limited Full timeJoin Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting‑edge solutions empower businesses—from exchanges and wallets to payment processors and ATMs—to seamlessly integrate reserve‑backed tokens across blockchains. By harnessing the power of...
-
LLM Serving Engineer
1 week ago
Markham, Canada Qualcomm Full timeRole Overview LLM Serving Engineer (Cloud AI Engineering) – Senior / Staff Engineer at Qualcomm Technologies, Inc. We are building a scalable LLM inference platform that spans from research to commercial deployment. The role spans the full product lifecycle and requires strategic thinking, strong execution, and excellent communication skills....
-
LLM Serving Engineer
1 week ago
Markham, Canada Qualcomm Full timeRole Overview LLM Serving Engineer (Cloud AI Engineering) – Senior / Staff Engineer at Qualcomm Technologies, Inc. We are building a scalable LLM inference platform that spans from research to commercial deployment. The role spans the full product lifecycle and requires strategic thinking, strong execution, and excellent communication skills....
-
LLM Serving Engineer
1 week ago
Markham, Canada Qualcomm Full timeRole Overview LLM Serving Engineer (Cloud AI Engineering) – Senior / Staff Engineer at Qualcomm Technologies, Inc. We are building a scalable LLM inference platform that spans from research to commercial deployment. The role spans the full product lifecycle and requires strategic thinking, strong execution, and excellent communication skills....
-
Staff Research Engineer, Model Efficiency
3 weeks ago
Toronto, Canada Cohere Full timeStaff Research Engineer, Model Efficiency Who are we? Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences such as content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the...
-
Staff Research Engineer, Model Efficiency
3 weeks ago
Toronto, Canada Cohere Full timeStaff Research Engineer, Model Efficiency Who are we? Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences such as content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the...
-
Staff Research Engineer, Model Efficiency
3 weeks ago
Toronto, Canada Cohere Full timeStaff Research Engineer, Model Efficiency Who are we? Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences such as content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the...
-
Senior Researcher
3 weeks ago
Montreal, Canada Huawei Full timeAbout the team Huawei Canada has an immediate permanent opening for a Senior Researcher. Noah’s Ark lab has evolved into a prominent research organization with notable achievements in academia and industry. The lab’s mission focuses on advancing artificial intelligence and related fields to benefit the company and society. Driven by impactful, long-term...
-
Staff Research Engineer — LLM Inference
3 weeks ago
Toronto, Canada Cohere Full timeA leading AI research company in Toronto is looking for a Staff Research Engineer, Model Efficiency to enhance the performance of large language models. This full-time position requires a PhD in Machine Learning and strong software engineering skills. The ideal candidate will develop and deploy techniques to improve model inference efficiency while enjoying...