Staff Research Engineer — LLM Inference
3 weeks ago
A leading AI research company in Toronto is looking for a Staff Research Engineer, Model Efficiency to enhance the performance of large language models. This full-time position requires a PhD in Machine Learning and strong software engineering skills. The ideal candidate will develop and deploy techniques to improve model inference efficiency while enjoying a remote-friendly work environment with generous perks.
#J-18808-Ljbffr
-
Staff Research Engineer — LLM Inference
3 weeks ago
Toronto, Canada Cohere Full timeA leading AI research company in Toronto is looking for a Staff Research Engineer, Model Efficiency to enhance the performance of large language models. This full-time position requires a PhD in Machine Learning and strong software engineering skills. The ideal candidate will develop and deploy techniques to improve model inference efficiency while enjoying...
-
Montreal, Toronto, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Cohere Full timeA leading AI research firm in Montreal is seeking a Staff Research Engineer to enhance model efficiency and optimize inference for large language models. In this full-time role, you will develop techniques to improve performance while maintaining model quality. The ideal candidate will hold a PhD in Machine Learning, with strong software engineering skills...
-
Staff Research Engineer — LLM Inference
3 weeks ago
Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Cohere Full timeA leading AI research company in Toronto is looking for a Staff Research Engineer, Model Efficiency to enhance the performance of large language models. This full-time position requires a PhD in Machine Learning and strong software engineering skills. The ideal candidate will develop and deploy techniques to improve model inference efficiency while enjoying...
-
Staff Research Engineer, Model Efficiency
3 weeks ago
Toronto, Canada Cohere Full timeStaff Research Engineer, Model Efficiency Who are we? Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences such as content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the...
-
Staff Research Engineer, Model Efficiency
3 weeks ago
Toronto, Canada Cohere Full timeStaff Research Engineer, Model Efficiency Who are we? Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences such as content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the...
-
Staff Research Engineer, Model Efficiency
3 weeks ago
Toronto, Canada Cohere Full timeStaff Research Engineer, Model Efficiency Who are we? Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences such as content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the...
-
Toronto, Canada Cerebras Systems Full timeA leading AI technology firm in Toronto is seeking a technical engineering leader for its Inference Service Platform. The role involves leading a team to scale LLM inference and ensure operational excellence. Ideal candidates will have extensive experience in distributed systems and ML infrastructures. This position offers a unique opportunity to shape the...
-
Toronto, Canada Cerebras Systems Full timeA leading AI technology firm in Toronto is seeking a technical engineering leader for its Inference Service Platform. The role involves leading a team to scale LLM inference and ensure operational excellence. Ideal candidates will have extensive experience in distributed systems and ML infrastructures. This position offers a unique opportunity to shape the...
-
Toronto, Canada Cerebras Systems Full timeA leading AI technology firm in Toronto is seeking a technical engineering leader for its Inference Service Platform. The role involves leading a team to scale LLM inference and ensure operational excellence. Ideal candidates will have extensive experience in distributed systems and ML infrastructures. This position offers a unique opportunity to shape the...
-
Engineering Manager, Inference Platform
2 weeks ago
Toronto, Canada Cerebras Systems Full timeCerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to...