Engineering Manager, On-Prem LLM Inference Platform

2 days ago

Toronto, Canada Cerebras Systems Full time

A leading AI technology firm in Toronto is seeking a technical engineering leader for its Inference Service Platform. The role involves leading a team to scale LLM inference and ensure operational excellence. Ideal candidates will have extensive experience in distributed systems and ML infrastructure. This position offers a unique opportunity to shape the future of AI technology within a rapidly growing organization.

Engineering Manager, Inference Platform

2 days ago

Toronto, Canada Cerebras Systems Full time

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to...
Engineering Manager, Inference Platform

1 day ago

Toronto, Canada Cerebras Systems Full time

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to...
Engineering Manager, On-Prem LLM Inference Platform

1 day ago

Toronto, Canada Cerebras Systems Full time

A leading AI technology firm in Toronto is seeking a technical engineering leader for its Inference Service Platform. The role involves leading a team to scale LLM inference and ensure operational excellence. Ideal candidates will have extensive experience in distributed systems and ML infrastructure. This position offers a unique opportunity to shape the...
Deployment Engineer, AI Inference

5 days ago

Toronto, Canada Cerebras Systems Inc. Full time

About Cerebras Systems Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer‑scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of one device. This enables industry‑leading training and inference speeds and lets machine learning users run...
Deployment Engineer, AI Inference

2 weeks ago

Toronto, Canada Cerebras Systems Inc. Full time

About Cerebras Systems Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer‑scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of one device. This enables industry‑leading training and inference speeds and lets machine learning users run...
Sr. Inference ML Runtime Engineer

3 weeks ago

Toronto, Canada Cerebras Systems Full time

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to...
LLM Engineer

2 days ago

Toronto, Ontario, Canada Hire DigITalent Full time

Our client is looking for an LLM Engineer to help build and optimize multiple AI agent frameworks that streamline procurement, predict optimal options, and autonomously execute tasks on behalf of users. This is a 12-month contract with 3 days in office per week in downtown Toronto. As an integral part of our AI team, you will develop, fine-tune, and deploy...
LLM Engineer

1 day ago

Toronto, Canada Hire DigITalent Full time

Our client is looking for an LLM Engineer to help build and optimize multiple AI agent frameworks that streamline procurement, predict optimal options, and autonomously execute tasks on behalf of users. This is a 12-month contract with 3 days in office per week in downtown Toronto. As an integral part of our AI team, you will develop, fine-tune, and deploy...
ML Engineer — Open-Source LLM Inference

3 weeks ago

Toronto, Canada Red Hat, Inc. Full time

A leading open-source software company in Toronto is seeking a Machine Learning Engineer to innovate with vLLM systems. You will develop and maintain subsystems, ensuring efficient model performance and compliance. The ideal candidate has significant experience with Python, a strong grasp of LLM inference concepts, and excellent communication skills. This...
