Engineering Manager, On-Prem LLM Inference Platform
2 weeks ago
A leading AI technology firm in Toronto is seeking a technical engineering leader for its Inference Service Platform. The role involves leading a team to scale LLM inference and ensure operational excellence. Ideal candidates will have extensive experience in distributed systems and ML infrastructures. This position offers a unique opportunity to shape the future of AI technology within a rapidly growing organization.
#J-18808-Ljbffr
-
Engineering Manager, Inference Platform
2 weeks ago
Toronto, Canada Cerebras Systems Full timeCerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to...
-
Engineering Manager, Inference Platform
2 weeks ago
Toronto, Canada Cerebras Systems Full timeCerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to...
-
Engineering Manager, Inference Platform
2 weeks ago
Toronto, Canada Cerebras Systems Full timeCerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to...
-
Toronto, Canada Cerebras Systems Full timeA leading AI technology firm in Toronto is seeking a technical engineering leader for its Inference Service Platform. The role involves leading a team to scale LLM inference and ensure operational excellence. Ideal candidates will have extensive experience in distributed systems and ML infrastructures. This position offers a unique opportunity to shape the...
-
Toronto, Canada Cerebras Systems Full timeA leading AI technology firm in Toronto is seeking a technical engineering leader for its Inference Service Platform. The role involves leading a team to scale LLM inference and ensure operational excellence. Ideal candidates will have extensive experience in distributed systems and ML infrastructures. This position offers a unique opportunity to shape the...
-
Deployment Engineer, AI Inference
4 weeks ago
Toronto, Canada Cerebras Systems Inc. Full timeAbout Cerebras Systems Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer‑scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of one device. This enables industry‑leading training and inference speeds and lets machine learning users run...
-
Deployment Engineer, AI Inference
3 weeks ago
Toronto, Canada Cerebras Systems Inc. Full timeAbout Cerebras Systems Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer‑scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of one device. This enables industry‑leading training and inference speeds and lets machine learning users run...
-
Deployment Engineer, AI Inference
2 weeks ago
Toronto, Canada Cerebras Systems Full timeCerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry‑leading training and inference speeds and empowers machine learning...
-
LLM Engineer
2 weeks ago
Toronto, Canada MULTIVERSE COMPUTING Full timeMultiverse Computing Multiverse is a well-funded fast-growing deep-tech company founded in 2019. We are the largest quantum software company in the EU and have been recognized by CB Insights (2023 and 2025) as one of the 100 most promising AI companies in the world. With 180 employees and growing our team is fully multicultural and international. We deliver...
-
LLM Engineer
2 weeks ago
Toronto, Canada MULTIVERSE COMPUTING Full timeMultiverse Computing Multiverse is a well-funded fast-growing deep-tech company founded in 2019. We are the largest quantum software company in the EU and have been recognized by CB Insights (2023 and 2025) as one of the 100 most promising AI companies in the world. With 180 employees and growing our team is fully multicultural and international. We deliver...