ML Engineer — Open-Source LLM Inference
3 weeks ago
A leading open-source software company in Toronto is seeking a Machine Learning Engineer to innovate with vLLM systems. You will develop and maintain subsystems, ensuring efficient model performance and compliance. The ideal candidate has significant experience with Python, a strong grasp of LLM Inference concepts, and excellent communication skills. This position includes a competitive salary range of $133,650 - $220,680, along with comprehensive benefits and a collaborative work environment.
-
ML Engineer — Open-Source LLM Inference
3 weeks ago
Toronto, Canada Red Hat, Inc. Full time
A leading open-source software company in Toronto is seeking a Machine Learning Engineer to innovate with vLLM systems. You will develop and maintain subsystems, ensuring efficient model performance and compliance. The ideal candidate has significant experience with Python, a strong grasp of LLM Inference concepts, and excellent communication skills. This...
-
Sr. Inference ML Runtime Engineer
3 weeks ago
Toronto, Canada Cerebras Systems Full time
Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to...
-
AI Solutions Engineer
3 weeks ago
Toronto, Canada Cerebras Systems Full time
Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users...
-
Senior Research Engineer
2 weeks ago
Toronto, Canada Cerebras Systems Full time
Cerebras Systems builds the world’s largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users...
-
Remote ML Engineer: vLLM Inference
2 weeks ago
Toronto, Canada Red Hat Full time
A leading enterprise open-source provider seeks a Machine Learning Engineer in Toronto, Ontario, to advance AI capabilities. You will develop vLLM systems, contribute to tool calling parser designs, and mentor other engineers. The ideal candidate has strong Python skills, experience with LLM inference, and a desire to solve complex challenges in deep...
-
Senior Research Engineer
3 weeks ago
Toronto, Canada Cerebras Full time
Cerebras Systems builds the world’s largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture delivers industry-leading training and inference speeds while providing the simplicity of a single device. This allows machine-learning users to run large-scale models without managing large clusters of GPUs or TPUs. About the Role As...