ML Engineer — Open-Source LLM Inference

3 weeks ago


Toronto, Canada Red Hat, Inc. Full time

A leading open-source software company in Toronto is seeking a Machine Learning Engineer to innovate with vLLM systems. You will develop and maintain subsystems, ensuring efficient model performance and compliance. The ideal candidate has significant experience with Python, a strong grasp of LLM Inference concepts, and excellent communication skills. This position includes a competitive salary range of $133,650 - $220,680, along with comprehensive benefits and a collaborative work environment.
#J-18808-Ljbffr



  • Toronto, Canada Red Hat, Inc. Full time

    A leading open-source software company in Toronto is seeking a Machine Learning Engineer to innovate with vLLM systems. You will develop and maintain subsystems, ensuring efficient model performance and compliance. The ideal candidate has significant experience with Python, a strong grasp of LLM Inference concepts, and excellent communication skills. This...


  • Toronto, Canada Red Hat, Inc. Full time

    A leading open-source software company in Toronto is seeking a Machine Learning Engineer to innovate with vLLM systems. You will develop and maintain subsystems, ensuring efficient model performance and compliance. The ideal candidate has significant experience with Python, a strong grasp of LLM Inference concepts, and excellent communication skills. This...


  • Toronto, Canada Red Hat, Inc. Full time

    A leading open-source software company in Toronto is seeking a Machine Learning Engineer to innovate with vLLM systems. You will develop and maintain subsystems, ensuring efficient model performance and compliance. The ideal candidate has significant experience with Python, a strong grasp of LLM Inference concepts, and excellent communication skills. This...


  • Toronto, Canada Cerebras Systems Full time

    Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to...


  • Toronto, Canada Cerebras Systems Full time

    Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to...


  • Toronto, Canada Cerebras Systems Full time

    Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to...


  • Toronto, Canada Cerebras Systems Full time

    A leading AI technology firm in Toronto is seeking a technical engineering leader for its Inference Service Platform. The role involves leading a team to scale LLM inference and ensure operational excellence. Ideal candidates will have extensive experience in distributed systems and ML infrastructures. This position offers a unique opportunity to shape the...


  • Toronto, Canada Cerebras Systems Full time

    A leading AI technology firm in Toronto is seeking a technical engineering leader for its Inference Service Platform. The role involves leading a team to scale LLM inference and ensure operational excellence. Ideal candidates will have extensive experience in distributed systems and ML infrastructures. This position offers a unique opportunity to shape the...

  • AI Solutions Engineer

    3 weeks ago


    Toronto, Canada Cerebras Systems Full time

    Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer‑scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry‑leading training and inference speeds and empowers machine learning users...


  • Toronto, Canada Cerebras Systems Full time

    Cerebras Systems builds the world’s largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users...