Remote ML Engineer: vLLM Inference

3 weeks ago


Toronto, Canada Red Hat Full time

A leading enterprise open-source provider seeks a Machine Learning Engineer in Toronto, Ontario, to advance AI capabilities. You will develop vLLM systems, contribute to tool calling parser designs, and mentor other engineers. The ideal candidate has strong Python skills, experience with LLM inference, and a desire to solve complex challenges in deep learning. Competitive salary range from $133,650 to $220,680, with additional benefits for full-time associates.
#J-18808-Ljbffr



  • Toronto, Canada Red Hat Full time

    A leading enterprise open-source provider seeks a Machine Learning Engineer in Toronto, Ontario, to advance AI capabilities. You will develop vLLM systems, contribute to tool calling parser designs, and mentor other engineers. The ideal candidate has strong Python skills, experience with LLM inference, and a desire to solve complex challenges in deep...


  • Toronto, Canada Red Hat Full time

    A leading enterprise open-source provider seeks a Machine Learning Engineer in Toronto, Ontario, to advance AI capabilities. You will develop vLLM systems, contribute to tool calling parser designs, and mentor other engineers. The ideal candidate has strong Python skills, experience with LLM inference, and a desire to solve complex challenges in deep...


  • Toronto, Canada Cerebras Full time

    Cerebras Systems builds the world’s largest AI chip, 56 times larger than GPUs. Our novel wafer‑scale architecture delivers industry‑leading training and inference speeds while providing the simplicity of a single device. This allows machine‑learning users to run large‑scale models without managing large clusters of GPUs or TPUs. About the Role As...


  • Toronto, Canada Cerebras Full time

    Cerebras Systems builds the world’s largest AI chip, 56 times larger than GPUs. Our novel wafer‑scale architecture delivers industry‑leading training and inference speeds while providing the simplicity of a single device. This allows machine‑learning users to run large‑scale models without managing large clusters of GPUs or TPUs. About the Role As...


  • Toronto, Canada Red Hat Full time

    Machine Learning Engineer, vLLM Inference - Tool Calling and Structured Output At Red Hat, we believe the future of AI is open, and we are on a mission to bring the power of open‑source LLMs and vLLM to every enterprise. The Red Hat Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI deployments. As leading...


  • Toronto, Canada Red Hat Full time

    Machine Learning Engineer, vLLM Inference - Tool Calling and Structured Output At Red Hat, we believe the future of AI is open, and we are on a mission to bring the power of open‑source LLMs and vLLM to every enterprise. The Red Hat Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI deployments. As leading...


  • Toronto, Canada Red Hat Full time

    Machine Learning Engineer, vLLM Inference - Tool Calling and Structured Output At Red Hat, we believe the future of AI is open, and we are on a mission to bring the power of open‑source LLMs and vLLM to every enterprise. The Red Hat Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI deployments. As leading...


  • Toronto, Canada Red Hat, Inc. Full time

    A leading open-source software company in Toronto is seeking a Machine Learning Engineer to innovate with vLLM systems. You will develop and maintain subsystems, ensuring efficient model performance and compliance. The ideal candidate has significant experience with Python, a strong grasp of LLM Inference concepts, and excellent communication skills. This...


  • Toronto, Canada Red Hat, Inc. Full time

    A leading open-source software company in Toronto is seeking a Machine Learning Engineer to innovate with vLLM systems. You will develop and maintain subsystems, ensuring efficient model performance and compliance. The ideal candidate has significant experience with Python, a strong grasp of LLM Inference concepts, and excellent communication skills. This...


  • Toronto, Canada Red Hat, Inc. Full time

    A leading open-source software company in Toronto is seeking a Machine Learning Engineer to innovate with vLLM systems. You will develop and maintain subsystems, ensuring efficient model performance and compliance. The ideal candidate has significant experience with Python, a strong grasp of LLM Inference concepts, and excellent communication skills. This...