Inference Serving Engineer — Scalable AI Infra

1 week ago


Toronto, Canada Taalas Full time

A technology firm specializing in AI is seeking a Software Engineer – Inference Serving. This entry-level role involves building software infrastructure for an inference serving cluster. Responsibilities include adapting open-source inference servers and implementing efficient solutions for AI models. Ideal candidates should have a relevant degree and familiarity with Python, ML, and low-level programming.#J-18808-Ljbffr



  • Toronto, Canada Taalas Full time

    A technology firm specializing in AI is seeking a Software Engineer – Inference Serving. This entry-level role involves building software infrastructure for an inference serving cluster. Responsibilities include adapting open-source inference servers and implementing efficient solutions for AI models. Ideal candidates should have a relevant degree and...


  • Toronto, Canada Taalas Full time

    A technology firm specializing in AI is seeking a Software Engineer – Inference Serving. This entry-level role involves building software infrastructure for an inference serving cluster. Responsibilities include adapting open-source inference servers and implementing efficient solutions for AI models. Ideal candidates should have a relevant degree and...


  • Toronto, Canada Taalas Full time

    Join to apply for the Software Engineer – Inference Serving role at Taalas At Taalas we believe that fundamental progress is achieved by those who are willing to understand and assail a problem end-to-end, without regard for commonly accepted abstractions and boundaries. We are building a team of hands‑on technologists who dislike overspecialization and...


  • Toronto, Canada Taalas Full time

    Join to apply for the Software Engineer – Inference Serving role at Taalas At Taalas we believe that fundamental progress is achieved by those who are willing to understand and assail a problem end-to-end, without regard for commonly accepted abstractions and boundaries. We are building a team of hands‑on technologists who dislike overspecialization and...


  • Toronto, Canada Taalas Full time

    Join to apply for the Software Engineer – Inference Serving role at Taalas At Taalas we believe that fundamental progress is achieved by those who are willing to understand and assail a problem end-to-end, without regard for commonly accepted abstractions and boundaries. We are building a team of hands‑on technologists who dislike overspecialization and...


  • Toronto, Canada Cerebras Systems Inc. Full time

    About Cerebras Systems Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer‑scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of one device. This enables industry‑leading training and inference speeds and lets machine learning users run...


  • Toronto, Canada Cerebras Systems Inc. Full time

    About Cerebras Systems Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer‑scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of one device. This enables industry‑leading training and inference speeds and lets machine learning users run...


  • Toronto, Canada Cerebras Systems Full time

    ​Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry‑leading training and inference speeds and empowers machine learning...


  • Toronto, Canada Cerebras Systems Full time

    Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to...


  • Toronto, Canada Cerebras Systems Full time

    Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to...