AI Inference Engineer — Open-Source Integrations

2 weeks ago


Canada Cerebras Full time

A leading AI technology company in Canada seeks an experienced software engineer to develop open-source libraries and applications for its innovative inference platform. The role involves collaborating with engineering teams and creating demo applications that showcase the platform's advantages. Candidates should have a degree in computer science, 4+ years of experience, and proficiency in Python alongside modern LLM frameworks. This position offers the chance to work at the forefront of AI technology in a supportive environment.


  • AI Solutions Engineer

    2 weeks ago


    Canada Cerebras Full time

    Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to...


  • Canada Cerebras Full time

    Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer‑scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry‑leading training and inference speeds and empowers machine learning users...


  • Canada EnCharge AI Full time

    A leading AI technology company in Canada is seeking an experienced AI Research Engineer to optimize deep learning models for edge AI platforms. Responsibilities include developing quantization strategies and efficient inference techniques. Candidates must possess a Master's or Ph.D. in Computer Science or Electrical Engineering, with expertise in deep...


  • Canada Cerebras Full time

    Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to...


  • Canada Cerebras Full time

    About the team The Cerebras Inference team’s mission is to deliver the world’s most performant, secure, and reliable enterprise‑grade AI service. We build and operate large‑scale distributed systems that power AI inference at unprecedented speed and efficiency. Join us to help scale inference and accelerate AI. About the role We’re looking for a...


  • U.S., Canada, Germany, Norway EnCharge AI Full time

    EnCharge AI is a leader in advanced AI hardware and software systems for edge-to-cloud computing. EnCharge's robust and scalable next-generation in-memory computing technology provides orders-of-magnitude higher compute efficiency and density compared to today's best-in-class solutions. The high-performance architecture is coupled with seamless software...


  • Canada EnCharge AI Full time

    EnCharge AI is a leader in advanced AI hardware and software systems for edge-to-cloud computing. EnCharge’s robust and scalable next-generation in‑memory computing technology provides orders‑of‑magnitude higher compute efficiency and density compared to today’s best‑in‑class solutions. The high‑performance architecture is coupled with...


  • Canada Cerebras Full time

    Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer‑scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry‑leading training and inference speeds and empowers machine learning users...

  • AI Runtime Engineer

    2 weeks ago


    U.S., Canada, Germany, Norway EnCharge AI Full time

    EnCharge AI is a leader in advanced AI hardware and software systems for edge-to-cloud computing. EnCharge's robust and scalable next-generation in-memory computing technology provides orders-of-magnitude higher compute efficiency and density compared to today's best-in-class solutions. The high-performance architecture is coupled with seamless software...

  • AI Research Engineer

    2 weeks ago


    Canada, Germany, Norway, United States EnCharge AI Full time

    EnCharge AI is a leader in advanced AI hardware and software systems for edge-to-cloud computing. EnCharge's robust and scalable next-generation in-memory computing technology provides orders-of-magnitude higher compute efficiency and density compared to today's best-in-class solutions. The high-performance architecture is coupled with seamless software...