Head of Inference Platform Engineering

3 weeks ago


Canada Cerebras Full time

A pioneering AI hardware company is seeking a technical engineering leader for their Inference Service Platform. The role involves leading a team to tackle scaling challenges for LLM inference while ensuring high reliability and performance. Ideal candidates should have strong experience in distributed systems, inference optimization, and technical leadership, with a commitment to mentoring and operational excellence. Located in Canada, candidates can expect to engage in groundbreaking AI advancements.#J-18808-Ljbffr



  • , , Canada Cerebras Full time

    A leading AI technology company in Canada is seeking a Platform Software Engineer to develop key backend services for their Inference platform. The ideal candidate should have over 5 years of backend development experience with strong Python skills. Responsibilities include API design and maintenance, collaborating with cross-functional teams, and...


  • , , Canada Cerebras Full time

    Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to...


  • , , Canada Cerebras Full time

    A leading AI technology company in Canada seeks an experienced software engineer to develop open-source libraries and applications for its innovative inference platform. The role involves collaborating with engineering teams and creating demo applications that showcase the platform's advantages. Candidates should have a degree in computer science, 4+ years...


  • U.S., Canada, Germany, Norway EnCharge AI Full time US$120,000 - US$200,000 per year

    EnCharge AI is a leader in advanced AI hardware and software systems for edge-to-cloud computing. EnCharge's robust and scalable next-generation in-memory computing technology provides orders-of-magnitude higher compute efficiency and density compared to today's best-in-class solutions. The high-performance architecture is coupled with seamless software...


  • , , Canada Yotta Labs Full time

    Join to apply for the GPU Cloud Platform Engineer role at Yotta Labs . About Yotta Labs Yotta Labs is pioneering the development of a Decentralized Operating System (DeOS) for AI workload orchestration at a planetary scale. Our mission is to democratize access to AI resources by aggregating geo-distributed GPUs, enabling high-performance computing for AI...


  • , , Canada Koyfin Full time

    A progressive fintech company in Canada is seeking a Head of Engineering to scale their analytics platform. This role involves defining technical strategy, building a high-performing engineering team, and driving innovation. The ideal candidate has 10+ years in software engineering, with a strong background in web application architecture and leadership....


  • Remote Canada, France, Germany, Spain, UK or USA Platform Full time US$120,000 - US$180,000 per year

    About is Platform-as-a-Service (PaaS) that removes the complexities of cloud infrastructure management and optimizes development-to-production workflows, reducing the time it takes to build and deploy applications. Delivering efficiency, reliability, and security, giving development teams both control and peace of mind. Built for developers, by...


  • , , Canada Cerebras Full time

    Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to...

  • LLMOps Engineer

    1 week ago


    Adelaide Street West, Toronto, Ontario, Canada, MV P Thrive Career Wellness Platform Full time $140,000 - $160,000 per year

    Job Title: LLMOps Engineer Location: Hybrid - Toronto, Canada. Salary Range: $140K - $160K Reports To: Head of AI, with collaboration across Engineering & DevOps Job Summary: We are seeking an experienced and highly skilled LLMOps Engineer to join our team at Thrive. This newly created role will be responsible for deploying, optimizing, and scaling large...


  • , BC, Canada Semantic Enterprise AI Full time

    Employee Experience Advocate | Shaping Positive Work Cultures | Trusted HR Advisor About the Role Semantic Enterprise AI (SEAI) builds next-generation Decision Engine workflows that integrate machine learning, agentic automation, and advanced reasoning tools into enterprise products that empower organizations to make better upside decisions faster. As a...