Head of Inference Platform Engineering

3 weeks ago

Canada Cerebras Full time

A pioneering AI hardware company is seeking a technical engineering leader for its Inference Service Platform. The role involves leading a team to tackle scaling challenges for LLM inference while ensuring high reliability and performance. Ideal candidates should have strong experience in distributed systems, inference optimization, and technical leadership, with a commitment to mentoring and operational excellence. The role is located in Canada and offers the chance to work on groundbreaking AI advancements.

Senior Platform Engineer — AI Inference Services

3 weeks ago

Canada Cerebras Full time

A leading AI technology company in Canada is seeking a Platform Software Engineer to develop key backend services for their Inference platform. The ideal candidate should have over 5 years of backend development experience with strong Python skills. Responsibilities include API design and maintenance, collaborating with cross-functional teams, and...
Deployment Engineer, AI Inference

4 weeks ago

Canada Cerebras Full time

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to...
AI Inference Engineer — Open-Source Integrations

6 days ago

Canada Cerebras Full time

A leading AI technology company in Canada seeks an experienced software engineer to develop open-source libraries and applications for its innovative inference platform. The role involves collaborating with engineering teams and creating demo applications that showcase the platform's advantages. Candidates should have a degree in computer science, 4+ years...
LLM Inference Deployment Engineer

12 hours ago

U.S., Canada, Germany, Norway EnCharge AI Full time US$120,000 - US$200,000 per year

EnCharge AI is a leader in advanced AI hardware and software systems for edge-to-cloud computing. EnCharge's robust and scalable next-generation in-memory computing technology provides orders-of-magnitude higher compute efficiency and density compared to today's best-in-class solutions. The high-performance architecture is coupled with seamless software...
GPU Cloud Platform Engineer

6 days ago

Canada Yotta Labs Full time

Yotta Labs is pioneering the development of a Decentralized Operating System (DeOS) for AI workload orchestration at a planetary scale. Our mission is to democratize access to AI resources by aggregating geo-distributed GPUs, enabling high-performance computing for AI...
Remote Head of Engineering for Fintech Platform

3 weeks ago

Canada Koyfin Full time

A progressive fintech company in Canada is seeking a Head of Engineering to scale their analytics platform. This role involves defining technical strategy, building a high-performing engineering team, and driving innovation. The ideal candidate has 10+ years in software engineering, with a strong background in web application architecture and leadership....
Head of Corporate Development

22 hours ago

Remote Canada, France, Germany, Spain, UK or USA Platform Full time US$120,000 - US$180,000 per year

About is Platform-as-a-Service (PaaS) that removes the complexities of cloud infrastructure management and optimizes development-to-production workflows, reducing the time it takes to build and deploy applications. Delivering efficiency, reliability, and security, giving development teams both control and peace of mind. Built for developers, by...
QA Tech Lead, Web Consoles

4 weeks ago

Canada Cerebras Full time

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to...
LLMOps Engineer

1 week ago

Adelaide Street West, Toronto, Ontario, Canada, MV P Thrive Career Wellness Platform Full time $140,000 - $160,000 per year

Job Title: LLMOps Engineer
Location: Hybrid - Toronto, Canada
Salary Range: $140K - $160K
Reports To: Head of AI, with collaboration across Engineering & DevOps
Job Summary: We are seeking an experienced and highly skilled LLMOps Engineer to join our team at Thrive. This newly created role will be responsible for deploying, optimizing, and scaling large...
Senior AI Platform Engineer

3 weeks ago

BC, Canada Semantic Enterprise AI Full time

About the Role Semantic Enterprise AI (SEAI) builds next-generation Decision Engine workflows that integrate machine learning, agentic automation, and advanced reasoning tools into enterprise products that empower organizations to make better upside decisions faster. As a...
