AI Inference Engineer — Open-Source Integrations

3 weeks ago

Canada Cerebras Full time

A leading AI technology company in Canada seeks an experienced software engineer to develop open-source libraries and applications for its innovative inference platform. The role involves collaborating with engineering teams and creating demo applications that showcase the platform's advantages. Candidates should have a degree in computer science, 4+ years of experience, and proficiency in Python alongside modern LLM frameworks. This position offers the chance to work at the forefront of AI technology in a supportive environment.

AI Solutions Engineer

3 weeks ago

Canada Cerebras Full time

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to...
Edge AI Research Engineer: Quantization

1 week ago

Canada EnCharge AI Full time

A leading AI technology company in Canada is seeking an experienced AI Research Engineer to optimize deep learning models for edge AI platforms. Responsibilities include developing quantization strategies and efficient inference techniques. Candidates must possess a Master's or Ph.D. in Computer Science or Electrical Engineering, with expertise in deep...
Deployment Engineer, AI Inference

3 days ago

Canada Cerebras Full time

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to...
Senior Software Engineer, AI Inference Platform

2 weeks ago

Canada Cerebras Full time

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer‑scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry‑leading training and inference speeds and empowers machine learning users...
Principal Engineer, AI Inference Reliability

1 week ago

Canada Cerebras Full time

About the team: The Cerebras Inference team’s mission is to deliver the world’s most performant, secure, and reliable enterprise‑grade AI service. We build and operate large‑scale distributed systems that power AI inference at unprecedented speed and efficiency. Join us to help scale inference and accelerate AI. About the role: We’re looking for a...
AI Compiler Engineer

7 days ago

U.S., Canada, Germany, Norway EnCharge AI Full time

EnCharge AI is a leader in advanced AI hardware and software systems for edge-to-cloud computing. EnCharge's robust and scalable next-generation in-memory computing technology provides orders-of-magnitude higher compute efficiency and density compared to today's best-in-class solutions. The high-performance architecture is coupled with seamless software...
AI Research Engineer

1 week ago

Canada EnCharge AI Full time

EnCharge AI is a leader in advanced AI hardware and software systems for edge-to-cloud computing. EnCharge’s robust and scalable next-generation in‑memory computing technology provides orders‑of‑magnitude higher compute efficiency and density compared to today’s best‑in‑class solutions. The high‑performance architecture is coupled with...
Senior Research Engineer

1 week ago

Canada Cerebras Full time

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer‑scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry‑leading training and inference speeds and empowers machine learning users...
AI Runtime Engineer

2 weeks ago

U.S., Canada, Germany, Norway EnCharge AI Full time

EnCharge AI is a leader in advanced AI hardware and software systems for edge-to-cloud computing. EnCharge's robust and scalable next-generation in-memory computing technology provides orders-of-magnitude higher compute efficiency and density compared to today's best-in-class solutions. The high-performance architecture is coupled with seamless software...
AI Research Engineer

2 weeks ago

Canada, Germany, Norway, United States EnCharge AI Full time

EnCharge AI is a leader in advanced AI hardware and software systems for edge-to-cloud computing. EnCharge's robust and scalable next-generation in-memory computing technology provides orders-of-magnitude higher compute efficiency and density compared to today's best-in-class solutions. The high-performance architecture is coupled with seamless software...
