LLM Serving Engineer
1 week ago
Role Overview LLM Serving Engineer (Cloud AI Engineering) – Senior / Staff Engineer at Qualcomm Technologies, Inc. We are building a scalable LLM inference platform that spans from research to commercial deployment. The role spans the full product lifecycle and requires strategic thinking, strong execution, and excellent communication skills. Responsibilities Build a scalable LLM inference platform using techniques such as disaggregated serving, KV‑Cache management, advanced parallelism, speculative algorithms, model optimization, and specialized kernels. Contribute to the development of LLM serving packages (e.g. vLLM, SGLang, TGI, Triton‑Inference Server, Dynamo, LLM‑d). Collaborate closely with customers to drive solutions by working with internal compiler, firmware and platform teams. Drive efficient serving through smart autoscaling, load balancing, and routing. Engage with open‑source serving communities to evolve the framework. Qualifications Hands‑on experience with one or more LLM serving/orchestration packages (Triton‑Inference Server, vLLM, SGLang, Ollama, llm‑d, KServe, LMCache, MoonCake). Deep understanding of foundational LLMs, VLMs, SLMs, and transformer‑based architectures. Strong experience developing language models using PyTorch. Strong computer science fundamentals – algorithms, data structures, parallel and distributed programming. Understanding of computer architecture, ML accelerators, in‑memory processing and distributed systems. Strong Python development skills for large‑scale projects. Experience analyzing, profiling, and optimizing deep learning workloads. Proactive learning about the latest inference optimization techniques. Excellent communication and problem‑solving skills in a fast‑paced environment. MS in Computer Science, Machine Learning, Computer Engineering or Electrical Engineering. Bonus Skills Open‑source contribution to any GenAI package. Experience architecting and developing large‑scale distributed systems. High‑level kernel design experience (PyTorch, CUDA, Triton). Knowledge of torch.compile or torchDynamo. PhD in Computer Science, Computer Engineering or Machine Learning. Minimum Qualifications Bachelor's degree in Computer Science, Engineering, Information Systems, or related field and 4+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience. Master's degree in Computer Science, Engineering, Information Systems, or related field and 3+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience. PhD in Computer Science, Engineering, Information Systems, or related field and 2+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience. Benefits & Compensation Pay Range: $158,400.00 – $237,600.00. We also offer a competitive annual discretionary bonus program and RSU grants. For full details, review our US benefits here. Equal Opportunity & Accessibility Qualcomm is an equal opportunity employer. We are committed to providing an accessible process for individuals with disabilities. For accommodations, contact disability‑accommodations@qualcomm.com or call our toll‑free number. Qualified applicants will receive consideration for employment without regard to protected classification. Location Toronto, Ontario, Canada – 3 weeks ago. #J-18808-Ljbffr
-
LLM Serving Engineer
1 week ago
Markham, Canada Qualcomm Full timeRole Overview LLM Serving Engineer (Cloud AI Engineering) – Senior / Staff Engineer at Qualcomm Technologies, Inc. We are building a scalable LLM inference platform that spans from research to commercial deployment. The role spans the full product lifecycle and requires strategic thinking, strong execution, and excellent communication skills....
-
LLM Serving Engineer
1 week ago
Markham, Canada Qualcomm Full timeRole Overview LLM Serving Engineer (Cloud AI Engineering) – Senior / Staff Engineer at Qualcomm Technologies, Inc. We are building a scalable LLM inference platform that spans from research to commercial deployment. The role spans the full product lifecycle and requires strategic thinking, strong execution, and excellent communication skills....
-
Research Engineer
2 weeks ago
Markham, Canada Huawei Technologies Canada Co., Ltd. Full timeHuawei Canada has an immediate 12-month contract opening for a Research Engineer.About the team:The Intelligent Testing Technology Team, part of the Waterloo Research Centre, is at the forefront of integrating large language models (LLMs) with formal methods to advance artificial intelligence. The team explores the synergy between LLMs' natural language...
-
Research Engineer
1 day ago
Markham, Canada Huawei Canada Full timeResearch Engineer - Software Systems Engineering/LLMs Join to apply for the Research Engineer - Software Systems Engineering/LLMs role at Huawei Canada Research Engineer - Software Systems Engineering/LLMs Join to apply for the Research Engineer - Software Systems Engineering/LLMs role at Huawei Canada Huawei Canada has an immediate 12-month contract opening...
-
Research Engineer
1 day ago
Markham, Canada Huawei Technologies Canada Co., Ltd. Full timeHuawei Canada has an immediate 12-month contract opening for a Research Engineer. About the team: The Intelligent Testing Technology Team, part of the Waterloo Research Centre, is at the forefront of integrating large language models (LLMs) with formal methods to advance artificial intelligence. The team explores the synergy between LLMs' natural language...
-
Research Engineer
6 days ago
Markham, Canada Huawei Canada Full timeResearch Engineer - Software Systems Engineering/LLMsJoin to apply for the Research Engineer - Software Systems Engineering/LLMs role at Huawei CanadaResearch Engineer - Software Systems Engineering/LLMsJoin to apply for the Research Engineer - Software Systems Engineering/LLMs role at Huawei CanadaHuawei Canada has an immediate 12-month contract opening for...
-
Research Engineer
2 weeks ago
Markham, Canada Huawei Canada Full timeResearch Engineer - Software Systems Engineering/LLMsJoin to apply for the Research Engineer - Software Systems Engineering/LLMs role at Huawei CanadaResearch Engineer - Software Systems Engineering/LLMsJoin to apply for the Research Engineer - Software Systems Engineering/LLMs role at Huawei CanadaHuawei Canada has an immediate 12-month contract opening for...
-
Research Engineer
3 weeks ago
Markham, Canada Futureshaper.com Full timeHuawei Canada has an immediate permanent opening for a Research Engineer. About the team: The Intelligent Testing Technology Team, currently part of the Waterloo Research Centre, is at the forefront of integrating large language models (LLMs) with formal methods to advance artificial intelligence. By harnessing LLMs’ strengths in natural language...
-
Research Engineer
3 weeks ago
Markham, Canada Futureshaper.com Full timeHuawei Canada has an immediate permanent opening for a Research Engineer. About the team: The Intelligent Testing Technology Team, currently part of the Waterloo Research Centre, is at the forefront of integrating large language models (LLMs) with formal methods to advance artificial intelligence. By harnessing LLMs’ strengths in natural language...
-
LLM-Driven Software Systems Research Engineer
3 weeks ago
Markham, Canada Futureshaper.com Full timeA leading technology firm in Canada is seeking a Research Engineer to advance software engineering processes using Large Language Models (LLMs) and AI techniques. The ideal candidate will collaborate on innovative projects, developing frameworks and methodologies that integrate LLMs in real-world applications. This role requires a PhD or Master's degree in...