Senior LLMOps Engineer

5 days ago


Toronto, Canada TEEMA Solutions Group Full time

Location: Downtown TorontoHybrid: 4 days in office Ready to build what powers the next generation of AI? We’re looking for a Staff LLMOps Engineer to lead the design, deployment, and optimization of large language model (LLM) infrastructure on the cloud.You’ll be the driving force behind taking trained models from lab to production—scaling efficiently across multi-GPU clusters and pushing the boundaries of inference performance for enterprise-grade AI applications. If you thrive at the intersection of AI, cloud engineering, and systems optimization , this is your chance to shape the future of large-scale model serving in a high-impact environment. What You’ll Do Architect and operationalize LLM deployment pipelines on AWS and Kubernetes/EKS. Build and scale multi-GPU inference infrastructure for low latency, high availability, and cost efficiency. Optimize inference using frameworks like vLLM, SGLang, and DeepSpeed-Inference . Implement advanced serving techniques: continuous batching, speculative decoding, KV-cache management, and distributed scheduling. Collaborate with AI researchers to convert model training outputs into production-grade APIs and services . Establish observability and monitoring for latency, throughput, GPU utilization, and failure recovery. Automate provisioning, scaling, and upgrades using Terraform and CI/CD pipelines . Ensure compliance, security, and efficiency in multi-tenant LLM hosting for enterprise clients. What We’re Looking For 6+ years in DevOps, ML infrastructure, or cloud platform engineering. 2+ years of direct experience deploying and optimizing LLMs or large-scale ML models. Expertise with GPU-accelerated inference and distributed serving environments. Deep familiarity with cloud-native architectures (AWS, GCP, Azure) and Kubernetes . Strong foundation in Python, Bash, and IaC (Terraform) . Experience integrating monitoring tools (Prometheus, Grafana, Datadog) for performance visibility. Passion for building robust, scalable, and secure AI systems. Why Join Lead and own mission-critical AI infrastructure at a fast-scaling startup. Work alongside world-class engineers, data scientists, and innovators. Competitive salary + meaningful equity in a company redefining applied AI. A culture built on innovation, technical depth, and impact—your work truly matters. #J-18808-Ljbffr


  • LLMOps Engineer

    3 weeks ago


    Toronto, Canada Thrive Career Wellness Platform Full time

    We are seeking an experienced and highly skilled LLMOps Engineer to join our team at Thrive. This newly created role will be responsible for deploying, optimizing, and scaling large language model (LLM) applications across our platform. The successful candidate will own the operational backbone of our AI-driven products, ensuring performance, reliability,...

  • LLMOps Engineer

    1 week ago


    Toronto, Canada Thrive Career Wellness Platform Full time

    We are seeking an experienced and highly skilled LLMOps Engineer to join our team at Thrive. This newly created role will be responsible for deploying, optimizing, and scaling large language model (LLM) applications across our platform. The successful candidate will own the operational backbone of our AI-driven products, ensuring performance, reliability,...

  • LLMOps Engineer

    1 week ago


    Toronto, Canada Thrive Career Wellness Platform Full time

    We are seeking an experienced and highly skilled LLMOps Engineer to join our team at Thrive. This newly created role will be responsible for deploying, optimizing, and scaling large language model (LLM) applications across our platform. The successful candidate will own the operational backbone of our AI-driven products, ensuring performance, reliability,...


  • Toronto, Canada Thrive Career Wellness Platform Full time

    A leading career wellness company is seeking an experienced LLMOps Engineer to join their team in a hybrid work environment. This role involves deploying and optimizing large language models, collaborating with various teams, and ensuring cost-effective, reliable performance. The ideal candidate should have over 3 years of experience in LLMOps, strong Python...


  • Toronto, Canada Thrive Career Wellness Platform Full time

    A leading career wellness company is seeking an experienced LLMOps Engineer to join their team in a hybrid work environment. This role involves deploying and optimizing large language models, collaborating with various teams, and ensuring cost-effective, reliable performance. The ideal candidate should have over 3 years of experience in LLMOps, strong Python...


  • Toronto, Canada Thrive Career Wellness Platform Full time

    A leading career wellness company is seeking an experienced LLMOps Engineer to join their team in a hybrid work environment. This role involves deploying and optimizing large language models, collaborating with various teams, and ensuring cost-effective, reliable performance. The ideal candidate should have over 3 years of experience in LLMOps, strong Python...


  • Toronto, Canada TEEMA Solutions Group Full time

    Location: Downtown TorontoHybrid: 4 days in office Ready to build what powers the next generation of AI? We’re looking for a Staff LLMOps Engineer to lead the design, deployment, and optimization of large language model (LLM) infrastructure on the cloud.You’ll be the driving force behind taking trained models from lab to production—scaling efficiently...

  • Senior LLMOps Engineer

    23 hours ago


    Toronto, Canada TEEMA Solutions Group Full time

    Location: Downtown Toronto Hybrid: 4 days in office Ready to build what powers the next generation of AI? We’re looking for a Staff LLMOps Engineer to lead the design, deployment, and optimization of large language model (LLM) infrastructure on the cloud. You’ll be the driving force behind taking trained models from lab to production—scaling...


  • Toronto, Canada Talent To Hire Inc. Full time

    Senior AI Engineer - Agentic Systems / LLMOPS We’re looking for an AI Engineer with deep technical expertise in agentic AI systems, LLM orchestration, and cloud deployment. The ideal candidate has hands-on experience building, deploying, and optimizing multi-agent architectures, integrating Retrieval-Augmented Generation (RAG), and delivering...


  • Toronto, Canada Svitla Systems, Inc. Full time

    SvitlaSystems Inc. is looking foraSenior AI and ML Engineerfor a full-time position (40 hoursper week) inEurope. Our client is a leading expert network that provides business and government professionals with opportunities to communicate with industry and subject-matter experts to answer research questions. Customers consult with these experts over the phone...