Cloud AI Engineer

3 days ago


Toronto, Ontario, Canada Skyfall AI Full time

As an ML Engineer at Skyfall AI, you'll be responsible for deploying and optimizing large language models (LLMs) in production. This involves fine-tuning and RLHF-training LLMs, optimizing inference for cost and latency, and building scalable training pipelines using DeepSpeed, Accelerate, and Ray.

You'll work with our founding team, which consists of the founders of Maluuba, who were early leaders in the deep learning revolution. They worked with AI pioneers such as Yoshua Bengio and Richard Sutton before Maluuba was acquired by Microsoft for $160M and became Microsoft's AI research center in Canada.

Requirements
  • 3+ years of experience in ML engineering, model deployment, and large-scale training.
  • Experience with vector databases (FAISS, Pinecone, Weaviate) for retrieval-augmented generation (RAG).
  • Experience with multi-cloud ML deployment across AWS, GCP, and Azure.
  • Hands-on experience deploying LLMs or similar large-scale models in a production setting.
  • Expertise in multi-GPU training, model parallelism, and inference optimizations.
  • Strong knowledge of ML system performance tuning, latency optimization, and cost reduction strategies.
  • Experience in building and managing large-scale ML clusters across cloud or hybrid environments.
  • Solid understanding of LLM fine-tuning techniques, RLHF, and model evaluation metrics.


  • Toronto, Ontario, Canada Skyfall AI Full time

    Job Overview: Skyfall AI is on a mission to create the future of autonomous enterprises, and we're seeking a skilled Research Software Development Engineer (RSDE) to join our cutting-edge AI research team. This role is ideal for engineers who excel at the intersection of AI research and scalable software development, working on next-generation language models,...

  • Cloud AI Architect

    1 day ago


    Toronto, Ontario, Canada Focus Cloud Group Full time

    Job Overview: The Focus Cloud Group is a leading global financial services consultancy that aims to leverage the power of Microsoft AI technology. As our ideal candidate, you will be responsible for driving the development of an internal Gen AI product that harnesses the capabilities of this technology. This role requires an expert in AI architecture, including...

  • Cloud AI Engineer

    3 days ago


    Toronto, Ontario, Canada Cerebras Full time

    Job Overview: We are building a team of exceptional people to work together on big problems. As a software engineer on our AI cloud platform, you will play a critical role in shaping the future of AI technology. Our team is passionate about pushing the boundaries of what is possible with AI, and we're looking for like-minded individuals to join us. Our team...


  • Toronto, Ontario, Canada Arteria AI Inc Full time

    About Us: Arteria AI Inc. is a dynamic and inclusive team environment where innovation meets collaboration. We're passionate about transforming the business world through AI-enabled solutions. We're currently seeking a skilled Cloud Engineering Manager to join our team. As a seasoned professional, you will be responsible for leading our DevOps team in...


  • Toronto, Ontario, Canada Autodesk, Inc. Full time

    Autodesk, Inc. is seeking a highly skilled Machine Learning Cloud Optimization Developer to join our Research Engineering organization. As a key member of our team, you will collaborate with world-class researchers and engineers to build innovative ML-powered product features that empower our customers to create a better world. The role requires a software...


  • Toronto, Ontario, Canada Nucs AI, Inc. Full time

    We're currently seeking a skilled Back-End Developer to join our dynamic team as we continue to innovate in the healthcare sector. As a Back-End Developer at Nucs AI, you will play a critical role in building the foundation of our AI-powered medical software solutions. About Us: We are focused on revolutionizing healthcare by developing secure, scalable, and...


  • Toronto, Ontario, Canada Focus Cloud Group Full time

    At Focus Cloud Group, we are embarking on an exciting journey to establish a cutting-edge Generative AI group. We seek a seasoned leader to spearhead the development and quality assurance of our AI solutions. Salary Range: $180,000 - $250,000 CAD. Role Responsibilities: Software Development & Automation: Lead the design and delivery of scalable, secure, and...

  • Senior QA Engineer

    1 hour ago


    Toronto, Ontario, Canada Focus Cloud Group Full time

    We're seeking a highly skilled QA Lead to join our Focus Cloud Group Gen AI team. As a key member of our team, you'll be responsible for ensuring the highest quality of our Gen AI solutions through extensive testing and validation. Key Responsibilities: Your primary duties will include: Leading a team of Gen AI specialists, providing guidance and allocating...


  • Toronto, Ontario, Canada AI Tech Suite Full time

    MARZ is a leading provider of AI solutions for the VFX industry. We are looking for a talented Machine Learning Engineer - Cloud Native to join our team and help us deliver cutting-edge visual effects. Main Responsibilities: Design and develop ML pipelines for training, validation, and inference purposes. Implement RESTful APIs to support front-end...


  • Toronto, Ontario, Canada Cerebras Systems Full time

    Cerebras has developed a radically new chip and system to dramatically accelerate deep learning applications. Our system runs training and inference workloads orders of magnitude faster than contemporary machines, fundamentally changing the way ML researchers work and pursue AI innovation. We are innovating at every level of the stack – from chip, to...


  • Toronto, Ontario, Canada Boson AI Full time

    Company Overview: Boson AI is an early-stage startup building large language tools for interaction and entertainment. Our founders, Alex Smola, Mu Li, and a team of Deep Learning, Optimization, NLP, AutoML, and Statistics scientists and engineers are working on high-quality generative AI models for language and beyond.


  • Toronto, Ontario, Canada Cerebras Full time

    Cerebras has developed a radically new chip and system to dramatically accelerate deep learning applications. Our system runs training and inference workloads orders of magnitude faster than contemporary machines, fundamentally changing the way ML researchers work and pursue AI innovation. We are innovating at every level of the stack – from chip, to...


  • Toronto, Ontario, Canada Skyfall AI Full time

    Job Description: We're seeking a highly skilled Research Software Development Engineer (RSDE) to join our AI research team, working on next-generation language models, reinforcement learning, and multi-agent systems. The successful candidate will play a key role in developing AI training infrastructure, pushing the boundaries of LLMs and RL, and contributing to...


  • Toronto, Ontario, Canada Stantec Consulting International Ltd. Full time

    About the Role: This is an exciting opportunity to join our team as a Cloud AI Solutions Engineer. As an expert in designing, developing, and optimizing NLP chatbots and agents, you will play a crucial role in driving our initiatives forward. Your Key Responsibilities: Design LLM application architecture for retrieval over both structured and unstructured data...


  • Toronto, Ontario, Canada Focus Cloud Group Full time

    AI development is a core focus for us, and we need an experienced leader to help build and manage our AI solutions at Focus Cloud Group. This Director-level role involves overseeing the entire AI development lifecycle, from concept to deployment. Responsibilities: Design and deliver scalable, secure, and reusable software solutions using...

  • AI Engineer

    1 week ago


    Toronto, Ontario, Canada ShyftLabs Full time

    Position Overview: We are looking for an experienced AI Engineer to implement and optimize AI-powered products and solutions. In this role, you will work with cross-functional teams to apply, fine-tune, and scale AI models, ensuring they deliver real-world impact. You will focus on optimizing performance, integrating AI into production systems, and enabling...


  • Toronto, Ontario, Canada Focus Cloud Group Full time

    As a seasoned professional in Gen AI, you will join Focus Cloud Group, a leading global financial services consultancy looking to expand their expertise in artificial intelligence. We are seeking a highly skilled QA Lead to spearhead our Gen AI testing efforts. Key responsibilities include team leadership and hands-on QA testing, ensuring the quality of Gen...


  • Toronto, Ontario, Canada Skyfall AI Full time

    Machine Learning Engineer at Skyfall AI: Skyfall is disrupting the entire AI ecosystem by building the first world model for the enterprise. The goal of the 'Enterprise World Model' is to overcome the severe limitations of LLMs (safety, hallucinations, expensive training) in order to provide enterprises significant value by having a comprehensive...

  • Senior AI Engineer

    5 days ago


    Toronto, Ontario, Canada Tali AI Full time

    Company Overview: Tali AI is a fast-growing company dedicated to tackling the issue of physician burnout through the development of cutting-edge AI technology. About the Role: We are seeking an experienced Senior Machine Learning Engineer to play a crucial role in advancing our capabilities in natural language processing and speech recognition systems. Key...


  • Toronto, Ontario, Canada AI Tech Suite Full time

    At AI Tech Suite, we are looking for a skilled Cloud Platform Engineer to join our team. As a key member of our infrastructure team, you will be responsible for designing and developing cutting-edge machine learning pipelines that drive innovation in the field of visual effects. About MARZ: MARZ is a technology and VFX company specializing in feature-film...