AI/ML Infrastructure Engineer
5 days ago
About The Opportunity About the job The team is dedicated to developing a robust platform for training and serving machine learning models. This platform streamlines the productionization of AI and ML models by mitigating the incidental complexities involved in creating backend services for serving predictions and training models. Responsibilities Contribute to the client's ML Platform SDK and build tools for various ML operations. Collaborate with Machine Learning Engineers (MLE), researchers, and various product teams to deliver scalable ML platform tooling solutions that meet the timelines and specifications of given requirements. Work independently and collaboratively on squad projects that often requires learning and applying new technologies that may go beyond existing skillsets. Manage and maintain large scale production Kubernetes clusters for ML workloads, including ML platform infrastructure and necessary dev ops. Design, documents and implements reliable, testable and maintainable solutions ML infrastructure capabilities. Required Qualifications You have 6+ years of hands‑on experience implementing production ML infrastructure at scale in Python, Go or similar languages. You have knowledge of deep learning fundamentals, algorithms, and open‑source tools such as Huggingface, Ray, PyTorch or TensorFlow. You have an understanding of distributed training leveraging GPUs and Kubernetes. You have a general understanding of data processing for ML. You have experience with agile software processes and modular code design following industry standards. Seniority level Mid-Senior level Employment type Full-time Job function Information Technology Industries IT Services and IT Consulting Referrals increase your chances of interviewing at Avenue Code by 2x. Get notified about new Infrastructure Engineer jobs in Toronto, Ontario, Canada . #J-18808-Ljbffr
-
Staff Software Engineer, GPU Infrastructure
2 weeks ago
, , Canada Cohere AI Full timeWho are we? Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI. We obsess over what we...
-
, , Canada Cohere AI Full timeA leading AI infrastructure company in Canada is seeking a Staff Software Engineer to build and scale ML-optimized HPC infrastructure. You will work closely with AI researchers to ensure optimal performance of AI workloads and systems. The ideal candidate has deep expertise in ML infrastructure and strong skills in Kubernetes and Python. This role offers...
-
AI/ML Engineer
5 days ago
, , Canada Tek Tron IT Full timeJob Overview In this role, the Machine Learning Engineer will design, develop, and deploy AI and deep learning models for production, build scalable data pipelines for feature extraction and training, manage end‑to‑end MLOps workflows, conduct exploratory data analysis and feature engineering, collaborate with data scientists and software engineers,...
-
AI Platform Engineer
3 weeks ago
, BC, Canada Semantic Enterprise AI Full timeDirect message the job poster from Semantic Enterprise AI Employee Experience Advocate | Shaping Positive Work Cultures | Trusted HR Advisor About the Role Semantic Enterprise AI (SEAI) builds next-generation Decision Engine workflows that integrate machine learning, agentic automation, and advanced reasoning tools into enterprise products that empower...
-
Production ML Engineer for Enterprise LLM
3 weeks ago
, BC, Canada Semantic Enterprise AI Full timeA technology company specializing in AI solutions in British Columbia seeks a Machine Learning Engineer. The role involves designing ML tools for enterprise decision-making, owning the full development lifecycle, and collaborating with teams. Candidates should have significant experience in building production ML systems and proficiency in Python and cloud...
-
Staff Machine Learning Engineer
3 weeks ago
, , Canada Coinbase Full timeStaff Machine Learning Engineer - AI/ML Risk Platform Job ID: GPML06US Pay Range: $217,900 CAD - $217,900 CAD Pay Transparency Notice: The target annual salary for this position can range as detailed below. Full time offers from Coinbase also include bonus eligibility + equity eligibility + benefits (including medical, dental, and vision). About the Role We...
-
Sr. ML Engineer, AI Cloud
2 days ago
, ON, Canada Tenstorrent Full timeOverview Sr. ML Engineer, AI Cloud role at Tenstorrent. Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors....
-
Machine Learning Engineer
2 weeks ago
, , Canada SupplyGuard AI Full timeSupplyGuard AI leverages advanced machine learning to predict supply chain risks before they become crises. Our AI models analyze thousands of data sources including financial reports, news feeds, regulatory changes, and trade data to provide early warning systems for supply chain disruptions. We're building the most sophisticated supply chain intelligence...
-
Machine Learning Engineer, Risk AI/ML
2 weeks ago
, , Canada Coinbase Full timeAbout the role Coinbase is on a mission to increase economic freedom worldwide, building the emerging on‑chain platform and a future global financial system. The Machine Learning Engineer will join our Risk AI/ML team to develop sophisticated models that protect customers and build trust. Your work will directly prevent fraud, account takeovers and scams...
-
AI Compiler Engineer
7 hours ago
U.S., Canada, Germany, Norway EnCharge AI Full time US$120,000 - US$200,000 per yearEnCharge AI is a leader in advanced AI hardware and software systems for edge-to-cloud computing. EnCharge's robust and scalable next-generation in-memory computing technology provides orders-of-magnitude higher compute efficiency and density compared to today's best-in-class solutions. The high-performance architecture is coupled with seamless software...