AI SRE/Automation Engineer
3 weeks ago
We Looking For: An AI-driven Site Reliability Engineer who can turn ML insights into automated infrastructure actions. Role Purpose: To operationalize and automate the Right‑Sizing platform’s recommendations, ensuring safe, reliable, and scalable adoption of cost optimization measures across Client’s cloud and on‑prem environments. Key Responsibilities Translate ML/forecasting model outputs into actionable system changes. Implement automation to adjust Kubernetes deployment configs, VM allocations, and storage sizing. Develop rollback and alerting mechanisms for failed or unsafe right‑sizing actions. Build CI/CD pipelines to continuously integrate and deploy updated capacity recommendations. Ensure compliance with OCC and internal governance requirements when applying changes. Collaborate with Data/ML engineers to fine‑tune recommendations before production rollout. Document automation workflows and train application teams on adoption. Work with Data/ML engineers to set up train/test pipelines for ETL. Required Skills & Experience Strong background in Site Reliability Engineering and Infrastructure Automation . Proficiency in Python (automation scripting), Terraform/Ansible , and CI/CD pipelines (GitHub actions and Azure DevOps). Experience managing Kubernetes deployments , YAML configs, and scaling policies. Hands‑on with Azure infrastructure (VM sizing, managed disks, blob storage, SQL Managed Instances). Familiarity with observability and monitoring tools (ELK, Dynatrace, LogicMonitor, PagerDuty). Knowledge of governance and compliance for infrastructure changes. Strong troubleshooting skills and ability to work across multiple platform teams. Seniority Level Entry level Employment Type Full‑time Job Function Engineering and Information Technology Industry IT Services and IT Consulting #J-18808-Ljbffr
-
SRE-DevSecOps Engineer
4 weeks ago
, , Canada High Tech Genesis Full timeOverview Join to apply for the SRE-DevSecOps Engineer role at High Tech Genesis Allowed Staffing Countries: Canada, Costa Rica, Mexico or Brazil, (Remote) Term: Contract High Tech Genesis is seeking a 3-month contractor who can hit the ground running to support our SaaS platform on AWS. Responsibilities Kubernetes/EKS Operations – Manage, troubleshoot, and...
-
AI SRE Engineer
3 weeks ago
New Canada, NS Tata Consultancy Services Full timeInclusion without Exception: Tata Consultancy Services (TCS) is an equal opportunity employer, and embraces diversity in race, nationality, ethnicity, gender, age, physical ability, neurodiversity, and sexual orientation, to create a workforce that reflects the societies we operate in. Our continued commitment to Culture and Diversity is reflected in our...
-
Senior GCP DevOps Engineer
2 weeks ago
, BC, Canada SimplyAsk.ai Full timeSimplyAsk.ai is a growing and innovative software company focused on helping organizations modernize their digital services and deploy practical AI solutions. Headquartered in Vancouver, Canada with team hubs in BC and Ontario, SimplyAsk.ai has over 70 employees and more than a decade of experience delivering enterprise-grade software solutions to prominent...
-
AI Automation Engineer
1 week ago
, , Canada BioRender Full timeJoin to apply for the AI Automation Engineer role at BioRender At BioRender, we’re on a mission to accelerate the world’s ability to learn, discover, and communicate science — transforming how knowledge is shared and making science open, collaborative, and easily understandable by all. We’re shaping the future of science communication and are looking...
-
, , Canada Oscilar Full timeOverview Join to apply for the DevOps/Site Reliability Engineer (SRE) role at Oscilar . Get AI-powered advice on this job and more exclusive features. Shape the future of trust in the age of AI At Oscilar, we're building the most advanced AI Risk Decisioning Platform. Banks, fintechs, and digitally native organizations rely on us to manage their fraud,...
-
, , Canada GitLab Full timeSenior Site Reliability Engineer, Environment Automation Remote, Canada GitLab is an open core software company that develops the most comprehensive AI-powered DevSecOps Platform, used by more than 100,000 organizations. Our mission is to enable everyone to contribute to and co-create the software that powers our world. When everyone can contribute,...
-
Staff Platform Engineer
6 days ago
, , Canada Refinitiv Full time# **Our Privacy Statement & Cookie Policy**This posting is for proactive recruitment purposes and may be used to fill current openings or future vacancies within our organization.**Staff Platform Engineer- Materia AI**As a Platform Engineer, you will contribute to leading the design, development, and evolution of our next-generation cloud-native platforms....
-
SRE / DevOps Engineer
3 weeks ago
, , Canada Kraken Full timeJoin to apply for the SRE / DevOps Engineer role at Kraken . Kraken is a mission‑focused company rooted in crypto values. With over a decade of commitment to crypto ethics, Kraken invites you to accelerate global crypto adoption and champion financial freedom for all. As a fully remote company, Kraken has employees in 70+ countries spanning 50+ languages,...
-
Senior Manager, Site Reliability Engineering
2 days ago
Toronto, Canada (Hybrid) Tubi Full timeAbout Tubi:Boldly built for every fandom, Tubi is a free streaming service that entertains over 100 million monthly active users. Tubi offers the world's largest collection of Hollywood movies and TV shows, thousands of creator-led stories and hundreds of Tubi Originals made for the most passionate fans. Headquartered in San Francisco and founded in 2014,...
-
Staff Software Engineer, AI
1 week ago
, , Canada Mozilla Corporation Full timeStaff Software Engineer, AI & Automation Remote Canada Why Mozilla? Mozilla Corporation is the non‑profit‑backed technology company that has shaped the internet for the better over the last 25 years. We make pioneering brands like Firefox, the privacy‑minded web browser. Now, with more than 225 million people around the world using our products each...