AI SRE/Automation Engineer
7 days ago
What We're Looking For: An AI-driven Site Reliability Engineer who can turn ML insights into automated infrastructure actions.

Role Purpose: To operationalize and automate the Right‑Sizing platform’s recommendations, ensuring safe, reliable, and scalable adoption of cost optimization measures across Client’s cloud and on‑prem environments.

Key Responsibilities:
Translate ML/forecasting model outputs into actionable system changes.
Implement automation to adjust Kubernetes deployment configs, VM allocations, and storage sizing.
Develop rollback and alerting mechanisms for failed or unsafe right‑sizing actions.
Build CI/CD pipelines to continuously integrate and deploy updated capacity recommendations.
Ensure compliance with OCC and internal governance requirements when applying changes.
Collaborate with Data/ML engineers to fine‑tune recommendations before production rollout.
Document automation workflows and train application teams on adoption.
Work with Data/ML engineers to set up train/test pipelines for ETL.

Required Skills & Experience:
Strong background in Site Reliability Engineering and Infrastructure Automation.
Proficiency in Python (automation scripting), Terraform/Ansible, and CI/CD pipelines (GitHub Actions and Azure DevOps).
Experience managing Kubernetes deployments, YAML configs, and scaling policies.
Hands‑on experience with Azure infrastructure (VM sizing, managed disks, blob storage, SQL Managed Instances).
Familiarity with observability and monitoring tools (ELK, Dynatrace, LogicMonitor, PagerDuty).
Knowledge of governance and compliance for infrastructure changes.
Strong troubleshooting skills and ability to work across multiple platform teams.

Seniority Level: Entry level
Employment Type: Full‑time
Job Function: Engineering and Information Technology
Industry: IT Services and IT Consulting
-
SRE-DevSecOps Engineer
1 week ago
Canada | High Tech Genesis | Full time
Overview
Allowed Staffing Countries: Canada, Costa Rica, Mexico, or Brazil (Remote)
Term: Contract
High Tech Genesis is seeking a 3-month contractor who can hit the ground running to support our SaaS platform on AWS.
Responsibilities
Kubernetes/EKS Operations – Manage, troubleshoot, and...
-
SRE Engineer
5 days ago
Canada | mthree Recruiting Portal | Full time
A fantastic opportunity to work with one of the world's leading investment banks. As a Junior SRE, you would join a growing HashiVault squad as part of the strategy to offer more services and a better user experience to the bank's users. The current squad has been running for 3 years and is based in North America. You will be working to implement...
-
AI Automation Engineer
4 weeks ago
Canada | BioRender | Full time
At BioRender, we’re on a mission to accelerate the world’s ability to learn, discover, and communicate science, transforming how knowledge is shared and making science open, collaborative, and easily understandable by all. We’re shaping the future of science communication and are looking...
-
DevOps/Site Reliability Engineer (SRE)
Canada | Oscilar | Full time
Overview
Shape the future of trust in the age of AI. At Oscilar, we're building the most advanced AI Risk Decisioning Platform. Banks, fintechs, and digitally native organizations rely on us to manage their fraud,...
-
Senior SRE Engineer
2 days ago
Canada | Releady | Full time
Overview
This senior, client-facing observability role supports a major Washington-based airline. The engineer consults with the client’s internal engineering teams to identify systems, pain points, and reliability gaps, then designs and implements observability solutions: dashboards, metrics, SLIs/SLOs, alerting strategies, and visibility improvements....
-
AI & Automation Developer
1 week ago
Canada | Innovorg | Full time
At Innovorg, we’re building a workforce intelligence platform that transforms how skills, learning, and certifications are managed and developed, helping organizations operate with clarity, scale, and readiness. We work with leading Digital Infrastructure clients, are backed by experienced investors, and are growing deliberately. This role is an...
-
Senior Site Reliability Engineer, Environment Automation
Canada | GitLab | Full time
Remote, Canada
GitLab is an open core software company that develops the most comprehensive AI-powered DevSecOps Platform, used by more than 100,000 organizations. Our mission is to enable everyone to contribute to and co-create the software that powers our world. When everyone can contribute,...
-
SRE / DevOps Engineer
5 days ago
Canada | Kraken | Full time
Kraken is a mission‑focused company rooted in crypto values. With over a decade of commitment to crypto ethics, Kraken invites you to accelerate global crypto adoption and champion financial freedom for all. As a fully remote company, Kraken has employees in 70+ countries spanning 50+ languages,...
-
Staff Platform Engineer
4 weeks ago
Canada | Refinitiv | Full time
This posting is for proactive recruitment purposes and may be used to fill current openings or future vacancies within our organization.
Staff Platform Engineer - Materia AI
As a Platform Engineer, you will contribute to leading the design, development, and evolution of our next-generation cloud-native platforms....
-
AI Research Engineer
2 days ago
Canada | EnCharge AI | Full time
EnCharge AI is a leader in advanced AI hardware and software systems for edge-to-cloud computing. EnCharge’s robust and scalable next-generation in‑memory computing technology provides orders‑of‑magnitude higher compute efficiency and density compared to today’s best‑in‑class solutions. The high‑performance architecture is coupled with...