Site Reliability Engineer

2 days ago


Canada Accion Labs Full time

Site Reliability Engineer (OpenShift & Infrastructure)

Responsibilities & Skills

  • Install, configure, upgrade, and administer OpenShift clusters (OCP) in on-premise and cloud environments.
  • Manage OCP internal networking, ingress, egress, and cluster services.
  • Configure and integrate LDAP authentication and access management.
  • Implement TLS and MTLS encryption, and manage certificate lifecycle for secure communications.
  • Implement GitOps workflows using ArgoCD for continuous delivery and environment consistency.
  • Automate platform and application provisioning using Terraform and Ansible.
  • Configure and maintain F5 LTM load balancers.
  • Configure and manage DNS, networking, and subnets.
  • Build and manage monitoring, logging, and alerting frameworks (e.g., Prometheus, Grafana, ELK).
  • Define and enforce SLIs/SLOs and error budgets for services running on OCP.
  • Lead incident response, root cause analysis (RCA), and postmortems.
  • Build automation for self‑healing, scaling, and zero-touch operations.
  • Ensure high availability, disaster recovery, and failover strategies are implemented.
  • Secure platform and workloads following enterprise security standards.
  • Support application deployments and CI/CD pipelines on OpenShift.
  • Troubleshoot networking, cluster, and deployment issues end-to-end.
  • Apply SRE best practices to improve reliability, scalability, and performance.
  • Collaborate with development and platform teams to optimize system operations.

Seniority level: Mid‑Senior level
Employment type: Contract
Job function: Information Technology



  • Canada : Ontario : Toronto Scotiabank Global Site Full time $105,000 - $170,000 per year

    Requisition ID: 244026. Join a purpose-driven winning team, committed to results, in an inclusive and high-performing culture. Overview: As a Site Reliability Engineer (SRE), you will join the Digital Engineering Operations team, responsible for ensuring the operations and reliability of Scotiabank digital applications. You will have the opportunity to drive...


  • Canada : Ontario : Toronto Scotiabank Global Site Full time US$80,000 - US$140,000 per year

    Requisition ID: 244027. Join a purpose-driven winning team, committed to results, in an inclusive and high-performing culture. Overview: As a Site Reliability Engineer (SRE), you will join the Digital Engineering Operations team, responsible for ensuring the operations and reliability of Scotiabank digital applications. You will have the opportunity to drive...


  • Canada Orion Innovation Full time

    Senior Site Reliability Engineer (SRE) with Kubernetes & Rancher Location: Canada - Remote (Working EST hours) Job Type: Full-time About the Role Are you an exceptional Site Reliability Engineer with a passion for building and maintaining highly resilient and secure systems? We are seeking a Senior SRE to join our team and play a critical role in managing...


  • Canada Thinkific Full time

    Are you an experienced Site Reliability Engineer looking for a new challenge? We’re looking for a Senior Site Reliability Engineer to join us at Thinkific. We’re looking for a Senior Site Reliability Engineer...


  • Canada : Ontario : Toronto Scotiabank Global Site Full time $120,000 - $180,000 per year

    Requisition ID: 239640. Join a purpose-driven winning team, committed to results, in an inclusive and high-performing culture. The Role: As a member of the Systems Reliability Engineering team, the System Reliability Engineer will collaborate closely with Engineering and development teams, peers, and business partners to continuously improve the stability,...


  • Canada Icon Full time

    Helping SaaS companies scale Engineering teams. Director, Site Reliability Engineering We are seeking an accomplished Director of Site Reliability Engineering (SRE) to lead the reliability, scalability, and performance initiatives across multiple enterprise technology domains, including AML, Risk, Finance, Corporate Treasury, and Human Resources systems....


  • Canada Orion Innovation Full time

    Job Description: Senior Site Reliability Engineer (SRE) with Kubernetes & Rancher Location: Canada - Remote (Working EST hours) Job Type: Full-time About the Role Are you an exceptional Site Reliability Engineer with a passion for building and maintaining highly resilient and secure systems? We are seeking a Senior SRE to join our team and play a critical...


  • Canada Akamai Technologies Full time

    Senior Site Reliability Engineer Join Akamai Technologies as we build a reliable, secure, and scalable Internet. We are looking for a Senior Site Reliability Engineer to help us solve complex performance and reliability challenges. Job Description Are you passionate about cutting‑edge technology and ready to tackle some of the Internet’s most difficult...


  • Canada Targeted Talent Full time

    Overview: We are looking for an experienced Senior Site Reliability Engineer for our client. This is a permanent position that is remote to start, with later relocation to Calgary or Winnipeg. Our client is a global enterprise company with a product that you've likely used. Experience with coding/software development, along with Site Reliability will be the...


  • Canada DuckDuckGo Full time

    6 days ago. Who We Are: Hi, we're DuckDuckGo, the online protection company and remote-first team of 300+ on a mission to raise the standard of trust online. Founded in 2008 and profitable since 2014, our annual revenue now exceeds $100 million USD. Millions use our...