Senior Site Reliability Engineer

3 days ago

Canada Clinia Full time

About Clinia

Clinia builds the search, data, and cloud infrastructure that digital health enterprises across North America rely on to deliver trusted, connected care experiences. As a ~40‑person post‑Series A scale‑up, we operate in a regulated healthcare environment where system reliability, security, and correctness are critical.

Senior Site Reliability Engineer (SRE)

We are hiring a Senior Site Reliability Engineer to strengthen the reliability, observability, and scalability of our production systems as the company grows. This is a senior, hands‑on role with real ownership.

You will operate production cloud infrastructure, participate in an on‑call rotation, and drive systemic improvements that reduce incidents, operational risk, and long‑term toil.

What you will do

Own production reliability through on‑call rotation, incident response, and post‑incident reviews that result in durable system improvements.
Design, build, and evolve cloud infrastructure using Terraform and infrastructure‑as‑code practices, primarily on AWS, with exposure to GCP and Azure.
Operate, scale, and improve Kubernetes platforms, including Amazon EKS, Bottlerocket, and Cilium/eBPF‑based networking.
Deploy and manage services using Helm and FluxCD, with a strong emphasis on GitOps workflows and automation.
Establish and maintain end‑to‑end observability across distributed systems using OpenTelemetry, Prometheus, and Grafana (Loki, Tempo, Mimir).
Partner closely with software engineering and product teams to embed reliability, operability, and failure‑mode thinking into system design.
Identify recurring operational issues and replace them with clear automation, platform improvements, or architectural changes.

What we’re looking for

Proven experience as a Site Reliability Engineer, DevOps Engineer, or Infrastructure Engineer supporting production systems at scale.
Hands‑on experience with on‑call rotations, incident management, and operating systems under real uptime and SLA expectations.
Strong experience managing AWS cloud environments using Terraform. Experience with GCP or Azure is a plus.
Deep understanding of Kubernetes internals and cluster operations, including Helm, GitOps tools such as Flux, and community operators (e.g., CNPG).
Solid foundations in Linux systems and TCP/IP networking, including security, compliance, and modern networking technologies such as eBPF and Cilium.
Working knowledge of modern monitoring and observability practices, including OpenTelemetry and Prometheus.
Clear, direct communication during incidents and disciplined follow‑through on remediation work.

Why you will love working here

Equity via our global ESOP – you share in what you build.
4 weeks vacation plus summer hours.
Group insurance from day one.
Remote‑friendly culture – you can work from anywhere.
24/7 online doctor access for you and your family.

Senior Site Reliability Engineer

3 days ago

Canada Thinkific Full time

Are you an experienced Site Reliability Engineer looking for a new challenge? We’re looking for a Senior Site Reliability Engineer to join us at Thinkific. We’re looking for a Senior Site Reliability Engineer...
Senior Site Reliability Engineer

2 days ago

Canada DuckDuckGo Full time

Who We Are

Hi, we're DuckDuckGo, the online protection company and remote-first team of 300+ on a mission to raise the standard of trust online. Founded in 2008 and profitable since 2014, our annual revenue now exceeds $100 million USD. Millions use our...
Senior Site Reliability Engineer

4 weeks ago

Canada Sage Recruiting Inc. Full time

This range is provided by Sage Recruiting Inc. Your actual pay will be based on your skills and experience; talk with your recruiter to learn more.

Base pay range: CA$180,000.00/yr - CA$200,000.00/yr

Senior Site Reliability Engineer (Founding Role)

Location: Canada

About the Role

This team is building a brand-new fintech platform from the ground up and is...
Senior Site Reliability Engineer

4 days ago

Canada TextNow Full time

This range is provided by TextNow. Your actual pay will be based on your skills and experience; talk with your recruiter to learn more.

Base pay range: CA$113,400.00/yr - CA$162,000.00/yr

We believe communication belongs to everyone. We exist to democratize phone service. TextNow is evolving the way the world connects and that's because we're made up of...
Senior Site Reliability Engineer

2 days ago

BC, Canada Orion Innovation Full time

Overview

Senior Site Reliability Engineer (SRE) with Kubernetes and Rancher. Full-time role focused on building and maintaining highly resilient, secure systems, including in air-gapped environments.

Responsibilities

System Architecture & Management: Design, architect, and maintain highly reliable, multi-tenant systems using Kubernetes and related tools...
Site Reliability Engineer

2 days ago

Toronto, Ontario, Canada Scotiabank Global Site Full time

Requisition ID: 245210

Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture.

The Team

Global Banking and Markets Engineering (GBME) is the fast-moving, award-winning technology engine that powers Scotiabank's Corporate, Investment Banking and Capital Markets businesses.

The Role

GBME is searching for a Site...
Site Reliability Engineer

2 days ago

Toronto, Ontario, Canada Scotiabank Global Site Full time

Requisition ID: 244027

Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture.

Overview

As a Site Reliability Engineer (SRE), you will join the Digital Engineering Operations team, responsible for ensuring the operations and reliability of Scotiabank digital applications. You will have the opportunity to drive...
Senior Site Reliability Engineer

3 days ago

Canada D-Wave Full time

D‑Wave (NYSE: QBTS) is a leader in the development and delivery of quantum computing systems, software, and services. We are the world’s first commercial supplier of quantum computers, and the only company building both annealing and gate‑model quantum computers. Our mission is...
Site Reliability Engineer

2 days ago

Canada Bitcomplete Full time

Join us as a Senior Site Reliability Engineer to help us run an industry-scale GPU cluster via Kubernetes. Together with senior members of our team, you will combine your strong understanding of system scaling and security practices with your cloud-native expertise to stand up and maintain Kubernetes clusters from scratch. Your role will also be pivotal in...
Senior Site Reliability Engineer

3 days ago

Canada Paxos Full time

About Paxos

Today’s financial infrastructure is archaic, expensive, inefficient and risky, supporting a system that leaves out more people than it lets in. So we’re rebuilding it. We’re on a mission to open the world’s financial system to everyone by enabling the instant movement of any asset, any time, in a trustworthy way. For over a decade,...
