Site Reliability Engineer

3 weeks ago

Canada Bitcomplete Full time

Join us as an Intermediate Site Reliability Engineer helping build reliable, scalable cloudinfrastructure. You’ll work alongside senior engineers to own projects, deepen platform skills, and support teams operating large distributed systems. You’ll focus on one of three streams: Kubernetes, Observability, or Developer Experience . What you'll be doing Improve infrastructure reliability, scale, and security across cloud-native systems. Deliver features and upgrades through infrastructure-as-code. Collaborate with product teams on debugging, migrations, and operational readiness. Support incident response, capacity planning, and performance improvements. Automate repeatable workflows to reduce operational load across engineering. Stream Focus Areas You’ll help operate and evolve shared Kubernetes platforms used by many product teams. Typical work: Maintain and upgrade clusters, networking, ArgoCD, and IaC patterns. Build or extend reusable infra modules (XRDs, Helm, Terraform) to standardize onboarding. Partner with teams to plan and execute migrations safely Handle inbound maintenance, patching, and legacy stack stability work. Observability Platform You’ll help deliver a modern telemetry platform powering metrics, logs, and traces for engineering teams. Typical work: Build and operate OTEL-based telemetry pipelines across environments. Support migrations to VictoriaMetrics and maintain data accuracy during transitions. Improve SLOs, alerting strategies, and reliability of observability systems. Contribute to IaC automation for observability deployments. Ideal tools: OTEL, Prometheus, VictoriaMetrics, VM Alert, Grafana, Terraform, GitHub Actions. Developer Experience / CI/CD You’ll help maintain and strengthen the CI/CD ecosystem powering builds, tests, and deployments. Typical work: Maintain pipelines, update dependencies, and improve the reliability of GitHub Actions. Migrate workloads away from legacy tooling to a new Tailscale / OIDC-based platform. Triage support requests, follow runbooks, and assist product teams during migrations. Reduce operational load by standardizing patterns and supporting migrations. Ideal tools: GitHub Actions, Docker, Tailscale, Terraform, and container registry best practices. Your Background 3 - 5 years of experience as an SRE. Minimum 1+ years as a software engineer. Keen to deepen your software engineering skills and play a bigger role in how our systems are built and operated. Comfortable writing and debugging code in Go, Python, or a similar language. Curious about platform reliability, excited to learn deeper system internals over time. Communicate clearly with engineers across teams and time zones. Focus on automation, reproducibility, and practical reliability over “heroics.” Bring some experience in cloud infrastructure and want to grow into owning larger systems. About Us CAD $117,610 - $158,240 annually.Our ranges include base salary and conservative bonus target. Interested? We're excited about working with you, so get in touch Submit your application here . We believe people from diverse backgrounds, with different identities and experiences, make our company better. No matter your background, we'd love to hear from you Alignment with our values is just as important as experience. Also, please let us know if there are ways we can make our interview process better for you - we're always happy to listen and accommodate where possible. #J-18808-Ljbffr

Site Reliability Engineer

2 days ago

(s): Canada : Ontario : Toronto Scotiabank Global Site Full time

Requisition ID: 245210Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture.The TeamGlobal Banking and Markets Engineering (GBME) is the fast-moving, award-winning technology engine that powers Scotiabank's Corporate, Investment Banking and Capital Markets businesses.The RoleGBME is searching for a Site...
Site Reliability Engineer

1 day ago

(s): Canada : Ontario : Toronto Scotiabank Global Site Full time

Requisition ID: 244027Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture.Overview: As a Site Reliability Engineer (SRE), you will join the Digital Engineering Operations team, responsible for ensuring the operations and reliability of Scotiabank digital applications. You will have the opportunity to drive...
Site Reliability Engineer

1 day ago

(s): Canada : Ontario : Toronto Scotiabank Global Site Full time

Requisition ID: 244026Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture.Overview: As a Site Reliability Engineer (SRE), you will join the Digital Engineering Operations team, responsible for ensuring the operations and reliability of Scotiabank digital applications. You will have the opportunity to drive...
Site Reliability Engineer

1 day ago

(s): Canada : Ontario : Toronto Scotiabank Global Site Full time

Requisition ID: 247129Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture.As a SRE, you will implement, measure and gather insights from Operational Level Indicators identifying areas for service improvements covering availability, performance, resilience, incidents and chronic problems. You will implement...
Senior Site Reliability Engineer

27 minutes ago

, , Canada Thinkific Full time

Join to apply for the Senior Site Reliability Engineer role at Thinkific Join to apply for the Senior Site Reliability Engineer role at Thinkific Are you an experienced Site Reliability Engineer looking for a new challenge? We’re looking for a Senior Site Reliability Engineer to join us at Thinkific. We’re looking for a Senior Site Reliability Engineer...
Senior Site Reliability Engineer

29 minutes ago

, , Canada DuckDuckGo Full time

6 days ago Be among the first 25 applicants Get AI-powered advice on this job and more exclusive features. Who We AreHi, we're DuckDuckGo, the online protection company and remote-first team of 300+ on a mission to raise the standard of trust online. Founded in 2008 and profitable since 2014, our annual revenue now exceeds $100 million USD. Millions use our...
Senior Site Reliability Engineer

29 minutes ago

, , Canada TextNow Full time

This range is provided by TextNow. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Base pay range CA$113,400.00/yr - CA$162,000.00/yr We believe communication belongs to everyone. We exist to democratize phone service. TextNow is evolving the way the world connects and that\'s because we\'re made up of...
Site Reliability Engineer

3 weeks ago

, , Canada Dayforce Full time

Base pay range CA$67,700.00/yr - CA$120,900.00/yr Dayforce is a global human capital management (HCM) company headquartered in Toronto, Ontario, and Minneapolis, Minnesota, with operations across North America, Europe, Middle East, Africa (EMEA), and the Asia Pacific Japan (APJ) region. Our award‑winning Cloud HCM platform offers a unified solution...
Site Reliability Engineer

3 weeks ago

, , Canada mthree Recruiting Portal Full time

Market leading investment bank requires a Site Reliability Engineer join their Technology Operations Management department. The team is responsible to allow the Firm to manage its technology and data related risks. The department are entrusted with the responsibility of protecting the financial interests of millions world-wide, they are required to ensure...
Senior Site Reliability Engineer

3 weeks ago

, , Canada Sage Recruiting Inc. Full time

This range is provided by Sage Recruiting Inc.. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Base pay range CA$180,000.00/yr - CA$200,000.00/yr Senior Site Reliability Engineer (Founding Role) Location: Canada About the Role This team is building a brand-new fintech platform from the ground up and is...

Americas

Europe

Asia / Oceania

Africa

Site Reliability Engineer