Staff Site Reliability Engineer

8 hours ago


Toronto, Ontario, Canada Confluent Full time

We're not just building better tech. We're rewriting how data moves and what the world can do with it. With Confluent, data doesn't sit still. Our platform puts information in motion, streaming in near real-time so companies can react faster, build smarter, and deliver experiences as dynamic as the world around them.

It takes a certain kind of person to join this team. Those who ask hard questions, give honest feedback, and show up for each other. No egos, no solo acts. Just smart, curious humans pushing toward something bigger, together.

One Confluent. One Team. One Data Streaming Platform.

About The Role
Confluent Cloud processes millions of events per second across AWS, GCP, and Azure. When incidents happen in a multi-cloud streaming platform, they happen at scale: data in motion, exactly-once semantics, and cascading failure modes that require deep systems thinking. We need an expert-level engineer who can drive proactive reliability improvements that prevent these incidents before they occur.

This role combines hands-on technical work with strategic program ownership. You'll spend roughly 75% of your time on engineering: building automation, improving tooling, analyzing systemic failure patterns, and designing reliability improvements. The remaining 25% is teaching and coordination: coaching teams through post-mortems, training incident commanders, and evolving our incident response practices.

You'll be part of a global team with follow-the-sun coverage, with clean handoffs that keep everyone working sustainable hours. This role sits within Cloud Architecture and Reliability - Supportability, a horizontal team that owns reliability standards and tooling across engineering. You're the person who makes us need incident management less.

What You Will Do

  • Analyze systemic failure patterns and design reliability improvements that prevent incident recurrence
  • Own Rootly configuration, workflows, and integrations with PagerDuty, Jira, Confluence, and Slack
  • Define and maintain SLO/SLA frameworks; use error budgets to guide reliability investments
  • Own standards, practices, and continuous improvement of incident response across engineering
  • Edit and review customer-facing incident documents (CRCAs) to ensure quality and clarity
  • Develop and deliver training programs; coach teams through post-mortems
  • Partner with engineering leaders to elevate reliability practices org-wide
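
For candidates less familiar with the error-budget idea mentioned above, here is a minimal sketch of how an SLO target translates into a budget that can guide reliability investments. It is illustrative only: the `ErrorBudget` class, the `recommend` helper, and the thresholds are assumptions made for this example, not Confluent's actual framework or tooling.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ErrorBudget:
    """Request-based error budget for one SLO window (hypothetical model)."""
    slo_target: float      # e.g. 0.999 means 99.9% of requests must succeed
    total_requests: int    # requests observed in the window
    failed_requests: int   # requests that violated the SLO

    @property
    def allowed_failures(self) -> float:
        # The budget is the share of requests the SLO permits to fail.
        return (1.0 - self.slo_target) * self.total_requests

    @property
    def budget_remaining(self) -> float:
        """Fraction of the budget left; negative once it is overspent."""
        allowed = self.allowed_failures
        if allowed == 0:
            # A 100% target (or an empty window) leaves no budget to spend.
            return 0.0 if self.failed_requests == 0 else float("-inf")
        return 1.0 - self.failed_requests / allowed


def recommend(budget: ErrorBudget, caution_below: float = 0.25) -> str:
    """Map remaining budget to a coarse action (thresholds are assumptions)."""
    remaining = budget.budget_remaining
    if remaining <= 0.0:
        return "prioritize-reliability"
    if remaining < caution_below:
        return "caution"
    return "ship-features"


if __name__ == "__main__":
    window = ErrorBudget(slo_target=0.999, total_requests=1_000_000, failed_requests=400)
    print(f"remaining budget: {window.budget_remaining:.1%}")
    print(f"recommendation: {recommend(window)}")
```

In this sketch, a team that has overspent its budget is steered toward reliability work, while a team with plenty of budget left can keep shipping; real frameworks typically add burn-rate alerts and multiple windows on top of this basic arithmetic.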

What You Will Bring

  • 10+ years of relevant experience in SRE, incident management, or reliability engineering
  • Cloud experience with at least one of AWS, GCP, or Azure (we run all three)
  • Experience navigating reliability/incident programs at organizations with 500+ engineers
  • Deep expertise with incident management tooling (Rootly, PagerDuty, or similar)
  • Strong understanding of distributed systems and failure modes at scale
  • Deep experience with observability: metrics, logging, tracing
  • Kubernetes and container orchestration experience
  • Understanding of CI/CD pipelines and release processes
  • Strong written communication (design docs, runbooks, post-mortems)
  • Experience driving org-wide process and cultural changes
  • Kafka/event streaming expertise preferred, or demonstrated rapid mastery of complex systems

Ready to build what's next? Let's get in motion.

Come As You Are
Belonging isn't a perk here. It's the baseline. We work across time zones and backgrounds, knowing the best ideas come from different perspectives. And we make space for everyone to lead, grow, and challenge what's possible.

We're proud to be an equal opportunity workplace. Employment decisions are based on job-related criteria, without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, veteran status, or any other classification protected by law.

Compensation Range: CA$225.1K - CA$264.5K