Senior Site Reliability Engineer

21 hours ago


Canada Jobgether Full time
This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Senior Site Reliability Engineer in Canada. We are looking for an experienced Senior Site Reliability Engineer to help scale and secure a high-traffic, rapidly growing platform. In this role, you will be responsible for ensuring system reliability, performance, and availability as infrastructure demands continue to grow. You will work hands-on with cloud infrastructure, databases, and observability tooling while partnering closely with product and engineering teams. The role offers significant ownership over core infrastructure systems and the opportunity to build long-term, resilient foundations. This is an ideal position for someone who thrives in fast-moving environments and enjoys solving complex reliability challenges at scale. Accountabilities:
  • Act as a primary responder for system incidents and outages, ensuring high availability and fast recovery.
  • Own and continuously improve monitoring, alerting, and log management systems.
  • Manage, optimize, and scale database infrastructure including MySQL, PostgreSQL, ClickHouse, and Redis.
  • Maintain and enhance server infrastructure, deployment pipelines, and release processes.
  • Collaborate closely with engineering teams to design and operate scalable, resilient systems.
  • Build and maintain internal SRE tooling and automation to improve reliability and efficiency.
Requirements:3+ years of experience in Site Reliability Engineering, DevOps, or Infrastructure Engineering roles.Deep expertise with AWS and Kubernetes in production environments.Proven experience managing incident response and production outages.Hands-on experience with database operations, performance tuning, and optimization.Strong understanding of observability, monitoring, and logging best practices.Comfortable working in a fast-paced, high-growth environment with evolving priorities.Strong alignment with company values and a collaborative, ownership-driven mindset.Proficient in English, spoken and written, at CEFR Level C2 / ILR Level 5.Based in North or South America for timezone alignment.Bonus: experience with SOC 2 compliance, scaling platforms to 1M+ MAU, or working with ClickHouse. Benefits:Competitive salary: $130,000 – $140,000 USD per year, plus equity and annual compensation reviews.Fully remote work from anywhere.High autonomy and trust, with a strong focus on outcomes.Generous paid time off: 35 days annually, plus a paid sabbatical after 5 years.Comprehensive medical coverage for you and your family, or reimbursement options where applicable.Parental leave and family support benefits.Home office setup stipend.Learning and development budget for continuous growth.Annual bonus potential for eligible roles.Twice-yearly fully paid company retreats in international locations. Why Apply Through Jobgether? We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team. We appreciate your interest and wish you the best  Why Apply Through Jobgether? 
Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.

#LI-CL1 We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

  • , , Canada Thinkific Full time

    Join to apply for the Senior Site Reliability Engineer role at Thinkific Join to apply for the Senior Site Reliability Engineer role at Thinkific Are you an experienced Site Reliability Engineer looking for a new challenge? We’re looking for a Senior Site Reliability Engineer to join us at Thinkific. We’re looking for a Senior Site Reliability Engineer...


  • , , Canada DuckDuckGo Full time

    6 days ago Be among the first 25 applicants Get AI-powered advice on this job and more exclusive features. Who We AreHi, we're DuckDuckGo, the online protection company and remote-first team of 300+ on a mission to raise the standard of trust online. Founded in 2008 and profitable since 2014, our annual revenue now exceeds $100 million USD. Millions use our...


  • (s): Canada : Ontario : Toronto Scotiabank Global Site Full time

    Requisition ID: 245210Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture.The TeamGlobal Banking and Markets Engineering (GBME) is the fast-moving, award-winning technology engine that powers Scotiabank's Corporate, Investment Banking and Capital Markets businesses.The RoleGBME is searching for a Site...


  • , , Canada Sage Recruiting Inc. Full time

    This range is provided by Sage Recruiting Inc.. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Base pay range CA$180,000.00/yr - CA$200,000.00/yr Senior Site Reliability Engineer (Founding Role) Location: Canada About the Role This team is building a brand-new fintech platform from the ground up and is...


  • , , Canada TextNow Full time

    This range is provided by TextNow. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Base pay range CA$113,400.00/yr - CA$162,000.00/yr We believe communication belongs to everyone. We exist to democratize phone service. TextNow is evolving the way the world connects and that\'s because we\'re made up of...


  • (s): Canada : Ontario : Toronto Scotiabank Global Site Full time

    Requisition ID: 244026Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture.Overview: As a Site Reliability Engineer (SRE), you will join the Digital Engineering Operations team, responsible for ensuring the operations and reliability of Scotiabank digital applications. You will have the opportunity to drive...


  • (s): Canada : Ontario : Toronto Scotiabank Global Site Full time

    Requisition ID: 244027Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture.Overview: As a Site Reliability Engineer (SRE), you will join the Digital Engineering Operations team, responsible for ensuring the operations and reliability of Scotiabank digital applications. You will have the opportunity to drive...


  • , BC, Canada Orion Innovation Full time

    Overview Senior Site Reliability Engineer (SRE) with Kubernetes and Rancher. Full-time role focused on building and maintaining highly resilient, secure systems, including in air-gapped environments. Responsibilities System Architecture & Management: Design, architect, and maintain highly reliable, multi-tenant systems using Kubernetes and related tools...


  • , , Canada D-Wave Full time

    Join to apply for the Senior Site Reliability Engineer role at D‑Wave . D‑Wave (NYSE: QBTS) is a leader in the development and delivery of quantum computing systems, software, and services. We are the world’s first commercial supplier of quantum computers, and the only company building both annealing and gate‑model quantum computers. Our mission is...


  • , , Canada Bitcomplete Full time

    Join us as a Senior Site Reliability Engineer to help us run an industry-scale GPU cluster via Kubernetes. Together with senior members of our team, you will combine your strong understanding of system scaling and security practices with your cloud-native expertise to stand up and maintain Kubernetes clusters from scratch. Your role will also be pivotal in...