Technical Lead for Site Reliability

1 week ago


Old Toronto, Canada Sentry Full time

About Sentry

Sentry is a leading provider of performance and error monitoring tools that help companies build high-quality software faster. With a strong track record of innovation and customer satisfaction, we are committed to empowering developers to write better code.

We are seeking a highly skilled Technical Lead for our Site Reliability team. As a key member of our engineering organization, you will lead a team of talented engineers in ensuring the resilience and scalability of our products. Your expertise in distributed systems, cloud computing, and system administration will be invaluable in driving our technical vision forward.

As a Technical Lead, you will work closely with your team to set direction, anticipate challenges, and develop achievable milestones. You will also collaborate with other engineering functions to build valuable features that contribute towards achieving our business goals. Your ability to inspire and cultivate a strong team identity will be critical in fostering a healthy and collaborative culture that embodies our values.

This role presents a unique opportunity to shape the future of our company's technical capabilities. If you are passionate about mentoring, process development, and continuous improvement, we encourage you to apply.

In this role, you will:
  • Grow and develop a team of talented SREs
  • Set direction for the team, anticipating strategic and scaling-related challenges
  • Participate in quarterly planning sessions to help the team develop achievable milestones
  • Communicate deliverable outcomes to engineering, product, and design teams
  • Contribute to Sentry's cloud strategy
  • Foster a healthy and collaborative culture that embodies our values
  • Be part of the escalation path for our on-call process, possibly being on-call when needed

You'll love this job if you:

  • Enjoy mentoring and helping other engineers grow
  • Take on challenges that push you out of your comfort zone daily
  • Like developing processes to reduce toil and improve efficiency
  • Get excited about converting learnings from incidents into actions that make engineering better
  • Enjoy working in a team of SREs who are passionate about constantly improving how we operate

Requirements:

  • 5+ years of industry experience in software engineering
  • Ideally 2+ years of people management experience
  • Experience working with distributed systems in Cloud environments (AWS, GCP, or Azure)
  • Experience with tools that manage systems, including Terraform, Kubernetes, Salt, and Envoy
  • Good written and spoken English communication skills
  • Bonus points if you have experience working with globally distributed teams
  • Live in the Toronto, Canada area or be willing to relocate

The estimated salary for this position is $195,000 - $215,000 per year, depending on location and experience.



  • Old Toronto, Canada RBC Full time

    b>RBC is seeking a Lead SRE for our US Cash Management Technology. This is a brand-new system to serve our corporate clients. You will be heavily involved in shaping the future technology landscape of RBC, by delivering key business values for a transformational project in our Banking Technology while implementing strategic components servicing across all...


  • Old Toronto, Canada TD Full time

    Job OverviewWe are seeking a highly skilled Site Reliability Engineering Lead to join our team at TD. As a key member of our technology group, you will be responsible for ensuring the stability, scalability, and reliability of our platforms.About the RoleThe ideal candidate will have a minimum of 8 years of experience in site reliability engineering, with a...


  • Old Toronto, Canada Infotree Global Solutions Full time

    About Infotree Global SolutionsInfotree Global Solutions is a leading provider of innovative solutions, and we're seeking an experienced Site Reliability Engineer to lead our team.Your RoleAs our Site Reliability Engineering Lead, you will be responsible for supervising a team of skilled engineers and ensuring the reliability and scalability of our global...


  • Toronto, Ontario, Canada Compunnel Inc. Full time

    Compunnel Inc. is a leading provider of innovative technology solutions.We are seeking an experienced Site Reliability Engineering Lead to join our team in Toronto, Canada.The estimated salary for this position is $170,000 per year, considering the location and industry standards.About the JobThis role is perfect for someone who is passionate about driving...


  • Toronto, Canada CDW Full time

    **Supervisor, Site Reliability** Office Location: Downtown Toronto **This Is You** Bring your IT career and talents to CDW where you can have a greater impact, be inspired by our mission and excited about your job and future. The #1 name in Canada for IT services and solutions, we are an innovative Fortune 200 leader driving meaningful technological change...


  • Old Toronto, Canada Tecsys Inc. Full time

    p>Having recognized the advantages of remote work, including employee morale, productivity, reduced commuting on employee wellbeing and the environment, we are proud to be a digital-first company. The technologies and programs in which we invested have provided a fantastic foundation to this end. Our digital-first work environment, together with our...


  • Old Toronto, Canada Lorien Full time

    p>Hybrid - ManchesterWe are currently working with a leading gambling company dedicated to providing exceptional gaming experiences. They are looking for an experienced Site Reliability Engineer with a strong skill set in system reliability to join its world-class technology team. This role is ideal for someone who has 4+ years of experience within the...


  • Old Toronto, Canada Tecsys Full time

    Tecsys is a fast-growing innovator offering supply chain solutions to industry-leading healthcare systems, hospitals, and pharmacy businesses to distributors, retailers, and 3PLs. As a Cloud Infrastructure Specialist, you will be responsible for ensuring the reliability and uptime of our platform and applications in a data-driven way to support internal and...


  • Old Toronto, Canada Tecsys Full time

    p>Having recognized the advantages of remote work, including employee morale, productivity, reduced commuting on employee wellbeing and the environment, we are proud to be a digital-first company. The technologies and programs in which we invested have provided a fantastic foundation to this end. Our digital-first work environment, together with our...


  • Old Toronto, Canada Sentry Full time

    p>The Site Reliability Engineering team is responsible for the deployment, configuration, maintenance, and monitoring of Sentry's hosted platform. We do this by leveraging automation tools to automatically spin up and scale services to meet the traffic demands of 1,000,000+ developers. Sentry receives over a billion events a day and processes terabytes of...


  • Toronto, Ontario, Canada Sentry Full time

    About SentryWe're on a mission to help developers write better software faster, so we can get back to enjoying technology. With more than $217 million in funding and 100,000+ organizations that believe we're on to something, we're building performance and error monitoring tools that help companies like Disney, Microsoft, and Atlassian spend less time fixing...


  • Toronto, Ontario, Canada Royal Bank of Canada Full time

    Royal Bank of Canada is seeking a highly skilled Site Reliability Engineering (SRE) leader to join our team in Toronto, Canada. As an SRE leader, you will be responsible for leading the development and implementation of SRE solutions that improve the reliability and performance of our applications.The ideal candidate will have 5+ years of experience as a...


  • Old Toronto, Canada Thomson Reuters Full time

    h3>(Canada) Site Reliability Engineer (Contract)Contract (9 months 4 days)Published 3 days agoNew RelicData DogSite Reliability Engineer - in the Service Management OrganizationDo you have experience in IT Service Management, working with cloud providers, software development, and technology infrastructure?The Site Reliability Engineer will analyze chronic...


  • Old Toronto, Canada Olx Full time

    p>Site Reliability EngineerRemote Poland, PolandOLX – Engineering / Full-time / Remote At OLX, we work together to build a more sustainable world through trade. We make it safe, smart, and convenient to buy and sell cars, find housing, get jobs, buy and sell household goods, and more. Our colleagues around the world help to serve millions of people around...


  • Old Toronto, Canada Thomson Reuters Full time

    h3>(Canada) Site Reliability Engineer (Contract)Contract (5 months 29 days)Published 8 months agoCLOSEDGCPSite Reliability Engineer - in the Service Management OrganizationDo you have experience in IT Service Management, working with cloud providers, software development, and technology infrastructure?The Site Reliability Engineer will analyze chronic and...


  • Old Toronto, Canada Lyons Consulting Group Full time

    p>Provide hands-on SRE with 24x7 SRE support, including incident management, problem management, root cause analysis, monitoring, alerting, and maintenance of infrastructure compliance.Track, audit, monitor and implement on technical work streams.Act as portfolio SME (Domain Expert) – understand & document common components, core functionalities,...


  • Toronto, Canada Northbridge Financial Corporation Full time

    What is it like to be a Senior Site Reliability Engineer at Northbridge Financial The Senior Site Reliability Engineer oversees the creation and implementation of Service Level Objectives (SLOs). The Senior SRE handles service reliability solutions and processes of increasing complexity, and are responsible for mentoring and leading less experienced...


  • Old Toronto, Canada Sentry Full time

    Bad software is everywhere, and we’re tired of it. Sentry is on a mission to help developers write better software faster, so we can get back to enjoying technology.With more than $217 million in funding and 100,000+ organizations that believe we’re on to something, we're building performance and error monitoring tools that help companies like Disney,...


  • Toronto, Ontario, Canada Royal Bank of Canada Full time

    Job SummaryRoyal Bank of Canada is seeking an experienced professional to lead our Site Reliability Engineering (SRE) efforts for our US Cash Management Technology. This is a unique opportunity to shape the future technology landscape of the company, delivering key business values and implementing strategic components across all RBC functions defined in our...


  • Old Toronto, Canada Loblaw Companies Ltd - Head Office Full time

    Cloud Engineering OpportunityWe are seeking an experienced Site Reliability Engineer to join our team at Loblaw Companies Ltd - Head Office. This role offers a unique opportunity to design, develop, and maintain cloud native solutions using services like Kubernetes, AppEngine, Cloud Functions, CloudSql, BigQuery, Pub/Sub on Google Cloud Platform and...