Site Reliability Engineer

4 weeks ago


Toronto, Ontario, Canada SGS Full time
Job Description

The Site Reliability Engineer will play a critical role in ensuring the reliability, supportability, scalability, and performance of our .NET stack applications built with MVC, Angular, and Web API.

  • Partner with developers and product operations teams to understand application requirements and translate them into operational practices.
  • Design, implement, and maintain infrastructure automation tools using Infrastructure as Code (IaC) methodologies.
  • Monitor application health and performance metrics, proactively identifying and resolving potential issues.
  • Implement incident response procedures to ensure timely resolution of outages and service disruptions.
  • Establish and improve best practices for product solution design / architecture, and development.
  • Participate in peer and team code reviews by developing comprehensive coding standards and guidelines to ensure consistency, maintainability, and quality in software development.
  • Collaborate with engineers to develop and implement disaster recovery plans.
  • Continuously improve monitoring and alerting processes to ensure efficient problem identification and resolution.
  • Stay up-to-date on the latest advancements in .NET infrastructure and SRE best practices.

Qualifications

  • Bachelor degree required
  • Minimum 3+ years of experience in a related technical role (Systems Administrator, Network Engineer) required
  • Experience with configuration management tools like Ansible, Puppet, or Chef preferred
  • Azure experience required
  • Familiarity with monitoring and alerting tools (.NET performance counters, Azure App Insight, Prometheus, Grafana) is a plus preferred
  • Ability to manage and coordinate multiple projects in a fast-paced, highly professional environment.
  • Strong understanding of system administration principles, including operating systems (Windows Server preferred) and networking concepts.

At SGS, we are seeking a highly skilled Site Reliability Engineer to join our team. The ideal candidate will have a strong background in .NET infrastructure and SRE best practices, with experience in configuration management tools and Azure.

The successful candidate will be responsible for ensuring the reliability, supportability, scalability, and performance of our .NET stack applications. This will involve partnering with developers and product operations teams to understand application requirements and translate them into operational practices.

The Site Reliability Engineer will also be responsible for designing, implementing, and maintaining infrastructure automation tools using Infrastructure as Code (IaC) methodologies. Additionally, they will monitor application health and performance metrics, proactively identifying and resolving potential issues.

Other key responsibilities will include implementing incident response procedures, establishing and improving best practices for product solution design / architecture, and development, and participating in peer and team code reviews.

The ideal candidate will have a bachelor's degree and at least 3+ years of experience in a related technical role. They will also have experience with configuration management tools like Ansible, Puppet, or Chef, and Azure experience is required.

We are looking for a highly skilled and experienced Site Reliability Engineer to join our team at SGS. If you have a strong background in .NET infrastructure and SRE best practices, and experience in configuration management tools and Azure, we encourage you to apply.



  • Toronto, Ontario, Canada The Toronto-Dominion Bank (Canada) Full time

    Job SummaryWe are seeking a highly skilled Site Reliability Engineer to join our team at The Toronto-Dominion Bank (Canada). As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our systems and applications.Key ResponsibilitiesProvide technical leadership and expertise in designing and...


  • Toronto, Ontario, Canada SGS Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at SGS Canada. As a key member of our infrastructure team, you will play a critical role in ensuring the reliability, supportability, scalability, and performance of our .NET stack applications.Key Responsibilities:Partner with developers and...


  • Toronto, Ontario, Canada Northbridge Financial Corporation Full time

    Senior Site Reliability EngineerThe Senior Site Reliability Engineer plays a crucial role in ensuring the reliability and efficiency of our systems. This position oversees the creation and implementation of Service Level Objectives (SLOs) and handles service reliability solutions and processes of increasing complexity.Key Responsibilities:Interface with...


  • Toronto, Ontario, Canada The Home Depot Canada Full time

    Unlock Your Potential at The Home Depot CanadaAs a Site Reliability Engineering Manager, you will lead a team of Site Reliability Engineers to ensure the reliability, performance, and operational support of our eCommerce systems, with a focus on Google Cloud Platform (GCP) environments.Key Responsibilities:Lead and mentor a team of Site Reliability Engineers...


  • Toronto, Ontario, Canada Criteo Full time

    About the Role:We are seeking a skilled Senior Site Reliability Engineer to join our team at Criteo. As a key member of our Product Reliability Engineering group, you will work closely with product engineering to improve the reliability of our apps, systems, and pipelines.Your Responsibilities:Collaborate with product engineering to identify and prioritize...


  • Toronto, Ontario, Canada Criteo Full time

    About the Role:We are seeking a skilled Site Reliability Engineer to join our team at Criteo. As a Site Reliability Engineer, you will work closely with product engineering to improve the reliability of our apps, systems, and pipelines.Key Responsibilities:Collaborate with product engineering to design, develop, and deploy scalable and reliable systems.Work...


  • Toronto, Ontario, Canada Lyons Consulting Group Full time

    Job SummaryLyons Consulting Group is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our infrastructure and applications.Key ResponsibilitiesProvide hands-on SRE support, including incident management, problem management, root cause...


  • Toronto, Ontario, Canada Vantage Full time

    Senior Site Reliability EngineerWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Vantage. As a key member of our engineering team, you will play a pivotal role in ensuring the seamless operation of our large-scale, distributed systems.Key ResponsibilitiesCollaborate with software engineers to drive project success and...


  • Toronto, Ontario, Canada Royal Bank of Canada> Full time

    Job SummaryJob DescriptionWhat is the Opportunity?Royal Bank of Canada is seeking a highly skilled Site Reliability Engineer to join our team. As a key member of our Site Reliability Engineering team, you will be responsible for designing, building, and managing complex platforms to support business processes, reduce toil, and develop new technology...


  • Old Toronto, Ontario, Canada Thomson Reuters Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineering Specialist to join our team at Thomson Reuters. As a Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining scalable systems and services that meet the needs of our customers.Key ResponsibilitiesDesign and implement scalable systems and...


  • Toronto, Ontario, Canada Compunnel Inc. Full time

    Compunnel Inc. is a leading provider of innovative technology solutions.We are seeking an experienced Site Reliability Engineering Lead to join our team in Toronto, Canada.The estimated salary for this position is $170,000 per year, considering the location and industry standards.About the JobThis role is perfect for someone who is passionate about driving...


  • Toronto, Ontario, Canada Vantage Full time

    Senior Site Reliability EngineerWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Vantage. As a key member of our engineering team, you will play a pivotal role in ensuring the seamless operation of our large-scale, distributed systems.Key Responsibilities:Collaborate with software engineers to drive project forward through...


  • Toronto, Ontario, Canada Behavox Full time

    About the RoleAt Behavox, we're building a scalable and fault-tolerant platform to manage and analyze massive volumes of data. Our platform is designed to handle millions of data items, allowing our clients to search, filter, and visualize relationships between entities in the system.As a Site Reliability Engineer, you'll be responsible for ensuring the...


  • Toronto, Ontario, Canada State Street Full time

    At State Street, we are seeking a Cloud Platform/Site Reliability Engineer to join our team.Key Responsibilities:Design and implement scalable cloud infrastructure solutions.Ensure high availability and reliability of cloud-based systems.Collaborate with cross-functional teams to drive cloud adoption and innovation.Requirements:Strong background in cloud...


  • Toronto, Ontario, Canada KPMG Canada Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at KPMG Canada. As a key member of our Operations team, you will play a critical role in ensuring the smooth operation of our Managed Service.Key ResponsibilitiesDesign and implement scalable and reliable cloud infrastructure solutionsCollaborate with cross-functional...


  • Old Toronto, Ontario, Canada Thomson Reuters Full time

    Site Reliability Engineer (Contract)Contract (5 months 29 days)Closed OpportunityThomson Reuters is seeking a skilled Site Reliability Engineer to join our Service Management Organization.The ideal candidate will have experience in IT Service Management, working with cloud providers, software development, and technology infrastructure.As a Site Reliability...


  • Toronto, Ontario, Canada Northbridge Financial Corporation Full time

    Site Reliability Engineer Role OverviewThe Senior Site Reliability Engineer at Northbridge Financial Corporation is responsible for overseeing the creation and implementation of Service Level Objectives (SLOs). This role involves handling service reliability solutions and processes of increasing complexity, as well as mentoring and leading less experienced...


  • Toronto, Ontario, Canada Northbridge Financial Corporation Full time

    Site Reliability Engineer Role OverviewThe Senior Site Reliability Engineer at Northbridge Financial Corporation is responsible for overseeing the creation and implementation of Service Level Objectives (SLOs). This role involves handling service reliability solutions and processes of increasing complexity, as well as mentoring and leading less experienced...


  • Toronto, Ontario, Canada Index Exchange Full time

    About the Role:We are seeking a highly skilled Staff Site Reliability Engineer to own and develop on-premise and hybrid cloud environments, focusing on low-latency performance on Kubernetes platforms supporting a robust developer experience framework.The ideal candidate will have a deep technical understanding of on-premise and hybrid cloud architectures and...


  • Toronto, Ontario, Canada Broadridge Full time

    Broadridge: A Culture of EmpowermentWe're a company that believes in empowering others to achieve more. If you're passionate about developing your career while helping others grow, come be a part of our team at Broadridge.About the RoleWe're seeking a Site Reliability Engineer Lead to join our team. As a key member of our engineering team, you'll be...