Reliability Engineering Specialist

1 week ago


Toronto, Ontario, Canada SGS Full time

**Job Title:** Reliability Engineering Specialist

At SGS, we are seeking a skilled Reliability Engineering Specialist to join our team. This role plays a critical part in ensuring the reliability, supportability, scalability, and performance of our .NET stack applications built with MVC, Angular, and Web API.

As a key member of our team, you will partner with developers and product operations teams to understand application requirements and translate them into operational practices. You will design, implement, and maintain infrastructure automation tools using Infrastructure as Code (IaC) methodologies, monitor application health and performance metrics, and proactively identify and resolve potential issues.

Your responsibilities will include implementing incident response procedures to ensure timely resolution of outages and service disruptions, establishing and improving best practices for product solution design, architecture, and development, and supporting peer and team code reviews by developing comprehensive coding standards and guidelines.

Additionally, you will collaborate with engineers to develop and implement disaster recovery plans, continuously improve monitoring and alerting processes to ensure efficient problem identification and resolution, and stay up-to-date on the latest advancements in .NET infrastructure and SRE best practices.

We offer an estimated salary of $85,000 - $110,000 per year, depending on experience, based on national averages and industry standards. We also provide competitive benefits, including medical, dental, and vision insurance, 401(k) matching, paid time off, and opportunities for professional growth and development.



  • Toronto, Ontario, Canada The Engineering Institute of Canada Full time

    Job Summary: As a Senior Technical Specialist, Equipment Reliability, you will play a key role in developing and maintaining a deep technical understanding of our insured's businesses to enable world-class insurance engineering services. Your expertise in rotating equipment, specifically prime movers for power generation, will be highly...


  • Old Toronto, Ontario, Canada Thomson Reuters Full time

    About the Role: We are seeking a highly skilled Site Reliability Engineering Specialist to join our team at Thomson Reuters. As a Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining scalable systems and services that meet the needs of our customers. Key Responsibilities: Design and implement scalable systems and...


  • Toronto, Ontario, Canada The Engineering Institute of Canada Full time

    Job Summary: As a Senior Technical Specialist, Equipment Reliability at The Engineering Institute of Canada, you will play a critical role in developing and maintaining a deep technical understanding of our insured's businesses to enable world-class insurance engineering services. Your expertise in rotating equipment, specifically prime movers for power...


  • Toronto, Ontario, Canada Criteo Full time

    About the Role: This is a challenging opportunity for an experienced engineer to join Criteo's PRE team as a Site Reliability Engineer. The role involves working closely with product engineering to improve the reliability of our apps, systems, and pipelines, assessing where optimization is needed most, and telling stories with meaningful monitoring. Key...


  • Toronto, Ontario, Canada Riverside Natural Foods Full time

    Company Overview: Riverside Natural Foods is a forward-thinking company that prioritizes innovation, sustainability, and employee well-being. Our mission is to create delicious, nutritious snacks that are good for our customers, the planet, and our employees. Salary and Benefits: We offer a competitive salary range of $55,000 - $65,000 per year, depending on...


  • Toronto, Ontario, Canada Lorven Technologies Full time

    Job Title: Reliability Systems Specialist. Location: Remote. Duration: Long term. A Bachelor's degree in Computer Science or related technical field, or equivalent practical experience. Advanced knowledge of reliability engineering practices and technologies. Hands-on experience in reliability tools (Ansible, Azure Automation, Catchpoint). Azure, Linux. Dynatrace,...

  • Reliability Engineer

    2 weeks ago


    Toronto, Ontario, Canada Scotiabank Full time

    Role Overview: This role is responsible for ensuring the stability and reliability of applications and services across a portfolio. The ideal candidate will have experience in IT Service Delivery and IT Service Management, with a strong understanding of SRE and service management principles. Job Description: The System Reliability Engineer will work closely with...


  • Toronto, Ontario, Canada Metrolinx Full time

    Job Title: Senior Reliability Engineer. Job Summary: Metrolinx is a leading transportation agency in the Greater Golden Horseshoe region, operating GO Transit, UP Express, and the PRESTO fare payment system. We are committed to providing reliable and efficient transportation services to our customers. As a Senior Reliability Engineer, you will play a critical...


  • Toronto, Ontario, Canada Randstad Canada Full time

    Job Summary: Are you a technical specialist looking for a challenging contract role with opportunities for growth and development? This position might be a good fit for you. As a Hardware Design Reliability Specialist, you will be responsible for the hardware reliability activities regarding the hardware products within our company's perimeter. Key...


  • Old Toronto, Ontario, Canada Emburse Full time

    Job Summary: As a Site Reliability Engineer - Automation Specialist at Emburse, you will be responsible for developing software and software fixes to integrate internal systems. You will ensure code quality, test and distribute code updates, and monitor the health and stability of the servers. Key Responsibilities: Meet and beat Key Performance Indicators, SLAs,...


  • Toronto, Ontario, Canada Estée Lauder Companies Full time

    Reliability Engineering Manager Role: We are seeking a highly skilled Reliability Engineering Manager to join our team at Estée Lauder Companies. As a key member of the Plant Management Team, you will be responsible for leading maintenance and reliability processes to achieve operational excellence. The ideal candidate will have a strong background in plant...


  • Toronto, Ontario, Canada Metrolinx Full time

    Job Summary: We are seeking a highly skilled Reliability Engineering Expert to join our team at Metrolinx. In this role, you will be responsible for ensuring the reliability, availability, maintainability, and safety (RAMS) of our GO Transit Bus fleet and infrastructure assets. You will analyze performance metrics and asset failure history to identify...


  • Toronto, Ontario, Canada The Toronto-Dominion Bank (Canada) Full time

    Job Summary: We are seeking a highly skilled Site Reliability Engineer to join our team at The Toronto-Dominion Bank (Canada). As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our systems and applications. Key Responsibilities: Provide technical leadership and expertise in designing and...


  • Toronto, Ontario, Canada SGS Full time

    We are seeking a Reliable Software Specialist to join our team at SGS, where you will play a critical part in ensuring the reliability, supportability, scalability, and performance of our .NET stack applications built with MVC, Angular, and Web API. The successful candidate will have a strong understanding of system administration principles, including...


  • Toronto, Ontario, Canada Estée Lauder Companies Full time

    Estée Lauder Companies is a leading manufacturer and marketer of high-quality skincare, makeup, fragrance, and haircare products. We are seeking a highly skilled Reliability Engineering Manager to join our team at our Canadian Supply Chain and Canadian Innovation Centre. The successful candidate will be responsible for maintaining the reliability and...


  • Toronto, Ontario, Canada Criteo Full time

    About the Role: Criteo is seeking a talented Site Reliability Engineer to join our PRE team. What You'll Do: As a Site Reliability Engineer, you'll work closely with product engineering to improve the reliability of our apps, systems, and pipelines. You'll assess where optimization is needed most and tell stories with meaningful monitoring. How You'll Make an...


  • Toronto, Ontario, Canada SGS Full time

    Job Title: Site Reliability Engineer. We are seeking a highly skilled Site Reliability Engineer to join our team at SGS Canada. As a key member of our infrastructure team, you will play a critical role in ensuring the reliability, supportability, scalability, and performance of our .NET stack applications. Key Responsibilities: Partner with developers and...


  • Toronto, Ontario, Canada Riverside Natural Foods Full time

    Riverside Natural Foods, a leading manufacturer of organic and natural snacks, is seeking an experienced Asset Reliability Specialist Intern to support its Asset Management Reliability Program. As a key member of our team, you will be responsible for implementing Best-in-Class Asset Reliability Methods across multi-site locations. Job Summary: This co-op...


  • Old Toronto, Ontario, Canada Chelsea Avondale Full time

    Job Title: Asset Reliability Engineer. At Chelsea Avondale, we're pushing the boundaries of home insurance innovation. Our team of experts has developed cutting-edge risk modeling and insurance pricing technologies, which we deploy through our own insurance company. We're a group of talented individuals from diverse backgrounds, including insurance, software...


  • Toronto, Ontario, Canada The Home Depot Canada Full time

    About The Home Depot Canada: The Home Depot Canada is a leading retailer of home improvement products and services, committed to delivering exceptional customer experiences and driving business growth. We are seeking a highly skilled Cloud Reliability Engineering Manager to join our team and lead our Site Reliability Engineers in ensuring the reliability,...