Current jobs related to Site Reliability Engineer - Old Toronto - Equifax, Inc.


  • Old Toronto, Ontario, Canada Snaphunt Full time

    The OpportunityWe're seeking a skilled Site Reliability Engineer to join our team at Snaphunt. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our software systems.Your ResponsibilitiesDesign, build, and maintain scalable and reliable cloud infrastructure on AWS.Collaborate with...


  • Old Toronto, Ontario, Canada Snaphunt Full time

    The OpportunityWe're seeking a skilled Site Reliability Engineer to join our team at Snaphunt. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our software systems.Your ResponsibilitiesDesign, build, and maintain scalable and reliable cloud infrastructure on AWS.Collaborate with...


  • Old Toronto, Ontario, Canada Rogers Communications Full time

    Unlock Your Potential at Rogers Sports & MediaWe're on the lookout for a talented Site Reliability Engineer to join our dynamic team at Rogers Sports & Media. As a key player in our organization, you'll have the opportunity to work on exciting projects and collaborate with a diverse group of professionals who share your passion for innovation and...


  • Old Toronto, Ontario, Canada Rogers Communications Full time

    Unlock Your Potential at Rogers Sports & MediaWe're on the lookout for a talented Site Reliability Engineer to join our dynamic team at Rogers Sports & Media. As a key player in our organization, you'll have the opportunity to work on exciting projects and collaborate with a diverse group of professionals who share your passion for innovation and...


  • Old Toronto, Ontario, Canada Northbridge Financial Corporation Full time

    Senior Site Reliability EngineerAbout the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Northbridge Financial Corporation. As a key member of our engineering team, you will be responsible for designing, developing, and implementing site reliability solutions that align with our business goals.Key...


  • Old Toronto, Ontario, Canada Northbridge Financial Corporation Full time

    Senior Site Reliability EngineerAbout the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Northbridge Financial Corporation. As a key member of our engineering team, you will be responsible for designing, developing, and implementing site reliability solutions that align with our business goals.Key...


  • Old Toronto, Canada Northbridge Financial Corporation Full time

    What is it like to be a Senior Site Reliability Engineer at Northbridge Financial? The Senior Site Reliability Engineer oversees the creation and implementation of Service Level Objectives (SLOs). The Senior SRE handles service reliability solutions and processes of increasing complexity and is responsible for mentoring and leading less experienced SREs. We...


  • Old Toronto, Canada Northbridge Financial Corporation Full time

    What is it like to be a Senior Site Reliability Engineer at Northbridge Financial? The Senior Site Reliability Engineer oversees the creation and implementation of Service Level Objectives (SLOs). The Senior SRE handles service reliability solutions and processes of increasing complexity and is responsible for mentoring and leading less experienced SREs. We...


  • Toronto, Canada CB Canada Full time

    Site Reliability Engineer On behalf of our client in the Banking Sector, PROCOM is looking for a Site Reliability Engineer. Site Reliability Engineer – Job Description Azure cloud Jira and confluence CICD Experience with automating (provisioning, configuration management, deployment) and integrating Azure PaaS solutions (Azure App services, Azure...


  • Old Toronto, Canada GlossGenius Full time

    About GlossGenius: GlossGenius is building an ecosystem enabling entrepreneurs to succeed. We empower small business owners to focus on being creators, not admins, by offering a range of business management tools including booking and scheduling, marketing, analytics, payment processing, and much more. Over 75,000 small business owners have chosen to rely on...


  • Old Toronto, Canada GlossGenius Full time

    About GlossGenius: GlossGenius is building an ecosystem enabling entrepreneurs to succeed. We empower small business owners to focus on being creators, not admins, by offering a range of business management tools including booking and scheduling, marketing, analytics, payment processing, and much more. Over 75,000 small business owners have chosen to rely on...


  • Old Toronto, Ontario, Canada Manulife Insurance Malaysia Full time

    Senior Site Reliability EngineerAbout the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our Identity and Access Management team at Manulife Insurance Malaysia. As a key member of our engineering team, you will play a crucial role in ensuring the reliability, scalability, and performance of our software solutions.Key...


  • Old Toronto, Ontario, Canada Manulife Insurance Malaysia Full time

    Senior Site Reliability EngineerAbout the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our Identity and Access Management team at Manulife Insurance Malaysia. As a key member of our engineering team, you will play a crucial role in ensuring the reliability, scalability, and performance of our software solutions.Key...


  • Old Toronto, Canada Manulife Insurance Malaysia Full time

    Senior Site Reliability EngineerJob DescriptionDo you want to be part of a team that redefines how we get work done? We are changing the way we develop, and we want you to be part of it! We are seeking a self-motivated Senior Site Reliability Engineer in our Identity and Access Management space, who is obsessed with delivering value, is forward-thinking, and...


  • Old Toronto, Canada Manulife Insurance Malaysia Full time

    Senior Site Reliability EngineerJob DescriptionDo you want to be part of a team that redefines how we get work done? We are changing the way we develop, and we want you to be part of it! We are seeking a self-motivated Senior Site Reliability Engineer in our Identity and Access Management space, who is obsessed with delivering value, is forward-thinking, and...


  • Toronto, Ontario, Canada Bourse de Montreal Inc. Full time

    Job Title: Site Reliability EngineerAt Bourse de Montreal Inc., we're seeking a highly skilled Site Reliability Engineer to join our Global Technology Services (GTS) team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our technology infrastructure.Key Responsibilities:Evaluate new...


  • Toronto, Ontario, Canada Bourse de Montreal Inc. Full time

    Job Title: Site Reliability EngineerAt Bourse de Montreal Inc., we're seeking a highly skilled Site Reliability Engineer to join our Global Technology Services (GTS) team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our technology infrastructure.Key Responsibilities:Evaluate new...


  • Toronto, Ontario, Canada Bourse de Montreal Inc. Full time

    Job Title: Site Reliability EngineerAt Bourse de Montreal Inc., we're seeking a highly skilled Site Reliability Engineer to join our Global Technology Services (GTS) team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our technology infrastructure.Key Responsibilities:Evaluate new...


  • Toronto, Ontario, Canada Bourse de Montreal Inc. Full time

    Job Title: Site Reliability EngineerAt Bourse de Montreal Inc., we're seeking a highly skilled Site Reliability Engineer to join our Global Technology Services (GTS) team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our technology infrastructure.Key Responsibilities:Evaluate new...


  • Toronto, Ontario, Canada Lorven Technologies Full time

    Job Title: Site Reliability EngineerLocation: Toronto, CADuration: Long termWe are seeking a highly skilled Site Reliability Engineer to join our team at Lorven Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud-based infrastructure.Key ResponsibilitiesDesign and implement scalable...

Site Reliability Engineer

3 months ago


Old Toronto, Canada Equifax, Inc. Full time

Synopsis of the role

Site Reliability Engineering (SRE) combines software and systems engineering to create scalable and highly reliable software systems. SREs are responsible for the availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning of their services.

What experience you need

  • 8-10 years of experience doing hands-on DevOps engineering, Reliability engineering and production support for large scale IT systems on cloud platforms like GCP and AWS

  • A good level of hands on experience in Kubernetes (GKE, EKS)

  • Strong scripting skills (Python, Shell, Groovy)

  • Good command over Linux, Networking on Cloud and Docker

  • Ability to understand and code pipelines for CI/CD automation using Jenkins

  • Capable of coding infrastructure using terraform.

  • Exposure to maintaining databases like MongoDB, Postgres.

What you’ll do

  • Design, architect and develop cloud native solutions using services like GKE, Cloud Functions, CloudSQL, BigQuery, Pub/Sub, Composer, Dataflow etc on Google cloud platform

  • Build and own infrastructure through Terraform code and maintain a high quality code base

  • Work closely with development teams to remove repetitive processes using Automation (Jenkins, Python, Groovy, gcloud)

  • Troubleshoot production incidents using tools like DataDog, Google Cloud Operations suite, Grafana, ChaosSearch

  • Participate in the SRE team’s on-call rotations, respond to incidents and provide expert support in resolving customer impacting production issues

  • Plan and Implement Disaster Recovery for the systems and conduct regular DR tests to ensure business continuity during the event of a disaster

  • Actively contribute to the SRE operational artifacts

    • Engineering documentation

    • Standard operating procedures

  • Perform cloud cost optimization on the resources owned by SRE

  • Proactively keep up with all the security scans and reports to maintain a secure system and perform regular patching of all cloud resources

What could set you apart

  • A good exposure to security patching of resources on google cloud
  • Ability to document engineering solutions and share the information across the team

  • Ability to help with developing standard operating procedures for SRE operations within the company

  • Willingness to go through official product documentations to build academically correct and secure systems

  • Exposure to Vertex AI on google cloud is a plus

  • Exposure to maintaining databases like MongoDB, Postgres

  • Availability to work extended hours during production incidents and production changes.

#J-18808-Ljbffr