(Canada) Site Reliability Engineer

6 days ago


Old Toronto, Canada Thomson Reuters Full time
(Canada) Site Reliability Engineer (Contract)

Contract (9 months 4 days)

Published 3 days ago

New Relic

Datadog

Site Reliability Engineer - in the Service Management Organization

Do you have experience in IT Service Management, working with cloud providers, software development, and technology infrastructure?

The Site Reliability Engineer will analyze chronic and major issues, evaluate products and their services, make recommendations to improve service outcomes, design solutions in partnership with product, engineering, and architecture teams, and build, test, and operationalize tools and applications to improve customer experience and reduce costs. Additionally, the Lead Site Reliability Engineer will provide oversight and coaching to engineers and be an escalation point for our global command center engineers. We have multiple opportunities at different levels of seniority.

About the Role

In this opportunity as Site Reliability Engineer, you will be responsible for:

  • Service design, service operations, release, development, and support.
  • Leading the work to drive efficiencies and reduce service operations risks.
  • Leading the research of new capabilities, testing new solutions, recommending and implementing new technologies to improve customer experience and reduce costs.
  • Collaborating and partnering with cross-functional teams to solve intractable problems and devising solutions to improve the products and services we offer our customers.

About You

You're a fit for the role of Site Reliability Engineer if:

  • You are proficient in cloud technologies, services, use of their APIs, and configuration tools.
  • You use AI/ML tools to help improve service and reduce costs, and you have worked with AI-Operations solutions.
  • You are familiar with programming languages such as Python, Java, C#.
  • You have designed and supported scalable systems and services.
  • You are proficient with networking, Windows, Linux, containers, PostgreSQL, or related infrastructure services at scale.
  • You can automate tasks to improve service operations and support.
  • You use configuration management tools to manage configuration at scale.
  • You apply the scientific method to system components to identify improvements.
  • You are proficient in observability tools such as Datadog or New Relic.
  • You are proficient in data analysis from sources such as SQL, S3, Athena, etc.

  • Old Toronto, Ontario, Canada CB Canada Full time

    Site Reliability Engineer On behalf of our client in the Banking Sector, PROCOM is looking for a Site Reliability Engineer. Site Reliability Engineer – Job Description Azure cloud Jira and confluence CICD Experience with automating (provisioning, configuration management, deployment) and integrating Azure PaaS solutions (Azure App services, Azure Kubernetes...


  • Old Toronto, Canada CB Canada Full time

    Site Reliability Engineer On behalf of our client in the Banking Sector, PROCOM is looking for a Site Reliability Engineer. Site Reliability Engineer – Job Description Azure cloud Jira and confluence CICD Experience with automating (provisioning, configuration management, deployment) and integrating Azure PaaS solutions (Azure App services, Azure...


  • Toronto, Canada CB Canada Full time

    Site Reliability Engineer On behalf of our client in the Banking Sector, PROCOM is looking for a Site Reliability Engineer. Site Reliability Engineer – Job Description Azure cloud Jira and confluence CICD Experience with automating (provisioning, configuration management, deployment) and integrating Azure PaaS solutions (Azure App services, Azure...


  • Old Toronto, Canada Reperio Human Capital Full time

    Site Reliability Engineer 100421 Desired skills: Site Reliability Engineer, SRE, Cloud, Permanent, Remote Site Reliability Engineer Location: Ireland/UK Salary: €70K+ Type: Permanent, Full-time We're seeking experienced Site Reliability Engineers who excel at ensuring the reliability and scalability of production systems, and possess extensive experience...


  • Old Toronto, Canada Reperio Human Capital Full time

    Site Reliability Engineer 100421 Desired skills: Site Reliability Engineer, SRE, Cloud, Permanent, Remote Location: Ireland/UK Salary: €70K+ Type: Permanent, Full-time We're seeking experienced Site Reliability Engineers who excel at ensuring the reliability and scalability of production systems, and possess extensive experience with monitoring and...


  • Old Toronto, Canada E-Solutions Full time

    Job Title: Site Reliability Engineer Location: Toronto, ON Skills and Responsibilities: Collaborate with teams to enhance application and transaction scalability using Azure Kubernetes Service (AKS) and Azure scalability features. Develop application monitoring strategies using New Relic, Devo, and Azure Monitor, including creating monitors and...


  • Old Toronto, Canada Autodesk Full time

    Position Overview Autodesk, the leading Design and Make Software Company, is looking for a Principal Site Reliability Engineer to join the Autodesk Platform Services Engineering team in Toronto, Canada. In this position, you will help build trusted services of APS (Autodesk Platform Services) as measured by Service Level Objectives (SLOs) and Mean Time to...


  • Old Toronto, Canada Epsilon Solutions Ltd. Full time

    Job Title: Site Reliability Engineer Location: Toronto, ON Skills and Responsibilities: Collaborate with teams to enhance application and transaction scalability using Azure Kubernetes Service (AKS) and Azure scalability features. Develop application monitoring strategies using New Relic, Devo, and Azure Monitor, including creating monitors and dashboards....


  • Toronto, Canada OnX Canada Full time

    OnX is looking for a Site Reliability Engineer for one of our clients in Toronto. Client: Financial Services Location: Toronto, mostly remote Duration: 6 months with potential extension JBoss in middleware experience is super important Responsibilities: Following the senior technicians plans to buil


  • Toronto, Ontario, Canada OnX Canada Full time

    OnX is looking for a Site Reliability Engineer for one of our clients in Toronto. Client: Financial Services Location: Toronto, mostly remote Duration: 6 months with potential extension JBoss in middleware experience is super important Responsibilities: Following the senior technicians plans to buil


  • Toronto, Ontario, Canada Zortech Solutions Full time

    Role: SRE (Site Reliability Engineer) Location: Canada/Remote Duration: 6+ Months Job Description: Years of Experience: 7+ SRE/SRO experience: - Dynatrace, GCP logging / metrics / alerting / thresholds / performance


  • Old Toronto, Canada Skillfinder Full time

    SITE RELIABILITY ENGINEER - WARSAW, POLAND Contract (hybrid working) - 12 months + Role Overview My client serves a variety of world class financial services clients with their state of the art integrated investment management system. For their office in Warsaw, they are seeking a team of Site Reliability Engineers to assist them with a major client...


  • Old Toronto, Canada eTeam Full time

    Remote Work Duration 4 months - Preference is to find candidates who are willing to be converted to full-time employees. The conversion decision will be made based on performance. Job Description Role Description: Defining and measuring reliability goals—SLIs, SLOs, and error budgets for user journey. Designing for and implementing observability (ELK,...


  • Old Toronto, Canada Northbridge Financial Corporation Full time

    Manager, Site Reliability Engineering Locations: Toronto, ON Time type: Full time Posted 2 Days Ago Job requisition ID: R3417 What it’s like to be a Site Reliability Engineer Manager at Northbridge Financial The Manager, Site Reliability Engineering is a...


  • Old Toronto, Canada Equifax, Inc. Full time

    Synopsis of the role Site Reliability Engineering (SRE) combines software and systems engineering to create scalable and highly reliable software systems. SREs are responsible for the availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning of their services. What experience you need ...