Senior Site Reliability Engineer
1 day ago
Role Title:
SRE Dynatrace Specialist
Line of Business:
DevOps and Automation
Duration:
5 months (possibility of extension)
Working hours:
9am-5pm EST
Office location:
Toronto, ON M5B 1B8
Hybrid work requirements:
2 days/week in office
Role Mandate:
The DevOps and Automation is looking for a Site Reliability Engineer with strong expertise in Dynatrace to ensure the reliability, performance and observability of large scale, distributed systems.
Team Structure:
If technical assistance is required to understand the application or its critical transaction flow, it can be provided. However, this role is primarily focused on independent work
Role Responsibilities:
Monitoring application flow (transactions) to check on anomalies and identify/resolve errors
Using Dynatrace to enable more reporting on critical transactions
Ensuring the reliability, performance, and accuracy of critical business-level dashboards
Configure and maintain Dynatrace OneAgents, ActiveGate, dashboards, synthetic monitoring, RUM, and distributed tracing.
Develop custom alerting rules, anomaly detection settings, and service-level dashboards aligned with SLOs/SLIs.
Build end-to-end observability solutions using Dynatrace's full-stack capabilities.
Create performance baselines, evaluate trends, and identify optimization opportunities.
Integrate Dynatrace with CI/CD pipelines, ticketing systems, and incident management tools.
Use Dynatrace to perform root cause analysis (RCA) and drive long-term remediation strategies.
Establish service-level objectives (SLOs) and reliability standards for critical systems.
Develop internal best practices for monitoring, tracing, and performance engineering.
Must Have Skills:
3-5 years of direct experience with Dynatrace tool
Dynatrace – Full orchestration (all Dynatrace phases, Infra, Synthetic Monitoring, RUM (Real User Monitoring) OS, DB and Incident Mgmt)
Very familiar with Dynatrace integration with ServiceNow
Very Familiar with leveraging Dynatrace Davis AI and enabling to its full potential
Strong knowledge and experience with Dynatrace SRG, workflows, defining guardians, objectives and integration with JIRA/SNOW
Familiar with integrating SRG into deployment pipelines to in force the usage of SRG as go/no-go decisions
Very familiar with Dynatrace WCCS framework (Gen-3 Dashboards) from business centric view, experience enabling true E2E observability
Strong understanding of SRE golden signals (app tier, web tier, DB tier) and how to setup alerts/thresholds accordingly as per SLO/SLI/SLAs
Proficiency with Dynatrace DQL
Nice to Have Skills:
Former FI experience
Technical degree preferred
Dynatrace certifications (Certified Associate or Certified Professional).
-
Senior Site Reliability Engineer
2 weeks ago
Toronto, Ontario, Canada Fivetran Full timeAbout the RoleFivetran is looking for a high-performance engineer to be a part of a team of Site Reliability Engineers. You will be working closely with engineering teams, product managers, as well as support and sales engineers to build the future of the Fivetran Data Platform Reliability. As a member of the Site Reliability Engineering team, you will...
-
Senior Site Reliability Engineer
6 days ago
Toronto, Ontario, Canada Tubi Full timeAbout Tubi:Boldly built for every fandom, Tubi is a free streaming service that entertains over 100 million monthly active users. Tubi offers the world's largest collection of Hollywood movies and TV shows, thousands of creator-led stories and hundreds of Tubi Originals made for the most passionate fans. Headquartered in San Francisco and founded in 2014,...
-
Senior Manager, Site Reliability Engineering
1 week ago
Toronto, Ontario, Canada Tubi Full timeAbout Tubi:Boldly built for every fandom, Tubi is a free streaming service that entertains over 100 million monthly active users. Tubi offers the world's largest collection of Hollywood movies and TV shows, thousands of creator-led stories and hundreds of Tubi Originals made for the most passionate fans. Headquartered in San Francisco and founded in 2014,...
-
Site Reliability Engineer
1 week ago
Toronto, Ontario, Canada Global Technical Talent, an Inc. 5000 Company Full timePrimary Job Title:Site Reliability Engineer IVAlternate / Related Job Titles:Site Reliability EngineerSenior SREIT Reliability EngineerSystems Integration EngineerLocation & Onsite Flexibility:Toronto, ON —Hybrid (4 days onsite)Office Address:66 Wellington Street West, 19th Floor, Toronto, ONContract DetailsPosition Type:ContractContract...
-
Site Reliability Engineer
1 week ago
Toronto, Ontario, Canada Compass Digital Full timeJoin Compass Digital as an Intermediate Site Reliability Engineer and help power the future of hospitality tech You'll design, build, and automate cloud-native systems that are reliable, observable, and scalable—working with AWS, Go, TypeScript, serverless, containers, and cutting-edge DevOps tools.WHO WE ARECompass Digital is an organization that drives...
-
Site Reliability Engineer II
2 weeks ago
Toronto, Ontario, Canada Fivetran Full timeAbout the RoleFivetran is looking for a high-performance engineer to be a part of a team of Site Reliability Engineers. You will be working closely with engineering teams, product managers, as well as support and sales engineers to build the future of the Fivetran Data Platform Reliability. As a member of the Site Reliability Engineering team, you will...
-
Senior Site Reliability Engineer
5 days ago
Toronto, Ontario, Canada Fellow Insights Inc Full timeAt Fellow, our mission is to transform how teams collaborate and make meetings productive for everyone. As we continue to grow, we're seeking a Site Reliability Engineer who will play a pivotal role in scaling and optimizing our infrastructure, ensuring that our AI Meeting Assistant and broader platform remain reliable, secure, and high-performing. In this...
-
Senior Site Reliability Engineer
6 days ago
Toronto, Ontario, Canada RBC Full timeJob DescriptionWhat is the opportunity?This is an exciting opportunity to join a high-impact team responsible for ensuring the reliability, scalability, and performance of critical ATM production systems. As a Senior Service Reliability Engineer, you will play a pivotal role in shaping the future of our ATM services by driving innovation, implementing...
-
Senior Site Reliability Engineer
2 hours ago
Toronto, Ontario, Canada RBC Full timeJob DescriptionWhat is the Opportunity?We are seeking an experienced and skilled Senior Site Reliability Engineer and System's Specialist to join our team, responsible for ensuring the stability, reliability, and performance of our mission-critical application.The ideal candidate will possess a strong technical background in Linux administration, scripting,...
-
Site Reliability Engineer
2 weeks ago
Toronto, Ontario, Canada Tecsys Inc. Full timeHaving recognized the advantages of remote work, including employee morale, productivity, reduced commuting on employee wellbeing and the environment, we are proud to be a digital-first company. The technologies and programs in which we invested have provided a fantastic foundation to this end. Our digital-first work environment, together with our...