Current jobs related to SRE/Observability Engineer - Toronto - Astra-North Infoteck Inc. ~ Conquering today’s challenges, achieving tomorrow’s vision!
-
Senior SRE, Observability
3 weeks ago
Toronto, Canada Chainlink Labs Full timeA leading blockchain technology firm is seeking a Senior Site Reliability Engineer specializing in observability. In this role, you will build and orchestrate an observability platform, ensuring reliable performance that exceeds set SLAs. You will collaborate across engineering teams, contribute to the design of monitoring services, and support multiple...
-
Senior SRE, Observability
4 weeks ago
Toronto, Canada Chainlink Labs Full timeA leading blockchain technology firm is seeking a Senior Site Reliability Engineer specializing in observability. In this role, you will build and orchestrate an observability platform, ensuring reliable performance that exceeds set SLAs. You will collaborate across engineering teams, contribute to the design of monitoring services, and support multiple...
-
Observability SRE: AI-Driven Reliability
3 weeks ago
Toronto, Canada Flinks Technology Inc. Full timeA leading financial technology firm in Toronto is seeking an experienced Observability Site Reliability Engineer (SRE) to own observability and reliability strategies across their product lines. The ideal candidate will have 5-8 years in reliability roles, experience with observability tools like Grafana and Prometheus, and strong programming skills. The...
-
Observability SRE: AI-Driven Reliability
4 weeks ago
Toronto, Canada Flinks Technology Inc. Full timeA leading financial technology firm in Toronto is seeking an experienced Observability Site Reliability Engineer (SRE) to own observability and reliability strategies across their product lines. The ideal candidate will have 5-8 years in reliability roles, experience with observability tools like Grafana and Prometheus, and strong programming skills. The...
-
Lead Observability Engineer – Sumo Logic
6 minutes ago
Toronto, Ontario, Canada E-IT Full timeRole : Lead Observability Engineer – Sumo Logic & SRELocation :RemoteHire-type: ContractWe are seeking a highly skilled Lead Observability Engineer to lead a critical implementation of Sumo Logic for a client migrating from Dynatrace. This role requires deep expertise in Sumo Logic, Site Reliability Engineering (SRE) practices, and Kubernetes (EKS)...
-
Toronto, Canada S.i. Systèmes Full timeSenior Site Reliability Engineer (SRE) - Dynatrace SMELocation: Toronto (Hybrid)Duration: 4 months, with possibility of extensionStart Date: ASAPOverviewOur Major Banking Client is seeking a Senior Site Reliability Engineer (SRE) with deep, hands-on Dynatrace expertise to accelerate their enterprise observability transformation. This is a SME-level role...
-
Site Reliability Engineer(SRE)
2 weeks ago
Toronto, Canada Serigor inc. Full timeSerigor is all about helping you make the right decision about the right technical support for the right fineness in management utilities at any time in a firm standing. Serigor helps organizations stay ahead by building sustainable competitive advantage. Job Description The SRE Role SREs are engineers with the right mix of knowledge and skills in software...
-
Site Reliability Engineer
4 weeks ago
Toronto, Canada Flinks Technology Inc. Full timeAbout Flinks Flinks is where financial data moves—with purpose, trust, and impact.We’re on a mission to simplify access to financial data and help businesses build better, faster, and more secure financial products and experiences. Since 2016, we’ve been bridging the gap between fintechs, financial institutions, and consumers by enabling seamless,...
-
Site Reliability Engineer
3 weeks ago
Toronto, Canada Flinks Technology Inc. Full timeAbout Flinks Flinks is where financial data moves—with purpose, trust, and impact. We’re on a mission to simplify access to financial data and help businesses build better, faster, and more secure financial products and experiences. Since 2016, we’ve been bridging the gap between fintechs, financial institutions, and consumers by enabling seamless,...
-
Site Reliability Engineer
4 weeks ago
Toronto, Canada Flinks Technology Inc. Full timeAbout Flinks Flinks is where financial data moves—with purpose, trust, and impact. We’re on a mission to simplify access to financial data and help businesses build better, faster, and more secure financial products and experiences. Since 2016, we’ve been bridging the gap between fintechs, financial institutions, and consumers by enabling seamless,...
SRE/Observability Engineer
33 minutes ago
Toronto – Hybrid or Remote. Role Summary We are looking for an Observability Engineer to help implement, operate, and improve observability capabilities across our applications and platforms. This role focuses on hands‑on onboarding, instrumentation, dashboarding, and alerting, working under established standards and guidance from senior engineers. You will collaborate with application, SRE, and operations teams to ensure systems are observable, supportable, and production‑ready. Key Responsibilities Observability Implementation Implement and maintain metrics, logs, and traces for applications and infrastructure Assist with onboarding applications into observability platforms (e.g., Dynatrace, ELK, Datadog) Configure dashboards, alerts, and basic anomaly detection Application Support & Instrumentation Work with development teams to enable structured logging, basic distributed tracing, and core metrics Validate observability requirements during Production Readiness Reviews (PRR) Troubleshoot missing or low‑quality telemetry Monitoring & Alerting Configure alerts based on golden signals (latency, errors, traffic, saturation) Help reduce alert noise by tuning thresholds and alert logic Support incident response by gathering logs, metrics, and traces Operations & Reliability Support root cause analysis using observability Maintain dashboards and documentation used by on‑call and support teams Participate in on‑call rotations (as applicable) Automation & Continuous Improvement Assist in automating observability onboarding and validation tasks Create and maintain reusable dashboards and alert templates Follow established observability standards and best practices Required Qualifications 2–4 years of experience in Observability, or SRE Working knowledge of metrics, logs, and basic tracing concepts Hands‑on experience with at least one observability platform (Dynatrace, Elastic/ELK, Datadog, New Relic, etc.) Basic understanding of SLI/SLOs and service health indicators Experience with cloud platforms or hybrid environments Ability to write scripts (Python, Bash, PowerShell) for automation and troubleshooting Preferred Qualifications Experience with OpenTelemetry or APM agents Familiarity with Kubernetes or containerized workloads Experience working with incident management tools (PagerDuty, ServiceNow) Exposure to Dynatrace/Kibana ELK or similar cloud‑native monitoring Experience in regulated or enterprise environments #J-18808-Ljbffr