Site Reliability Engineer

4 weeks ago

Toronto, Canada | Flinks Technology Inc. | Full time

About Flinks

Flinks is where financial data moves—with purpose, trust, and impact.

We’re on a mission to simplify access to financial data and help businesses build better, faster, and more secure financial products and experiences. Since 2016, we’ve been bridging the gap between fintechs, financial institutions, and consumers by enabling seamless, secure data connectivity.

From instant account funding to smarter lending, our solutions help power some of the most innovative financial products in North America. We partner with lenders, banks, and fintechs to streamline onboarding, prevent fraud, and fuel real-time decision-making with enriched, reliable data.

As pioneers in Canada’s open banking movement, we're not waiting for the future—we're building it. If you're bold, curious, and ready to help shape the future of finance, we’d love to meet you.

What You'll Be Doing

As the Observability SRE, you will own the end-to-end observability, monitoring, and reliability strategy across all Flinks product lines. Your mission is to ensure every product—Data Connectivity, Payments, Enrichment, and Document Services—has the right telemetry, actionable alerts, and reliability insights.

- Company-wide Observability & Monitoring: Define and maintain an observability framework across products; ensure coverage for APIs, scraping systems, payments, enrichment, and document services; establish SLIs/SLOs aligned to client expectations.
- Alerting & Incident Management: Build consistent, low-noise alerting rules; integrate observability into Incident.io workflows; lead cross-product RCA; maintain a “single source of truth” for reliability metrics.
- Reliability Analysis & Insights: Deliver monthly/quarterly scorecards linking reliability to client outcomes (e.g., churn risk, adoption blockers); analyze trends and recurring failures; translate data into executive insights.
- Automation & AI-Enabled Observability: Automate anomaly detection, escalation, and self-healing; partner with the AI team; optimize logging and monitoring spend.
- Collaboration & Enablement: Champion observability practices across teams; train PMs, QA, and Engineers; ensure insights influence roadmaps; collaborate with Tech Leadership to build observability in from the start.

Who You Are

- Experience: 5–8 years in SRE, Observability, or Reliability roles, ideally across multiple product environments (fintech, SaaS, or data platforms).
- Technical Skills: Strong in observability tooling (Grafana, Prometheus, OpenTelemetry, ELK); hands-on experience with tracing and profiling tools (APM, OTEL, Pyroscope); experience with distributed systems, APIs, and data pipelines; strong automation skills (Kubernetes).
- Programming: Strong skills in at least one programming language; C# and Go are preferred, but experience in other languages is also valued.
- Mindset:
  - Systems thinker who sees the big picture.
  - Business-aware, connecting reliability to retention and profitability.
  - Proactive, anticipating failures before they occur.
  - Collaborative, working across product, QA, engineering, and reliability.

Great to Have

- Experience in fintech or high-availability SaaS environments.
- Familiarity with payments infrastructure and fraud detection systems.
- Contributions to open-source observability tools or frameworks.

Why This Role Matters at Flinks

- Ensures all products have consistent reliability and observability standards.
- Provides a single source of truth for performance and reliability across the org.
- Directly improves client trust, profitability, and operational efficiency.
- Enables proactive stability management across Flinks’ core product lines.
- Supports our shift to a cohesive, reliable, platform-first mindset at scale.

The Interview Process

- Head of People
- Director of IT Ops
- Technical Challenge
- Panel Interview

