Senior Site Reliability Engineer, Observability
2 weeks ago
OverviewSenior Site Reliability Engineer, Observability at Chainlink Labs. Chainlink Labs is the primary contributing developer of Chainlink, the decentralized computing platform powering the verifiable web. The Observability Team enables Chainlink development and empowers engineers to build and support crucial products and services that have a profound impact in the blockchain industry. This role emphasizes reliability, self-service, and reducing cognitive load across engineering teams.Location: Remote (global) with overlap recommended with Eastern Standard Time (EST).ResponsibilitiesBuild and orchestrate modern OTEL-based observability platform.Support multiple telemetry types, including metrics, logs, and traces.Define and support modern governance in observability and problems at scale.Ensure reliability, security, and performance exceed defined SLAs.Collaborate with engineers across the company to troubleshoot issues, deploy new products and services, and increase velocity while reducing cognitive load.Lead the design and deployment of monitoring and observability services to detect and alert the team of needed action.Ingest, aggregate, transform, and utilize data from multiple sources in the real-time data pipeline.Oversee availability, performance, and supportability of observability infrastructure.Create processes around alert response operations and support the team to ensure reliable delivery of data.Make recommendations to ensure sufficient metrics are collected for alerts with new feature releases.Champion reliability and security by taking care to do work correctly the first time.Requirements7+ years of relevant professional experience in DevOps, infrastructure, SRE, or related fields.Ability to develop software beyond typical infrastructure requirements and configurations.Experience programming in C, C++, Java, Python, Go, Perl, or Ruby.Expert knowledge in designing, developing, and managing large real-time systems.Experience with monitoring and logging: Prometheus, Grafana, and centralized logging solutions (e.g., ELK, Splunk, or Grafana Stack).Experience with distributed systems and container orchestration; comfortable deploying services on Kubernetes clusters.Strong communication skills, including giving/receiving feedback and participating in planning meetings and code reviews.Desired QualificationsInterest in blockchain, Web 3.0, and decentralized technologies.Experience running infrastructure in the blockchain/web3 space.Ability to scale systems sustainably through automation and drive reliability and velocity improvements.Experience working remotely in a distributed team.Desire to grow, automate services, and reduce toil.Tools and daily useAWS; Terraform/Terragrunt; Kubernetes, Calico and ArgoCD; Prometheus and Grafana; GitHub Actions; Packer.All roles with Chainlink Labs are global and remote-based. Overlap with EST is encouraged where possible.We carefully review all applications and aim to provide a response to every candidate within two weeks after the job posting closes. The closing date is listed on the job advert.Equal OpportunityChainlink Labs is an equal opportunity employer. All qualified applicants will receive equal consideration for employment in compliance with applicable laws. If you need assistance or accommodation due to a disability or special need when applying, please contact us via the provided form.Global Data Privacy NoticeInformation collected as part of your Chainlink Labs Careers profile and any job applications you submit is subject to our Privacy Policy. By submitting your application, you consent to our use and processing of your data as required. #J-18808-Ljbffr
-
Senior Site Reliability Engineer, Observability
3 weeks ago
Toronto, Canada Chainlink Labs Full timeOverview Senior Site Reliability Engineer, Observability at Chainlink Labs. Chainlink Labs is the primary contributing developer of Chainlink, the decentralized computing platform powering the verifiable web. The Observability Team enables Chainlink development and empowers engineers to build and support crucial products and services that have a profound...
-
Toronto, Canada Chainlink Labs Full timeOverviewSenior Site Reliability Engineer, Observability at Chainlink Labs. Chainlink Labs is the primary contributing developer of Chainlink, the decentralized computing platform powering the verifiable web. The Observability Team enables Chainlink development and empowers engineers to build and support crucial products and services that have a profound...
-
Site Reliability Engineer
1 week ago
Toronto, Canada Flinks Technology Inc. Full timeAbout Flinks Flinks is where financial data moves—with purpose, trust, and impact.We’re on a mission to simplify access to financial data and help businesses build better, faster, and more secure financial products and experiences. Since 2016, we’ve been bridging the gap between fintechs, financial institutions, and consumers by enabling seamless,...
-
Site Reliability Engineer
6 days ago
Toronto, Canada Flinks Technology Inc. Full timeAbout Flinks Flinks is where financial data moves—with purpose, trust, and impact.We’re on a mission to simplify access to financial data and help businesses build better, faster, and more secure financial products and experiences. Since 2016, we’ve been bridging the gap between fintechs, financial institutions, and consumers by enabling seamless,...
-
Site Reliability Engineer
3 days ago
Toronto, Canada Flinks Technology Inc. Full timeAbout Flinks Flinks is where financial data moves—with purpose, trust, and impact. We’re on a mission to simplify access to financial data and help businesses build better, faster, and more secure financial products and experiences. Since 2016, we’ve been bridging the gap between fintechs, financial institutions, and consumers by enabling seamless,...
-
Site Reliability Engineer
7 days ago
Toronto, Canada Flinks Technology Inc. Full timeAbout Flinks Flinks is where financial data moves—with purpose, trust, and impact. We’re on a mission to simplify access to financial data and help businesses build better, faster, and more secure financial products and experiences. Since 2016, we’ve been bridging the gap between fintechs, financial institutions, and consumers by enabling seamless,...
-
Toronto, Ontario, Canada Chainlink Labs Full time $120,000 - $180,000 per yearAbout UsChainlink Labs is one of the primary contributing developers of Chainlink, the industry-standard oracle platform bringing the capital markets onchain and powering the majority of decentralized finance. The Chainlink stack provides the essential data, interoperability, compliance, and privacy standards needed to power advanced blockchain use cases for...
-
Senior Site Reliability Engineer
3 days ago
Toronto, Canada Tubi Full timeJoin to apply for the Senior Site Reliability Engineer role at Tubi . About Tubi Boldly built for every fandom, Tubi is a free streaming service that entertains over 100 million monthly active users. Tubi offers the world's largest collection of Hollywood movies and TV shows, thousands of creator-led stories and hundreds of Tubi Originals made for the most...
-
Senior Site Reliability Engineer
6 days ago
Toronto, Canada Tubi Full timeJoin to apply for the Senior Site Reliability Engineer role at Tubi. About Tubi Boldly built for every fandom, Tubi is a free streaming service that entertains over 100 million monthly active users. Tubi offers the world's largest collection of Hollywood movies and TV shows, thousands of creator-led stories and hundreds of Tubi Originals made for the most...
-
Senior Site Reliability Engineer
4 weeks ago
Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Orion Innovation Full timeJob Description: Senior Site Reliability Engineer (SRE) with Kubernetes & Rancher Location: Canada - Remote [Working EST hours] Job Type: Full-time About the Role Are you an exceptional Site Reliability Engineer with a passion for building and maintaining highly resilient and secure systems? We are seeking a Senior SRE to join our team and play a critical...