Sr./Staff - Infrastructure/Site Reliability Engineer (SRE)
3 days ago
Overview Join to apply for the DevOps/Site Reliability Engineer (SRE) role at Oscilar . Get AI-powered advice on this job and more exclusive features. Shape the future of trust in the age of AI At Oscilar, we're building the most advanced AI Risk Decisioning Platform. Banks, fintechs, and digitally native organizations rely on us to manage their fraud, credit, and compliance risk with the power of AI. If you're passionate about solving complex problems and making the internet safer for everyone, this is your place. Why Join Us Mission-driven teams: Work alongside industry veterans from Meta, Uber, Citi, and Confluent, all united by a shared goal to make the digital world safer. Ownership and impact: We believe in extreme ownership. You'll be empowered to take responsibility, move fast, and make decisions that drive our mission forward. Innovate at the cutting edge: Your work will shape how modern finance detects fraud and manages risk. About The Role Oscilar is growing fast, and so is the complexity of our systems. We’re looking for an experienced SRE to take ownership of reliability across our multi-region, cloud-native platform. You’ll have the mandate and autonomy to design, implement, and evolve systems that stay performant and resilient—through traffic spikes, dependency failures, and global deployments. You’ll be shaping how we scale, how we build observability, and how we run infrastructure that supports billions of events and large-scale data pipelines. What You’ll Own Architect and operate resilient cloud infrastructure (AWS, Pulumi, Kubernetes). Lead initiatives to improve availability, latency, and performance at scale. Design and evolve our CI/CD pipelines to optimize for speed, safety, and repeatability. Define the metrics, alerts, and runbooks that form our observability backbone. Run chaos experiments and failure simulations to harden the platform. Mentor engineers and set best practices for SRE across the company. What You Bring Proven track record as a senior SRE, DevOps, or infrastructure engineer in high-scale environments. Expert-level skills in AWS and Infrastructure as Code (Pulumi, Terraform). Strong programming ability in Go and Java. Deep understanding of distributed systems (Kafka, ClickHouse) and microservices architecture. Mastery of container orchestration (Kubernetes) and production debugging. Strong sense of ownership, and the judgment to balance velocity with reliability. Benefits Compensation: Competitive salary and equity packages, including a 401k plan Flexibility: Remote-first culture — work from anywhere Health: 100% Employer covered comprehensive health, dental, and vision insurance with a top tier plan for you and your dependents (US and Canada) Balance: Unlimited PTO policy Technical: AI First company; both Co-Founders are engineers at heart; and over 50% of the company is Engineering and Product Culture: Family-Friendly environment; Regular team events and offsites Development: Unparalleled learning and professional development opportunities Gear: Home office setup assistance Impact: Making the internet safer by protecting online transactions Seniority level Mid-Senior level Employment type Full-time Job function Engineering and Information Technology Industries Technology, Information and Internet Referrals increase your chances of interviewing at Oscilar by 2x #J-18808-Ljbffr
-
, , Canada Remoteworldwide Full timeStaff Infrastructure Site Reliability Engineer Staff Infrastructure Site Reliability Engineer Posted: 04/05/2025 Anywhere in the world Remote Senior About the Team: Netlify’s SRE team is scaling to meet the demands of our rapidly growing platform and user base. Our SRE team is responsible for ensuring the reliability, scalability, and efficiency of...
-
Staff Site Reliability Engineer
4 weeks ago
, BC, Canada Branch Full timeOverview At Branch, we’re transforming how brands and users interact across digital platforms. Our mobile marketing and deep linking solutions deliver seamless experiences that increase ROI, decrease wasted spend, and eliminate siloed attribution. Our team values ownership, collaboration, and a motto: Build Together, Grow Together, Win Together. As a Staff...
-
Site Reliability Engineer
3 weeks ago
, , Canada Orion Innovation Full timeSenior Site Reliability Engineer (SRE) with Kubernetes & Rancher Location: Canada - Remote (Working EST hours) Job Type: Full-time About the Role Are you an exceptional Site Reliability Engineer with a passion for building and maintaining highly resilient and secure systems? We are seeking a Senior SRE to join our team and play a critical role in managing...
-
Senior Site Reliability Engineer
3 weeks ago
, , Canada Orion Innovation Full timeJob Description: Senior Site Reliability Engineer (SRE) with Kubernetes & Rancher Location: Canada - Remote (Working EST hours) Job Type: Full-time About the Role Are you an exceptional Site Reliability Engineer with a passion for building and maintaining highly resilient and secure systems? We are seeking a Senior SRE to join our team and play a critical...
-
Site Reliability Engineer
5 days ago
, , Canada Compass Digital Full timeAs a Reliability Engineer you will work in focus areas such as observability, release automation, incident and problem response improvements, security, code quality, patch management and SRE advocacy. You will have the opportunity to use the latest and greatest cloud and open-source technology to enable our product and test engineering teams through...
-
Director, Site Reliability Engineering
3 weeks ago
, , Canada Icon Full timeHelping SaaS companies scale Engineering teams. Director, Site Reliability Engineering We are seeking an accomplished Director of Site Reliability Engineering (SRE) to lead the reliability, scalability, and performance initiatives across multiple enterprise technology domains, including AML, Risk, Finance, Corporate Treasury, and Human Resources systems....
-
Senior Site Reliability Engineer
3 weeks ago
, , Canada Orion Innovation Full timeWe are seeking a highly specialized and experienced Senior Site Reliability Engineer (SRE) to drive the reliability, performance, and automation of our core platform. This role requires an exceptional blend of deep programming expertise in both Ruby and Go , coupled with hands‑on mastery of Linux systems, advanced networking concepts (specifically IPSec),...
-
Staff Site Reliability Engineer, Database
3 days ago
, , Canada Alpaca Full timeStaff Site Reliability Engineer, Database Who We Are: Alpaca is a US-headquartered self-clearing broker-dealer and brokerage infrastructure for stocks, ETFs, options, crypto, fixed income, 24/5 trading, and more. Our recent Series C funding round brought our total investment to over $170 million, fueling our ambitious vision. Amongst our subsidiaries, Alpaca...
-
Senior Site Reliability Engineer
4 weeks ago
, , Canada Glia Technologies, Inc. Full timeOur award-winning technology powers conversations with customers for some of the world’s largest enterprises. We believe that combining the human touch with technology is the best way to create amazing customer experiences. When human abilities such as problem-solving, creative thinking and relationship building are enhanced with technology... magical...
-
Lead Site Reliability Engineer
24 hours ago
, , Canada Masabi Full timeIntroducing Masabi At Masabi, we’re driving the fare payment revolution, powering the journeys of millions all over the world. We build fare collection platforms that allow riders to seamlessly buy and present tickets for public transport either on their mobile phones, from a ticket machine, or even by tapping their bank card to travel. The Role We’re...