Sr./Staff - Infrastructure/Site Reliability Engineer (SRE)

4 weeks ago


Canada Oscilar Full time

Overview

Shape the future of trust in the age of AI

At Oscilar, we're building the most advanced AI Risk Decisioning Platform. Banks, fintechs, and digitally native organizations rely on us to manage their fraud, credit, and compliance risk with the power of AI. If you're passionate about solving complex problems and making the internet safer for everyone, this is your place.

Why Join Us

  • Mission-driven teams: Work alongside industry veterans from Meta, Uber, Citi, and Confluent, all united by a shared goal to make the digital world safer.
  • Ownership and impact: We believe in extreme ownership. You'll be empowered to take responsibility, move fast, and make decisions that drive our mission forward.
  • Innovate at the cutting edge: Your work will shape how modern finance detects fraud and manages risk.

About The Role

Oscilar is growing fast, and so is the complexity of our systems. We’re looking for an experienced SRE to take ownership of reliability across our multi-region, cloud-native platform. You’ll have the mandate and autonomy to design, implement, and evolve systems that stay performant and resilient through traffic spikes, dependency failures, and global deployments. You’ll be shaping how we scale, how we build observability, and how we run infrastructure that supports billions of events and large-scale data pipelines.

What You’ll Own

  • Architect and operate resilient cloud infrastructure (AWS, Pulumi, Kubernetes).
  • Lead initiatives to improve availability, latency, and performance at scale.
  • Design and evolve our CI/CD pipelines to optimize for speed, safety, and repeatability.
  • Define the metrics, alerts, and runbooks that form our observability backbone.
  • Run chaos experiments and failure simulations to harden the platform.
  • Mentor engineers and set best practices for SRE across the company.

What You Bring

  • Proven track record as a senior SRE, DevOps, or infrastructure engineer in high-scale environments.
  • Expert-level skills in AWS and Infrastructure as Code (Pulumi, Terraform).
  • Strong programming ability in Go and Java.
  • Deep understanding of distributed systems (Kafka, ClickHouse) and microservices architecture.
  • Mastery of container orchestration (Kubernetes) and production debugging.
  • Strong sense of ownership, and the judgment to balance velocity with reliability.

Benefits

  • Compensation: Competitive salary and equity packages, including a 401(k) plan.
  • Flexibility: Remote-first culture; work from anywhere.
  • Health: 100% employer-covered comprehensive health, dental, and vision insurance with a top-tier plan for you and your dependents (US and Canada).
  • Balance: Unlimited PTO policy.
  • Technical: AI-first company; both co-founders are engineers at heart, and over 50% of the company is Engineering and Product.
  • Culture: Family-friendly environment; regular team events and offsites.
  • Development: Unparalleled learning and professional development opportunities.
  • Gear: Home office setup assistance.
  • Impact: Making the internet safer by protecting online transactions.

Seniority level: Mid-Senior level
Employment type: Full-time
Job function: Engineering and Information Technology
Industries: Technology, Information and Internet



  • Canada Remoteworldwide Full time

    Staff Infrastructure Site Reliability Engineer. Posted: 04/05/2025. Anywhere in the world. Remote. Senior. About the Team: Netlify’s SRE team is scaling to meet the demands of our rapidly growing platform and user base. Our SRE team is responsible for ensuring the reliability, scalability, and efficiency of...


  • Canada Quantum World Technologies Inc. Full time

    Role: SRE Site Reliability Engineering. Remote 100%. 6 Months. Rate: CAD 30 to CAD 35 + HST


  • (s): Canada : Ontario : Toronto Scotiabank Global Site Full time

    Requisition ID: 244026. Join a purpose-driven, winning team, committed to results, in an inclusive and high-performing culture. Overview: As a Site Reliability Engineer (SRE), you will join the Digital Engineering Operations team, responsible for ensuring the operations and reliability of Scotiabank digital applications. You will have the opportunity to drive...


  • (s): Canada : Ontario : Toronto Scotiabank Global Site Full time

    Requisition ID: 244027. Join a purpose-driven, winning team, committed to results, in an inclusive and high-performing culture. Overview: As a Site Reliability Engineer (SRE), you will join the Digital Engineering Operations team, responsible for ensuring the operations and reliability of Scotiabank digital applications. You will have the opportunity to drive...


  • Canada Compass Digital Full time

    As a Reliability Engineer you will work in focus areas such as observability, release automation, incident and problem response improvements, security, code quality, patch management and SRE advocacy. You will have the opportunity to use the latest and greatest cloud and open-source technology to enable our product and test engineering teams through...


  • BC, Canada Orion Innovation Full time

    Overview: Senior Site Reliability Engineer (SRE) with Kubernetes and Rancher. Full-time role focused on building and maintaining highly resilient, secure systems, including in air-gapped environments. Responsibilities: System Architecture & Management: Design, architect, and maintain highly reliable, multi-tenant systems using Kubernetes and related tools...


  • Canada Alpaca Full time

    Staff Site Reliability Engineer, Database. Who We Are: Alpaca is a US-headquartered self-clearing broker-dealer and brokerage infrastructure for stocks, ETFs, options, crypto, fixed income, 24/5 trading, and more. Our recent Series C funding round brought our total investment to over $170 million, fueling our ambitious vision. Amongst our subsidiaries, Alpaca...


  • Canada Masabi Full time

    Introducing Masabi: At Masabi, we’re driving the fare payment revolution, powering the journeys of millions all over the world. We build fare collection platforms that allow riders to seamlessly buy and present tickets for public transport either on their mobile phones, from a ticket machine, or even by tapping their bank card to travel. The Role: We’re...


  • Canada Thinkific Full time

    Senior Site Reliability Engineer at Thinkific. Are you an experienced Site Reliability Engineer looking for a new challenge? We’re looking for a Senior Site Reliability Engineer to join us at Thinkific. We’re looking for a Senior Site Reliability Engineer...