Senior DevOps Engineer / Site Reliability

1 month ago


Canada The New Network Full time
ABOUT THE ROLE

We are looking for a Senior DevOps/Site Reliability Engineer to take cloud infrastructure to the next level with complex AWS builds, infrastructure-as-code, and observability/logging/APM solutions. You'll work in an embedded reliability team, alongside app and data engineers, to monitor, benchmark, and scale our client's products. You will work with first-class technologies and staff to leverage all the goodies AWS has to offer, and you will build a bridge between our bare-metal infrastructure and our Ruby on Rails production app. Predictability, reliability, and scalability are your three favourite words.

- Develop and maintain infrastructure-as-code CloudFormation templates, emphasizing serverless resources (ECS, Fargate, Lambda)
- Instrument our infrastructure and Ruby on Rails applications and analyze their performance metrics daily, using AWS tooling (Athena, CloudTrail, etc.) and third-party observability platforms (Datadog, OTel)
- Manage deployment pipelines, including blue/green and intelligent auto-scaling
- Maintain and stay ahead of resource dependencies, particularly databases (RDS, ElastiCache/Redis), including updates, playbooks, and downtime planning
- Project costs and implement AWS cost savings programs and reserved instances
- Work alongside our risk and security teams to ensure ongoing SOC 2 and cybersecurity compliance
- Collaborate extensively with app developers on shared metrics, database performance, and load testing
- Collaborate extensively with data engineers to facilitate data warehouse development, ELT, and ETL
- Participate in our agile development process: sprint planning, story grooming, and stand-ups
- Adhere to our SDLC and our secure coding practices and environment

REQUIRED SUPERPOWERS:

- 5+ years infrastructure experience
- 2+ years AWS experience including certification and deployment of production applications
- Proficiency with IaC, specifically CloudFormation
- Experience with containerization (Docker, ECS, ECR)
- Experience analyzing and acting on performance issues using observability platforms (Datadog, New Relic, OTel)
- Ability to build quickly when we need to experiment and build cleanly when an MVP becomes core functionality
- Strong SQL and data analysis skills and an eagerness to dig into data as part of problem solving

This is a full-time, permanent position on our client's team. The successful candidate will work in a fully remote environment and must be based in Canada.