Senior Site Reliability Developer

2 days ago


Toronto, Canada RBC Full time

Job Description What is the opportunity? This is an exciting opportunity to join a high-performing team that plays a critical role in ensuring the reliability, scalability, and performance of pre‑production environments for ATM systems. As a Senior Service Reliability Developer, you will be at the forefront of driving innovation and operational excellence in a mission‑critical domain. It is ideal for someone who is passionate about building and maintaining highly reliable, scalable, and secure systems while driving automation and operational excellence. What will you do? As a Senior Service Reliability Engineer in the ATM team, you will be responsible for ensuring the reliability, performance, and scalability of our production environment. Your day‑to‑day responsibilities will include: Architect and develop solutions to enhance reliability and performance across pre‑production and production environments, ensuring seamless ATM operations. Design and implement advanced monitoring tools to proactively detect and resolve system issues, maintaining high availability and performance. Develop and deploy automation frameworks for deployment, monitoring, and incident response to minimize manual intervention and improve efficiency. Build and optimise CI/CD pipelines to enable faster, more reliable software deliveries. Lead incident response efforts, conduct root cause analysis (RCA), and implement long‑term fixes to prevent recurrence. Participate in on‑call rotations to address high‑severity production incidents, including off‑hours troubleshooting, cross‑functional coordination, and urgent remediation to ensure 24/7 system reliability. Partner with operations, QE, and engineering teams to align on best practices, share knowledge, and ensure smooth system integration. Champion DevOps and SRE principles by fostering collaboration, continuous improvement, and Infrastructure as Code (IaC). Mentor junior team members, share expertise, and contribute to team growth and technical excellence. What do you need to succeed? Must have: To excel as a Senior Service Reliability Developer in the ATM team, you must have the following skills and expertise: Proficiency in building and managing CI/CD pipelines, infrastructure as code (IaC) tools (e.g., Terraform, Ansible), and automation frameworks to streamline deployments and operations. Experience with monitoring tools (e.g., Prometheus, Grafana, Splunk) and the ability to diagnose and resolve complex system issues in real‑time. Strong coding skills in languages such as Python, Go, or Java, along with scripting expertise in Bash, PowerShell, or similar. Hands‑on experience with cloud platforms (e.g., AWS, Azure, GCP), containerization tools like Docker and Kubernetes, and a solid understanding of networking concepts (e.g., DNS, load balancing, firewalls, and VPNs) to ensure secure and efficient system communication. A proactive approach to identifying and solving technical challenges, coupled with the ability to work effectively in cross‑functional teams. Core Technologies: Advanced MDT task sequence creation and troubleshooting Site administration, software deployment, operating system deployment DISM, WIM file management, sysprep automation Advanced scripting for automation and customization Patch management and driver integration MSI creation, App‑V, application compatibility Containerization basics Monitoring solutions (Prometheus, Grafana, or similar) Nice‑to‑have: Familiarity with advanced networking protocols, SDN (Software‑Defined Networking), and network performance optimisation techniques. Understanding of security principles, including vulnerability management, secure coding practices, and compliance standards (e.g., PCI DSS). Hands‑on experience with observability platforms (e.g., Datadog, New Relic, or Elastic Stack) to gain deeper insights into system performance and reliability. Advanced scripting skills (e.g., Python, Bash, PowerShell) and experience in creating automation scripts for tasks such as system monitoring, deployment, and incident response. What's in it for you? We thrive on the challenge to be our best, progressive thinking to keep growing, and working together to deliver trusted advice to help our clients thrive and communities prosper. We care about each other, reaching our potential, making a difference to our communities, and achieving success that is mutual. A comprehensive Total Rewards Program including bonuses and flexible benefits, competitive compensation, commissions, and stock where applicable. Leaders who support your development through coaching and managing opportunities. Ability to make a difference and lasting impact. Work in a dynamic, collaborative, progressive, and high‑performing team. A world‑class training program in financial services. Flexible work/life balance options. Opportunities to do challenging work. Opportunities to take on progressively greater accountabilities. Opportunities to build close relationships with clients. Job Skills Agile Methodology, Group Problem Solving, IT Systems Integration, Organizational Leadership, Software Development Life Cycle (SDLC), Software Engineering, System Applications, System Integration Testing (SIT), Systems Software Additional Job Details Address: RBC WATERPARK PLACE, 88 QUEENS QUAY W:TORONTO City: Toronto Country: Canada Work hours/week: 37.5 Employment Type: Full time Platform: TECHNOLOGY AND OPERATIONS Job Type: Regular Pay Type: Salaried Posted Date: 2025-12-01 Application Deadline: 2026-01-12 Note: Applications will be accepted until 11:59 PM on the day prior to the application deadline date above. Inclusion and Equal Opportunity Employment At RBC, we believe an inclusive workplace that has diverse perspectives is core to our continued growth as one of the largest and most successful banks in the world. Maintaining a workplace where our employees feel supported to perform at their best, effectively collaborate, drive innovation, and grow professionally helps to bring our Purpose to life and create value for our clients and communities. RBC strives to deliver this through policies and programs intended to foster a workplace based on respect, belonging and opportunity for all. Join our Talent Community Stay in-the-know about great career opportunities at RBC. Sign up and get customized info on our latest jobs, career tips and Recruitment events that matter to you. Expand your limits and create a new future together at RBC. Find out how we use our passion and drive to enhance the well‑being of our clients and communities at jobs.rbc.com. #J-18808-Ljbffr



  • Toronto, Canada Autodesk Full time

    Position Overview We are seeking a highly motivated and experienced Senior Site Reliability Developer (SRE) to manage critical cloud infrastructure and site reliability operations for Autodesk's global Product Access journey. This pivotal role focuses on ensuring the highest reliability, availability, and performance of our AWS-hosted cloud infrastructure....


  • Toronto, Canada Royal Bank of Canada> Full time

    Job DescriptionWhat is the opportunity?This is an exciting opportunity to join a high-performing team that plays a critical role in ensuring the reliability, scalability, and performance of pre-production environments for ATM systems.  As a Senior Service Reliability Developer, you will be at the forefront of driving innovation and operational excellence in...


  • TORONTO, Canada Royal Bank of Canada Full time

    Job Description What is the opportunity? This is an exciting opportunity to join a high-performing team that plays a critical role in ensuring the reliability, scalability, and performance of pre-production environments for ATM systems. As a Senior Service Reliability Developer, you will be at the forefront of driving innovation and operational excellence...


  • Toronto, Canada Autodesk, Inc. Full time

    **Job Requisition ID #**25WD92369**Position Overview**We are seeking a highly motivated and experienced Senior Site Reliability Engineer (SRE) to manage critical cloudinfrastructure and site reliability operations for Autodesk's global Product Access journey. This pivotal role focuses on ensuringthe highest reliability, availability, and performance of our...


  • Toronto, Canada Autodesk Full time

    Job Requisition ID 25WD92369 Position Overview We are seeking a highly motivated and experienced Senior Site Reliability Developer (SRE) to manage critical cloud infrastructure and site reliability operations for Autodesk's global Product Access journey. This pivotal role focuses on ensuring the highest reliability, availability, and performance of our...


  • Toronto, Canada Autodesk Full time

    Job Requisition ID 25WD92369 Position Overview We are seeking a highly motivated and experienced Senior Site Reliability Developer (SRE) to manage critical cloud infrastructure and site reliability operations for Autodesk's global Product Access journey. This pivotal role focuses on ensuring the highest reliability, availability, and performance of our...


  • Toronto, Ontario, Canada Radio-Canada Full time

    Position TitleSenior Developer, Site Reliability (Digital Strategy And Product) (English Services)Status Of EmploymentContractee Long-Term (Fixed Term)Position Language RequirementLanguage Skills:Work at CBC/Radio-CanadaAt CBC/Radio-Canada, we create content that informs, entertains and connects Canadians on multiple platforms. Our successes and...


  • Toronto, Ontario, Canada Autodesk Full time

    Position OverviewWe are seeking a highly motivated and experienced Senior Site Reliability Developer (SRE) to manage critical cloud infrastructure and site reliability operations for Autodesk's global Product Access journey. This pivotal role focuses on ensuring the highest reliability, availability, and performance of our AWS-hosted cloud infrastructure....


  • Toronto, Ontario, Canada CBCRadio-Canada Full time

    Position Title: Senior Developer, Site Reliability (Digital Strategy And Product) (English Services)Status of Employment:Contractee Long-Term (Fixed Term)Position Language Requirement:Language Skills:Work at CBC/Radio-CanadaAt CBC/Radio-Canada, we create content that informs, entertains and connects Canadians on multiple platforms. Our successes and...


  • AMER - Canada - Ontario - Toronto - University Ave Autodesk Full time

    Job Requisition ID # 25WD92369Position OverviewWe are seeking a highly motivated and experienced Senior Site Reliability Developer (SRE) to manage critical cloud infrastructure and site reliability operations for Autodesk's global Product Access journey. This pivotal role focuses on ensuring the highest reliability, availability, and performance of our...