Lead Site Reliability Engineer

3 days ago


Toronto ON MW A, Canada RBC Full time $900,000 - $1,250,000 per year

Job Description

What is the opportunity?

Join RBC as a Lead Site Reliability Engineer and take the lead in ensuring the reliability, scalability, and performance of our critical production systems and infrastructure. This is your chance to drive innovation through cutting-edge engineering practices, automation, and process optimization. Collaborate with cross-functional teams, manage key vendor relationships, and tackle complex, high-stakes challenges in a dynamic and supportive environment. With a focus on operational excellence and compliance, this role offers the opportunity to make a meaningful impact at one of the world's most respected financial institutions. If you're a visionary leader with expertise in modern infrastructure technologies and a passion for solving complex problems, this is your opportunity to elevate your career while shaping the future of RBC's technology landscape.

What will you do?

  • Lead strategic direction for ~4,000 ATM fleet operations, ensuring 99.7% availability
  • Drive continuous improvement and process optimization across ATM operations
  • Lead technology upgrade and operational change management initiatives
  • Serve as primary relationship owner for vendor field services, maintenance, and support.
  • Develop strategic partnerships with vendors and internal technology teams
  • Deliver executive-level reporting and communication to senior leadership and business stakeholders
  • Establish and monitor performance metrics, SLAs, and KPIs for vendor and operational excellence
  • Define and maintain ATM-specific SLOs/SLAs/SLIs (e.g., transaction success rates, uptime, latency)
  • Ensure end-to-end reliability of ATM ecosystem (hardware, software, network)
  • Oversee regulatory compliance, security standards, and audit requirements with full accountability
  • Manage risk, business continuity, and disaster recovery planning
  • Act as final escalation point for critical outages and emergency response coordination
  • Lead 24/7 incident response, ensuring rapid resolution of customer-impacting issues
  • Perform RCA for AI/ML-related incidents and implement preventive measures
  • Implement real-time monitoring, alerting, and observability tools (hardware, transactions, network)
  • Automate routine tasks (software updates, configurations, log analysis)
  • Collaborate with data scientists, engineers, and operations teams on complex issues
  • Align daily standups/project calls with development, QE, and management teams.

What will you need to succeed?

Must have:

  • Bachelor's degree in business administration, Information Technology, Engineering, or related field
  • Minimum 5-7 years of experience in ATM technology management or financial services technology
  • Proven experience managing vendor relationships and large-scale technology operations
  • Strong knowledge of Vendor and other ATM technology platforms and capabilities
  • Demonstrated leadership experience managing cross-functional teams and stakeholders
  • Knowledge of banking regulations, compliance requirements, and security standards
  • Experience with budget management, financial analysis, and cost optimization
  • Experience with ITIL framework and service management best practices
  • Strong project management skills with experience in technology deployment projects
  • Experience with PowerShell scripting and automation concepts (intermediate level)
  • Knowledge of SCCM and remote management technologies
  • Knowledge of cloud technologies and hybrid infrastructure management

Nice-to-have:

  • Experience managing and optimizing large-scale ATM ecosystems, including hardware, software, and network infrastructure, to ensure seamless operations.
  • Familiarity with financial transaction processing systems, including payment networks and protocols (e.g., ISO 8583), to support secure and efficient transactions.
  • Hands-on experience with cloud-based infrastructure and hybrid environments, leveraging tools like Kubernetes, Docker, or Terraform for scalability and automation.
  • Proficiency with advanced monitoring and observability tools (e.g., Prometheus, Grafana, Splunk) to enhance system reliability, performance, and proactive issue resolution.

What's in it for you?

  • Lead and shape the reliability, scalability, and performance of RBC's critical production systems, directly impacting millions of customers.
  • Work with cutting-edge technologies and drive innovation in a high-availability, mission-critical environment.
  • Collaborate with a diverse, talented team of professionals across development, security, quality assurance, and operations to solve complex challenges.
  • Access unparalleled professional growth opportunities, including leadership development, technical training, and exposure to large-scale, complex systems.
  • Thrive in a supportive and inclusive workplace culture that values your expertise, fosters innovation, and recognizes your contributions.
  • Enjoy competitive compensation, comprehensive benefits, and a strong focus on work-life balance to support your overall well-being.
  • Make a meaningful impact in a role that combines technical expertise, strategic leadership, and the opportunity to shape the future of RBC's technology landscape.
LI-POST
TECHPJ

Job Skills

Agile Methodology, Group Problem Solving, IT Systems Integration, Organizational Leadership, Product Services, Software Development Life Cycle (SDLC), System Applications, System Integration Testing (SIT), Systems Software

Additional Job Details

Address:

RBC WATERPARK PLACE, 88 QUEENS QUAY W:TORONTO

City:

Toronto

Country:

Canada

Work hours/week:

37.5

Employment Type:

Full time

Platform:

TECHNOLOGY AND OPERATIONS

Job Type:

Regular

Pay Type:

Salaried

Posted Date:

Application Deadline:

Note: Applications will be accepted until 11:59 PM on the day prior to the application deadline date above

Inclusion and Equal Opportunity Employment

At RBC, we believe an inclusive workplace that has diverse perspectives is core to our continued growth as one of the largest and most successful banks in the world. Maintaining a workplace where our employees feel supported to perform at their best, effectively collaborate, drive innovation, and grow professionally helps to bring our Purpose to life and create value for our clients and communities. RBC strives to deliver this through policies and programs intended to foster a workplace based on respect, belonging and opportunity for all.

Join our Talent Community

Stay in-the-know about great career opportunities at RBC. Sign up and get customized info on our latest jobs, career tips and Recruitment events that matter to you.

Expand your limits and create a new future together at RBC. Find out how we use our passion and drive to enhance the well-being of our clients and communities



  • (s): Canada : Ontario : Toronto Scotiabank Global Site Full time $105,000 - $170,000 per year

    Requisition ID: 244026Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture.Overview: As a Site Reliability Engineer (SRE), you will join the Digital Engineering Operations team, responsible for ensuring the operations and reliability of Scotiabank digital applications. You will have the opportunity to drive...


  • (s): Canada : Ontario : Toronto Scotiabank Global Site Full time US$80,000 - US$140,000 per year

    Requisition ID: 244027Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture.Overview: As a Site Reliability Engineer (SRE), you will join the Digital Engineering Operations team, responsible for ensuring the operations and reliability of Scotiabank digital applications. You will have the opportunity to drive...


  • Toronto, ON MW A, Canada RBC Full time $80,000 - $120,000 per year

    Job DescriptionWhat is the opportunity?This is an exciting opportunity to join a high-impact team responsible for ensuring the reliability, scalability, and performance of critical ATM production systems. As a Senior Service Reliability Engineer, you will play a pivotal role in shaping the future of our ATM services by driving innovation, implementing...


  • Toronto, Ontario, Canada AceStack Full time $120,000 - $200,000 per year

    Job Title: Lead Site Reliability Engineer – Banking Domain (Wealth Management Preferred)Location: Toronto Downtown, ON (Onsite – 5 Days/Week)Duration: ContractExperience: 14+ YearsAbout the Role:We are looking for a highly skilled Site Reliability Engineering (SRE) Lead with a strong background in the Banking domain, ideally within Wealth Management. The...


  • Toronto, Canada Dayforce US, Inc. Full time

    Dayforce is a global human capital management (HCM) company headquartered in Toronto, Ontario, and Minneapolis, Minnesota, with operations across North America, Europe, Middle East, Africa (EMEA), and the Asia Pacific Japan (APJ) region. Our award-winning Cloud HCM platform offers a unified solution database and continuous calculation engine, driving...


  • Toronto, Canada Dayforce US, Inc. Full time

    Dayforce is a global human capital management (HCM) company headquartered in Toronto, Ontario, and Minneapolis, Minnesota, with operations across North America, Europe, Middle East, Africa (EMEA), and the Asia Pacific Japan (APJ) region. Our award-winning Cloud HCM platform offers a unified solution database and continuous calculation engine, driving...


  • Toronto, Canada Dayforce US, Inc. Full time

    Dayforce is a global human capital management (HCM) company headquartered in Toronto, Ontario, and Minneapolis, Minnesota, with operations across North America, Europe, Middle East, Africa (EMEA), and the Asia Pacific Japan (APJ) region. Our award-winning Cloud HCM platform offers a unified solution database and continuous calculation engine, driving...


  • Toronto, Canada Dayforce US, Inc. Full time

    Dayforce is a global human capital management (HCM) company headquartered in Toronto, Ontario, and Minneapolis, Minnesota, with operations across North America, Europe, Middle East, Africa (EMEA), and the Asia Pacific Japan (APJ) region. Our award-winning Cloud HCM platform offers a unified solution database and continuous calculation engine, driving...


  • Toronto, Canada SimCorp Full time

    Join to apply for the Lead Site Reliability Engineer role at SimCorp. About SimCorp Join some of the most innovative thinkers in FinTech as we lead the evolution of financial technology. If you are an innovative, curious, collaborative person who embraces challenges and wants to grow, learn and pursue outcomes with our prestigious financial clients, say...


  • Etobicoke, ON MV C, Canada AVT Reliability Canada Full time $55,000 - $70,000 per year

    Job SummaryAVT Reliability is a global leader in asset management, condition monitoring, and engineering support for industrial manufacturing and engineering clients. We specialize in predictive maintenance, condition monitoring, and asset reliability services across the petrochemical, manufacturing, and energy sectors.As we continue to grow, we are...