Lead Site Reliability Engineer
3 days ago
Job Description
What is the opportunity?
Join RBC as a Lead Site Reliability Engineer and take the lead in ensuring the reliability, scalability, and performance of our critical production systems and infrastructure. This is your chance to drive innovation through cutting-edge engineering practices, automation, and process optimization. Collaborate with cross-functional teams, manage key vendor relationships, and tackle complex, high-stakes challenges in a dynamic and supportive environment. With a focus on operational excellence and compliance, this role offers the opportunity to make a meaningful impact at one of the world's most respected financial institutions. If you're a visionary leader with expertise in modern infrastructure technologies and a passion for solving complex problems, this is your opportunity to elevate your career while shaping the future of RBC's technology landscape.
What will you do?
- Lead strategic direction for the operations of a fleet of ~4,000 ATMs, ensuring 99.7% availability
- Drive continuous improvement and process optimization across ATM operations
- Lead technology upgrade and operational change management initiatives
- Serve as primary relationship owner for vendor field services, maintenance, and support
- Develop strategic partnerships with vendors and internal technology teams
- Deliver executive-level reporting and communication to senior leadership and business stakeholders
- Establish and monitor performance metrics, SLAs, and KPIs for vendor and operational excellence
- Define and maintain ATM-specific SLOs/SLAs/SLIs (e.g., transaction success rates, uptime, latency)
- Ensure end-to-end reliability of ATM ecosystem (hardware, software, network)
- Oversee regulatory compliance, security standards, and audit requirements with full accountability
- Manage risk, business continuity, and disaster recovery planning
- Act as final escalation point for critical outages and emergency response coordination
- Lead 24/7 incident response, ensuring rapid resolution of customer-impacting issues
- Perform root cause analysis (RCA) for AI/ML-related incidents and implement preventive measures
- Implement real-time monitoring, alerting, and observability tools (hardware, transactions, network)
- Automate routine tasks (software updates, configurations, log analysis)
- Collaborate with data scientists, engineers, and operations teams on complex issues
- Align daily standups/project calls with development, QE, and management teams
What will you need to succeed?
Must have:
- Bachelor's degree in Business Administration, Information Technology, Engineering, or a related field
- 5-7 years of experience in ATM technology management or financial services technology
- Proven experience managing vendor relationships and large-scale technology operations
- Strong knowledge of Vendor and other ATM technology platforms and capabilities
- Demonstrated leadership experience managing cross-functional teams and stakeholders
- Knowledge of banking regulations, compliance requirements, and security standards
- Experience with budget management, financial analysis, and cost optimization
- Experience with ITIL framework and service management best practices
- Strong project management skills with experience in technology deployment projects
- Experience with PowerShell scripting and automation concepts (intermediate level)
- Knowledge of SCCM and remote management technologies
- Knowledge of cloud technologies and hybrid infrastructure management
Nice-to-have:
- Experience managing and optimizing large-scale ATM ecosystems, including hardware, software, and network infrastructure, to ensure seamless operations.
- Familiarity with financial transaction processing systems, including payment networks and protocols (e.g., ISO 8583), to support secure and efficient transactions.
- Hands-on experience with cloud-based infrastructure and hybrid environments, leveraging tools like Kubernetes, Docker, or Terraform for scalability and automation.
- Proficiency with advanced monitoring and observability tools (e.g., Prometheus, Grafana, Splunk) to enhance system reliability, performance, and proactive issue resolution.
What's in it for you?
- Lead and shape the reliability, scalability, and performance of RBC's critical production systems, directly impacting millions of customers.
- Work with cutting-edge technologies and drive innovation in a high-availability, mission-critical environment.
- Collaborate with a diverse, talented team of professionals across development, security, quality assurance, and operations to solve complex challenges.
- Access unparalleled professional growth opportunities, including leadership development, technical training, and exposure to large-scale, complex systems.
- Thrive in a supportive and inclusive workplace culture that values your expertise, fosters innovation, and recognizes your contributions.
- Enjoy competitive compensation, comprehensive benefits, and a strong focus on work-life balance to support your overall well-being.
- Make a meaningful impact in a role that combines technical expertise, strategic leadership, and the opportunity to shape the future of RBC's technology landscape.
Job Skills
Agile Methodology, Group Problem Solving, IT Systems Integration, Organizational Leadership, Product Services, Software Development Life Cycle (SDLC), System Applications, System Integration Testing (SIT), Systems Software
Additional Job Details
Address:
RBC WaterPark Place, 88 Queens Quay W, Toronto
City:
Toronto
Country:
Canada
Work hours/week:
37.5
Employment Type:
Full time
Platform:
TECHNOLOGY AND OPERATIONS
Job Type:
Regular
Pay Type:
Salaried
Posted Date:
Application Deadline:
Note: Applications will be accepted until 11:59 PM on the day prior to the application deadline date above
Inclusion and Equal Opportunity Employment
At RBC, we believe an inclusive workplace that has diverse perspectives is core to our continued growth as one of the largest and most successful banks in the world. Maintaining a workplace where our employees feel supported to perform at their best, effectively collaborate, drive innovation, and grow professionally helps to bring our Purpose to life and create value for our clients and communities. RBC strives to deliver this through policies and programs intended to foster a workplace based on respect, belonging and opportunity for all.