Senior Site Reliability Engineer
4 days ago
Job Description
What is the role?
Senior Site Reliability Engineer
As a Sr. SRE, you will play a critical role in ensuring the availability, reliability, scalability, and performance of key applications, balancing production support responsibilities with continuous improvement initiatives. The ideal candidate will have deep expertise in agile application development, operations, technology lifecycle management, infrastructure, and automation to reduce toil, improve observability, resolve complex production incidents, and address underlying root causes.
What will you do?
- Perform application production support, including off-hours support.
- Develop SRE solutions (monitoring and alerting, machine learning anomaly detection, self-healing, and reliability testing).
- Run the production environment by monitoring availability and taking a holistic view of system health.
- Build software and systems to manage platform infrastructure and applications. Improve reliability, quality, and time-to-market of our suite of software solutions.
- Assist in incident management and problem management for applications in scope.
- Maintain technology currency (manage server patching, certificate renewal, etc.) with a keen eye on automation opportunities.
- Ensure availability and uptime of applications in scope, as per service level objectives.
- Ensure compliance of all systems and applications in scope, including maintaining segregation of duties.
- Implement monitoring and alerting, anomaly detection, self-healing and reliability testing for applications in scope.
- Detect, diagnose, and resolve incidents; analyze, identify, and address problems; and review and raise change tickets as required.
- Implement SLIs/SLOs and ensure availability targets for mission-critical applications.
- Ensure compliance with regulatory and security requirements, including segregation of duties for sensitive environments.
- Stay ahead of emerging technologies, leveraging continuous learning opportunities to drive innovation and efficiency.
- Provide hands-on application production support, including off-hours coverage as needed.
What do you need to succeed?
Must-have:
- 3+ years of experience in Application Support, Software Development (SDLC), and Operations.
- Strong proficiency in at least two programming languages or data technologies (Java, Python, .NET, SQL, databases).
- Good understanding of resilient IT solutions, driving continuous service improvements, and enhancing production reliability through automation and best practices.
- Advanced experience in a variety of environments (Linux, Windows, databases, cloud, distributed and mainframe, business workflows, and services/APIs).
- Hands-on experience with a variety of DevOps / SRE tools (Ansible, Dynatrace, Moogsoft, PagerDuty, ServiceNow, Elastic, Logstash, Kibana, LogicMonitor, Jenkins, Cucumber, CA Work Automation, Power BI, ETL-related tools, etc.).
- Excellent communication, analytical, and problem-solving skills to diagnose and resolve complex production incidents and lead blameless postmortems to identify and address root causes.
- Effective negotiation and stakeholder management skills, with a direct communication style.
Nice-to-have:
- Prior experience working as an SRE in the financial services industry is preferred.
- Knowledge of Digital Identity Access Management, Internet / Mobile Banking Platforms, Microservices, Data Services, Test Automation and Corporate applications (HR, Finance, Risk, Compliance etc.) is preferred.
What's in it for you?
We thrive on the challenge to be our best, progressive thinking to keep growing, and working together to deliver trusted advice to help our clients thrive and communities prosper. We care about each other, reaching our potential, making a difference to our communities, and achieving success that is mutual.
- A comprehensive Total Rewards Program including competitive compensation, bonuses, and flexible benefits.
- Continued opportunities for career advancement.
- World-class sales training, coaching, and development opportunities.
- Support from a dynamic, collaborative, progressive, and high performing team, as well as world-class tools and training.
- Opportunity to achieve great success and grow your career with RBC.
Job Skills
Agile Methodology, Group Problem Solving, IT Systems Integration, Organizational Leadership, Product Services, Software Development Life Cycle (SDLC), System Applications, System Integration Testing (SIT), Systems Software
Additional Job Details
Address: RBC CENTRE, 155 WELLINGTON ST W:TORONTO
City: Toronto
Country: Canada
Work hours/week:
Employment Type: Full time
Platform: TECHNOLOGY AND OPERATIONS
Job Type: Regular
Pay Type: Salaried
Posted Date:
Application Deadline:
Note: Applications will be accepted until 11:59 PM on the day prior to the application deadline date above
Inclusion and Equal Opportunity Employment
At RBC, we believe an inclusive workplace that has diverse perspectives is core to our continued growth as one of the largest and most successful banks in the world. Maintaining a workplace where our employees feel supported to perform at their best, effectively collaborate, drive innovation, and grow professionally helps to bring our Purpose to life and create value for our clients and communities. RBC strives to deliver this through policies and programs intended to foster a workplace based on respect, belonging and opportunity for all.
Join our Talent Community
Stay in-the-know about great career opportunities at RBC. Sign up and get customized info on our latest jobs, career tips and Recruitment events that matter to you.
Expand your limits and create a new future together at RBC. Find out how we use our passion and drive to enhance the well-being of our clients and communities.