Senior Site Reliability Engineer

2 weeks ago


Old Toronto, Canada RBC - Royal Bank Full time
Job Summary

This role will be responsible for assisting in the development, implementation, and support of Site Reliability Engineering (SRE) solutions for applications supported by the Branch Technology in Digital organization. The incumbent will need introductory knowledge and experience working in an application development and/or technology operations organization. Perform production support role and partner with SRE delivery team in incident management and problem management.

What Will You Do?
  1. Assist in development of SRE solutions (monitoring and alerting, machine learning anomaly detection, self-healing and reliability testing)
  2. Work on assigned stories in agile based squad on scrum cadence
  3. Implement monitoring and alerting, anomaly detection, self-healing and reliability testing for applications in scope
  4. Supports unit's goals to adopt automation solutions for applications in scope
  5. Perform production support role, including off-hours support and rotational on-call support to be compensated accordingly with overtime pay, lieu time, and on-call allowance
  6. Assist in incident management and problem management for applications in scope
  7. Evaluate continuously - what went well, what went wrong, what can be done to improve and prevent in future
  8. Assist in maintaining technology currency (perform server patching, certificate renewal, etc.) with keen eye on automating opportunities
  9. Assist in ensuring availability and uptime of applications in scope, as per service level objectives
  10. Assist in ensuring compliance of all systems and applications in scope, including maintaining segregation of duties
Must Have
  • Introductory knowledge of industry practices, with focus on SRE
  • Introductory experience in a variety of environments (Cloud, distributed and mainframe, business workflows and services/APIs, databases)
  • Excellent communication skills, direct style (e.g. I did or did not do something, it does or does not work as opposed I believe or I understand it to be)
  • Hands-on experience in a variety of languages and tools (GitHub, Slack)
  • Hands-on experience in a variety of SRE languages and tools (Ansible, Dynatrace Managed, Moog, PagerDuty, Splunk, ServiceNow, Elastic, Logstash, Kibana, Blue Prism, Catch Point)
Nice to Have
  • Computer Engineering, Computer Science, related (technical) degree/diploma, or related breadth of experience
  • Exposure to Docker and OCP
  • Exposure to UCD, Openshift, Kubernetes, PCF (Pivotal Cloud Foundry), GitHub
  • Experience in agile ways of working
What's in it for you?

We thrive on the challenge to be our best, progressive thinking to keep growing, and working together to deliver trusted advice to help our clients thrive and communities prosper. We care about each other, reaching our potential, making a difference to our communities, and achieving success that is mutual.

A comprehensive Total Rewards Program including bonuses and flexible benefits, competitive compensation, commissions, and stock where applicable

Leaders who support your development through coaching and managing opportunities

Ability to make a difference and lasting impact

Work in a dynamic, collaborative, progressive, and high-performing team

A world-class training program in financial services

Flexible work/life balance options

Opportunities to do challenging work

Job Skills
  • Agile Methodology
  • Group Problem Solving
  • IT Systems Integration
  • Organizational Leadership
  • Software Development Life Cycle (SDLC)
  • Software Engineering
  • System Applications
  • System Integration Testing (SIT)
  • Systems Software
#J-18808-Ljbffr

  • Old Toronto, Canada Lloyds Banking Group Full time

    Job Description - Senior Site Reliability EngineerJOB TITLE: Senior Site Reliability Engineer (SRE)LOCATION: Halifax, Leeds or ManchesterHOURS: Full-timeWORKING PATTERN: Our work style is hybrid, which involves spending at least two days per week, or 40% of our time, at one of our office sites.Who are Lloyds Banking Group and where does this role sit?If you...


  • Old Toronto, Canada Practice Better Full time

    About us:Practice Better is a leading all-in-one practice management software solution transforming how health & wellness professionals run their practices and support their clients. The company serves 15,000+ customers in over 70+ countries across the globe, and processes hundreds of millions annually in payments on behalf of customers. Over 65% of growth...


  • Old Toronto, Canada Manulife Insurance Malaysia Full time

    Senior Site Reliability Engineer page is loaded Senior Site Reliability Engineer Postuler locations Waterloo, Ontario Toronto, siège social mondial (200 Bloor) time type Temps plein posted on Publié hier job requisition id JR24020202 Nous sommes un fournisseur de services financiers qui s’emploie à faciliter les...


  • Old Toronto, Canada Thomson Reuters Full time

    (Canada) Site Reliability Engineer (Contract) Contract (9 months 4 days) Published 3 days ago New Relic Data Dog Site Reliability Engineer - in the Service Management OrganizationDo you have experience in IT Service Management, working with cloud providers, software development, and technology infrastructure?The Site Reliability Engineer will...


  • Old Toronto, Canada Jobber Full time

    Jobber exists to help people in small businesses be successful. We work with small home service businesses, like your local plumbers, painters, and landscapers, to transform the way service is delivered through technology. With Jobber they can quote, schedule, invoice, and collect payments from their customers, while providing an easy and professional...


  • Old Toronto, Canada Autodesk Full time

    Position Overview Autodesk, the leading Design and Make Software Company, is looking for a Principal Site Reliability Engineer to join the Autodesk Platform Services Engineering team in Toronto, Canada. In this role, you will help build trusted services of APS (Autodesk Platform Services) measured by Service Level Objectives (SLOs) and Mean Time to Recovery...


  • Old Toronto, Canada Sentry Full time

    Bad software is everywhere, and we’re tired of it. Sentry is on a mission to help developers write better software faster, so we can get back to enjoying technology.With more than $217 million in funding and 90,000 organizations that believe we’re on to something, we're building performance and error monitoring tools that help companies like Disney,...


  • Old Toronto, Canada CB Canada Full time

    Site Reliability Engineer On behalf of our client in the Banking Sector, PROCOM is looking for a Site Reliability Engineer. Site Reliability Engineer – Job Description Azure cloud Jira and confluence CICD Experience with automating (provisioning, configuration management, deployment) and integrating Azure PaaS solutions (Azure App services, Azure...


  • toronto, Canada OnX Canada Full time

    OnX is looking for a Site Reliability Engineer for one our clients in Toronto. Client: Financial Services Location: Toronto, mostly remote Duration: 6 months with potential extension JBoss in middleware experience is super important Responsibilities: Following the senior technicians plans to buil


  • Toronto, Canada Thomson Reuters Full time

    Thomson Reuters is seeking a Senior Site Reliability Engineer to join our Service Management, Technology team. This role calls for an individual who is capable of analyzing customer problems of high complexity and assessing the scope of impact, while mitigating customer impact of issues and executing work arounds. Willingness to learn is an important aspect...


  • Old Toronto, Canada Scotiabank Full time

    Press Tab to Move to Skip to Content Link Select how often (in days) to receive an alert: Select how often (in days) to receive an alert: Please be advised that our Careers site will be unavailable from November 28 at 12am ET to November 29 12am ET for scheduled system maintenance. Title: Site Reliability Engineer Requisition ID:...


  • Old Toronto, Canada Akamai Full time

    Are you passionate about cutting edge technology? Do solving some of the Internet's most difficult content delivery challenges interest you? Join our Compute Site Reliability team! Our team is responsible for monitoring and measuring the reliability of our suite of Compute products and platform. In collaboration with Engineering and Product teams, we focus...


  • Old Toronto, Canada Zendesk Full time

    Job Description Zendesk is a service-first CRM company that builds powerful, customizable software designed to improve customer relations. At Zendesk, we encourage growth, innovation, and believe in giving back to the communities we call home. The ideal candidate will want to join a growing team. You have recent experience with full-stack cloud native...


  • Old Toronto, Canada Zendesk Full time

    Job Description Zendesk is a service-first CRM company that builds powerful, customizable software designed to improve customer relations. At Zendesk, we encourage growth, innovation, and believe in giving back to the communities we call home. The ideal candidate will want to join a growing team. You have recent experience with full-stack cloud native...


  • Old Toronto, Canada RBC - Royal Bank Full time

    Job Summary Job Description What is the opportunity? The Personal and Commercial Banking (P&CB) arm of RBC is in the preparation stages of launching a groundbreaking program to deliver the best customer experiences by equipping our advisors with the very latest technology and processes in the industry. We have partnered with Salesforce to build out our next...


  • Old Toronto, Canada eTeam Full time

    Remote Work Duration 4 months - Preference is to find candidates who are willing to be converted to full-time employees. The conversion decision will be made based on performance. Job Description Role Description: Defining and measuring reliability goals—SLIs, SLOs, and error budgets for user journey. Designing for and implementing observability (ELK,...


  • Toronto, Canada CB Canada Full time

    Site Reliability Engineer On behalf of our client in the Banking Sector, PROCOM is looking for a Site Reliability Engineer. Site Reliability Engineer – Job Description Azure cloud Jira and confluence CICD Experience with automating (provisioning, configuration management, deployment) and integrating Azure PaaS solutions (Azure App services, Azure...


  • Old Toronto, Canada NVIDIA Full time

    Site Reliability Engineering (SRE) at NVIDIA Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and availability using the combination of software and systems engineering practices. This is a highly specialized discipline which demand knowledge across...


  • Old Toronto, Canada NVIDIA Full time

    Site Reliability Engineering (SRE) at NVIDIA Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and availability using the combination of software and systems engineering practices. This is a highly specialized discipline which demand knowledge across...


  • Old Toronto, Canada NVIDIA Full time

    Site Reliability Engineering (SRE) at NVIDIA Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and availability using the combination of software and systems engineering practices. This is a highly specialized discipline which demand knowledge across...