Senior Site Reliability Engineer

2 weeks ago


Toronto, Canada RBC Full time

Overview

What is the Opportunity?

We are on the lookout for a talented Senior Site Reliability Engineer to join our forward-thinking team responsible for the development and enhancement of our CI/CD deployment portal. This platform is designed to facilitate the swift and secure deployment of applications to various cloud environments, supporting all RBC application developers. You will focus on implementing solutions that streamline application delivery, improve operational efficiency, and ensure the platform's reliability and scalability, while leveraging AI-driven tools and methodologies.

Responsibilities

  • Ensure the performance, quality, and responsiveness of the platform, with a strong emphasis on Site Reliability Engineering (SRE) principles, including robust monitoring, alerting, and incident response practices.
  • Maintain and enhance the operational capabilities of the platform, ensuring an intuitive user experience while enabling seamless integration with tools and services to proactively identify and resolve potential issues.
  • Collaborate with cross-functional teams to implement and deliver features for our deployment platform, focusing on automation, scalability, and operational efficiency, with integration of AI-driven solutions.
  • Implement deployment and management patterns for the various tools on our DevOps platform, optimizing resource allocation and deployment strategies, with use of AI to improve processes.
  • Integrate with cloud services and infrastructure to guarantee secure and efficient application deployment, while exploring opportunities to enhance security and scalability.
  • Develop and execute automated testing procedures to confirm platform stability and dependability, improving test coverage and identifying edge cases.
  • Participate in code review processes and contribute to the collective knowledge by documenting technical procedures and methodologies, including those involving AI tools.
  • Stay informed of emerging development practices and technologies, actively contributing to the ongoing enhancement of our technology stack and platform capabilities.

The role requires providing on-call support.

What do you need to succeed?

Must-Have Skills:

  • 3+ years of working experience in Site Reliability Engineering (SRE) and best practices for running and maintaining critical systems, including monitoring, alerting, and incident management.
  • Experience with implementing and deploying systems into integrated environments.
  • Proficient with cloud-based services (e.g., AWS, Azure) and a strong grasp of developing cloud-native applications.
  • Proficiency with Terraform for Infrastructure as Code (IaC).
  • Solid understanding of version control systems, particularly Git.
  • Strong analytical skills, problem-solving abilities, and excellent communication skills.

Nice-to-Have Skills:

  • Bachelor’s degree in Computer Science, Engineering, or in a field relevant to the role.
  • Experience with full stack development, including experience with frameworks and languages such as JavaScript, React, Node.js, Python, or similar.
  • Knowledge of Continuous Integration/Continuous Delivery (CI/CD) methodologies and associated tools.
  • Familiarity with container technologies like Docker and orchestration platforms like Kubernetes.
  • Experience using AI tools and efficient prompting of LLMs for operational improvements.
  • Understanding of how AI can enhance CI/CD processes and operational workflows, and experience working with models and MCP servers.
  • A focus on improving operational efficiency and system reliability, with experience leveraging AI for these purposes.

What’s in it for you?

We thrive on the challenge to be our best, progressive thinking to keep growing, and working together to deliver trusted advice to help our clients thrive and communities prosper. We care about each other, reaching our potential, making a difference to our communities, and achieving success that is mutual.

  • A comprehensive Total Rewards Program including bonuses and flexible benefits, competitive compensation, commissions, and stock where applicable.
  • Leaders who support your development through coaching and managing opportunities.
  • Ability to make a difference and lasting impact.
  • Work in a dynamic, collaborative, progressive, and high-performing team.
  • A world-class training program in financial services.
  • Flexible work/life balance options.
  • Opportunities to do challenging work, including leveraging SRE principles and AI-driven enhancements to drive innovation and operational excellence.

Note: Applications will be accepted until 11:59 PM on the day prior to the application deadline date above.

Inclusion and Equal Opportunity Employment

At RBC, we believe an inclusive workplace that has diverse perspectives is core to our continued growth as one of the largest and most successful banks in the world. Maintaining a workplace where our employees feel supported to perform at their best, effectively collaborate, drive innovation, and grow professionally helps to bring our Purpose to life and create value for our clients and communities. RBC strives to deliver this through policies and programs intended to foster a workplace based on respect, belonging and opportunity for all.



  • Toronto, Canada Tubi Full time

    Join to apply for the Senior Site Reliability Engineer role at Tubi. About Tubi: Boldly built for every fandom, Tubi is a free streaming service that entertains over 100 million monthly active users. Tubi offers the world's largest collection of Hollywood movies and TV shows, thousands of creator-led stories and hundreds of Tubi Originals made for the most...


  • Toronto, Canada Kyndryl Full time

    Site Reliability Engineer at Kyndryl. Position: Site Reliability Engineer. Client: Financial Services - Capital Markets Technology. Duration: 12-month contract with potential...


  • Toronto, Canada Tubi Full time

    Senior Manager, Site Reliability Engineering at Tubi. About Tubi: Boldly built for every fandom, Tubi is a free streaming service that entertains over 100 million monthly active users. Tubi offers the world's largest collection of Hollywood movies and TV shows, thousands...


  • Toronto, Canada Circle Full time

    Join to apply for the Senior Site Reliability Engineer role at Circle. Circle is a financial technology company at the epicenter of the emerging internet of money, where value can travel like other digital data—globally, nearly instantly and less expensively than legacy settlement systems. This groundbreaking new internet layer opens up previously...


  • Toronto, Canada Tubi Tv Full time

    Senior Manager, Site Reliability Engineering Overview About Tubi: Boldly built for every fandom, Tubi is a free streaming service that entertains over 100 million monthly active users. Tubi offers the world's largest collection of Hollywood movies and TV shows, thousands of creator-led stories and hundreds of Tubi Originals made for the most passionate...


  • Toronto, Canada Mindlance Full time

    Role: Site Reliability Engineer. Location: Toronto, ON. Duration: 12-month contract (onsite 4 days a week). Job Description: 10+ years relevant SRE experience. 2+ years of relevant ITRS Geneos (Version 7) Experience on multiple projects with multiple interfaces and/or 3rd parties in the Monitoring, OpenTelemetry and Market Data space....


  • Toronto, Canada Global Technical Talent Full time

    Primary Job Title: Site Reliability Engineer IV. Alternate / Related Job Titles: Site Reliability Engineer, Senior SRE, IT Reliability Engineer, Systems Integration Engineer. Location & Onsite Flexibility: Toronto, ON, Hybrid (4 days onsite). Office Address: 66 Wellington Street West, 19th Floor, Toronto, ON. Contract Details: Position Type: Contract. Contract...


  • Toronto, Canada Affirm Full time

    Senior Software Engineer - SRE, Backend (Reliability Engineering). Affirm is reinventing credit to make it more honest and friendly, giving consumers the flexibility to buy now and pay later without any hidden fees or compounding interest. Responsibilities: Site Reliability Engineering at Affirm to help engineering partners operate what they own, defining...