Cloud Reliability Engineer

6 days ago


Old Toronto, Canada TD Bank Full time
About TD Bank

TD Bank is a leading financial institution committed to delivering exceptional customer experiences and innovative banking solutions.

Job Summary

We are seeking an experienced Cloud Reliability Engineer to join our team in Toronto, Ontario. This role will be responsible for ensuring the stability, scalability, and reliability of our cloud-based systems.

Key Responsibilities
  • Provide day-to-day support for applications/systems through accurate problem identification and timely resolution of production issues.
  • Ensure timely notification and escalation of possible issues/problems, options, and recommendations.
  • Responsible for incident management (2nd level), monthly maintenance, state of health monitoring, and SLA maintenance.
  • Continuously strive to improve the stability of the production environment by partnering closely with key stakeholders on setting up, maintaining, and monitoring applications/systems.
Requirements
  • Minimum 8 years' experience managing a team of SMEs responsible for ensuring platform stability, scalability, and reliability.
  • Expert with Performance testing tool – JMeter/LR and APM/Log analysis tools.
  • Excellent Soft Skills including Speaking/presentation skills in a professional environment.
Benefits

TD Bank offers a competitive salary range of $76,800 - $115,200 CAD per annum, depending on experience.

The successful candidate will also have access to a comprehensive benefits package, including medical, dental, and vision coverage, as well as opportunities for career growth and development.



  • Old Toronto, Canada Royal Bank of Canada> Full time

    We are seeking a skilled Cloud Reliability Engineer to join our Digital team at RBC in Toronto, Canada.As a Cloud Reliability Engineer, you will be responsible for running the production environment by monitoring availability and taking a holistic view of system health. This includes debugging production issues across services and levels of the stack,...


  • Old Toronto, Canada Ascend Fundraising Solutions Full time

    We are seeking a skilled Cloud Reliability Engineer to collaborate with our IT team in Toronto. In this role, you will work closely with the client services team to diagnose, troubleshoot, and resolve system reliability issues.Responsibilities:Take ownership of customer-reported issues and drive them to resolution.Develop proactive measures to prevent...


  • Old Toronto, Canada The Home Depot Canada Full time

    About The JobAs a Cloud Reliability Engineer Lead at The Home Depot Canada, you will play a crucial role in ensuring the reliability, performance, and operational support of our eCommerce systems.Job OverviewThis position requires a strong background in reliability reviews, performance engineering practices, production engineering, and operational support,...

  • Reliability Engineer

    1 month ago


    Old Toronto, Canada Thomson Reuters Full time

    About the RoleWe are seeking a skilled Reliability Engineer - Cloud Systems to join our team at Thomson Reuters.As a Reliability Engineer - Cloud Systems, you will be responsible for analyzing and resolving chronic and major issues affecting our cloud-based services.Key responsibilities include:Designing and implementing scalable systems and...


  • Old Toronto, Canada Loblaw Companies Ltd - Head Office Full time

    Cloud Engineering OpportunityWe are seeking an experienced Site Reliability Engineer to join our team at Loblaw Companies Ltd - Head Office. This role offers a unique opportunity to design, develop, and maintain cloud native solutions using services like Kubernetes, AppEngine, Cloud Functions, CloudSql, BigQuery, Pub/Sub on Google Cloud Platform and...


  • Old Toronto, Canada Thomson Reuters Full time

    Site Reliability Engineer Job DescriptionThis role is part of our Service Management Organization and involves IT Service Management, cloud providers, software development, and technology infrastructure experience.The Site Reliability Engineer will analyze chronic and major issues, evaluate products and their services, and make recommendations to improve...


  • Old Toronto, Canada Northbridge Financial Corporation Full time

    About the RoleAt Northbridge Financial Corporation, we are seeking a highly skilled Senior Cloud Reliability Architect to oversee the creation and implementation of Service Level Objectives (SLOs). This senior role involves handling complex service reliability solutions and is responsible for mentoring and leading less experienced engineers.We Want Your...


  • Old Toronto, Canada Lyons Consulting Group Full time

    Reliable Infrastructure SolutionsWe are seeking a highly skilled Technical Operations Expert to join our team at Lyons Consulting Group. As a Cloud Reliability Specialist, you will be responsible for providing hands-on SRE support, including incident management, problem management, root cause analysis, monitoring, alerting, and maintenance of infrastructure...


  • Old Toronto, Canada Sentry Full time

    Sentry is on a mission to simplify software development and improve application performance. We need a skilled AWS Site Reliability Engineer to join our team and help us achieve our goals. This role involves ensuring the uptime and reliability of our hosted platform, architecting and automating services and systems to meet scaling demands, and collaborating...


  • Old Toronto, Canada Chelsea Avondale Full time

    Chelsea Avondale is the world’s most cutting-edge home insurance group. We have developed sophisticated risk modeling and insurance pricing technologies for home insurance and deploy that technology through our own insurance company. Our team consists of some of the brightest minds in insurance, software development, finance, and operations. Our group...


  • Greater Toronto Area, Canada Paymentus Full time

    At Paymentus, we lead the North American marketplace in electronic bill payment solutions. We're seeking a high performer to join our development team building SaaS Fintech solutions across various industries.You will contribute to a massively scalable data platform built on top of a world-class enterprise platform, supporting thousands of clients and...


  • Toronto, Ontario, Canada Thomson Reuters Full time

    We are seeking an experienced Senior SRE to join our Shared Capabilities, Service Reliability and Operation team in Toronto. As a Cloud Native Site Reliability Engineer, you will be responsible for implementing site reliability engineering and DevOps best practices, building and maintaining monitoring for all aspects of infrastructure, micro-services, usage...


  • Old Toronto, Canada HOOPP Thames Limited Full time

    **About the Role**We are seeking a highly skilled Cloud Infrastructure Engineer to join our IT Investment Solutions Group at HOOPP. As a Cloud Infrastructure Engineer, you will play a critical role in designing, implementing, and managing our cloud infrastructure to support the organization's strategic objectives.**Responsibilities**Design, deploy, and...


  • Greater Toronto Area, Canada GlossGenius Full time

    About GlossGeniusGlossGenius is a leading fintech company empowering small business owners to succeed by offering a range of business management tools, including booking and scheduling, marketing, analytics, payment processing, and more. Our platform serves over 75,000 entrepreneurs daily.As a pioneering force in the industry, GlossGenius is expanding its...

  • Cloud Platform Lead

    1 week ago


    Toronto, Ontario, Canada Royal Bank of Canada Full time

    Role OverviewWe are seeking a seasoned Cloud Platform Lead to spearhead the design and development of highly scalable, secure, and available architectures for cloud platforms. As a key member of our team, you will lead and coordinate a team of talented Site Reliability Engineers and Cloud Platform Engineers to drive innovation and excellence.About the...

  • Cloud Engineer

    2 months ago


    Old Toronto, Canada Scotiabank Full time

    Requisition ID: 206977Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture. Scotiabank has embarked on the journey to modernize both development practices and tools. One of the main areas of transformation is the public cloud and the various platform technologies that support both development and operations on...

  • Cloud Engineer

    4 weeks ago


    Old Toronto, Canada Ontario Health Full time

    Job Title: Senior Cloud EngineerOngoing development and implementation of cloud-based systems and infrastructure for Ontario Health.Key Responsibilities:Design, implement, and manage cloud-based infrastructure and applications.Collaborate with cross-functional teams to ensure efficient and secure cloud services.Provide expert-level guidance on cloud...

  • Cloud Engineer

    2 months ago


    Old Toronto, Canada Scotiabank Full time

    Join a purpose-driven winning team, committed to results, in an inclusive and high-performing culture.Scotiabank has embarked on the journey to modernize both development practices and tools. One of the main areas of transformation is the public cloud and the various platform technologies that support both development and operations on the cloud. The aim is...


  • Old Toronto, Canada Chad Management Group Full time

    About the RoleAs a seasoned engineering leader, you will oversee the development of scalable and reliable cloud-based solutions, aligning them with the company's strategic objectives.Key ResponsibilitiesDevelop and execute technical strategies to drive innovation and growth in cloud engineeringCollaborate with cross-functional teams to ensure successful...


  • Old Toronto, Canada Infotree Global Solutions Full time

    About Infotree Global SolutionsInfotree Global Solutions is a leading provider of innovative solutions, and we're seeking an experienced Site Reliability Engineer to lead our team.Your RoleAs our Site Reliability Engineering Lead, you will be responsible for supervising a team of skilled engineers and ensuring the reliability and scalability of our global...