Site Reliability Engineer II

3 weeks ago


Toronto, Ontario, Canada The Toronto-Dominion Bank (Canada) Full time
Job Description

Job Summary

We are seeking a highly skilled Site Reliability Engineer II to join our team at The Toronto-Dominion Bank (Canada). As a key member of our technical operations team, you will be responsible for ensuring the stability, scalability, and reliability of our production environment.

Key Responsibilities

  • Provide day-to-day support for applications/systems through accurate problem identification and timely resolution of production issues.
  • Perform controlled and timely resolution of incidents while prioritizing and monitoring client satisfaction.
  • Ensure timely notification and escalation of possible issues/problems, options, and recommendations.
  • Responsible for incident management (2nd level), monthly maintenance, state of health monitoring, and SLA maintenance.
  • Continuously strive to improve the stability of the production environment by partnering closely with key stakeholders on setting up, maintaining, and monitoring applications/systems, ensuring availability targets are met.
  • Provide technical leadership to improve the design and operation of systems in alignment with reliability engineering best practices and overall Technology and Bank strategies, applying the practices of computer science and software engineering to the design and development of large, complex systems.
  • Influence and partner with key technology and product team members in the design and development of solutions that promote automation and the elimination of toil; identify optimal ways to improve the design and operation of systems to make them more scalable, more reliable, and more efficient and has the ability to implement the required changes.
  • Develop deep relationships with Product Owners, Tech Leads, and Ops to build transparency and help foster end-to-end accountability of products and services.

Requirements

  • Minimum 8 years' experience with MUST have experience managing a team of SMEs who are responsible for ensuring platforms stability, scalability, and reliability with different types of technology stack.
  • Strategize and lead testing of TD Bank's production-like infrastructure with a focus on promoting and applying best practices for performance testing/engineering of scalable and reliable services across engineering.
  • Experience in delivering large/complex programs with knowledge on the latest technology landscape to support enterprise-level engagements.
  • Expert with Performance testing tool – JMeter/LR and APM/Log analysis tools like Dynatrace, Datadog, Splunk with good understanding of Thread dump, Heap dump, and DB analysis.
  • Be a subject matter expert and partner with our engineering team on areas of performance and scale together with multiple streams from Technology and Business to ensure alignment.
  • Participate in root cause analysis for performance-related incidents and determining how we can prevent them in the future, identify performance bottlenecks, and create repeatable processes.
  • Be a site reliability engineering advocate and advisor for the tools/technology and frameworks that serve as a model for others about quality, scalability, operability, maintainability, etc.
  • Mentor and coach junior engineers to leverage their full potential.
  • Able to bring new ideas, innovation for continuous improvement.
  • Excellent Soft Skills including Speaking/Presentation skills in a professional environment.

About Us

The Toronto-Dominion Bank (Canada) is one of the world's leading global financial institutions and is the fifth largest bank in North America by branches/stores. Every day, we deliver legendary customer experiences to over 27 million households and businesses in Canada, the United States, and around the world. More than 95,000 TD colleagues bring their skills, talent, and creativity to the Bank, those we serve, and the economies we support. We are guided by our vision to Be the Better Bank and our purpose to enrich the lives of our customers, communities, and colleagues.

Total Rewards Package

Our Total Rewards package reflects the investments we make in our colleagues to help them and their families achieve their financial, physical, and mental well-being goals. Total Rewards at TD includes a base salary, variable compensation, and several other key plans such as health and well-being benefits, savings and retirement programs, paid time off, banking benefits and discounts, career development, and reward and recognition programs.



  • Toronto, Ontario, Canada The Toronto-Dominion Bank (Canada) Full time

    Job DescriptionJob SummaryWe are seeking a highly skilled Site Reliability Engineer II to join our team. As a Site Reliability Engineer II, you will be responsible for ensuring the stability, scalability, and reliability of our platforms and applications.Key ResponsibilitiesProvide day-to-day support for applications and systems, including incident...


  • Toronto, Ontario, Canada The Toronto-Dominion Bank (Canada) Full time

    Job DescriptionJob SummaryWe are seeking a highly skilled Site Reliability Engineer II to join our team. As a Site Reliability Engineer II, you will be responsible for ensuring the stability, scalability, and reliability of our platforms and applications.Key ResponsibilitiesProvide day-to-day support for applications and systems, including incident...


  • Toronto, Ontario, Canada TD Bank Full time

    Job Summary:TD Bank is seeking a highly skilled Site Reliability Engineer II to join our team. As a Site Reliability Engineer II, you will be responsible for ensuring the stability, scalability, and reliability of our production infrastructure. You will work closely with our engineering team to identify and resolve performance issues, implement changes, and...


  • Toronto, Ontario, Canada The Toronto-Dominion Bank (Canada) Full time

    Job DescriptionJob SummaryWe are seeking a highly skilled Site Reliability Engineer II to join our team. As a Site Reliability Engineer II, you will be responsible for ensuring the stability, scalability, and reliability of our platforms and applications.Key ResponsibilitiesProvide day-to-day support for applications and systems, including incident...


  • Toronto, Ontario, Canada The Toronto-Dominion Bank (Canada) Full time

    Job DescriptionJob SummaryWe are seeking a highly skilled Site Reliability Engineer II to join our team. As a Site Reliability Engineer II, you will be responsible for ensuring the stability, scalability, and reliability of our platforms and applications.Key ResponsibilitiesProvide day-to-day support for applications and systems, including incident...


  • Toronto, Ontario, Canada TD Bank Full time

    Job SummaryWe are seeking a highly skilled Site Reliability Engineer II to join our team at TD Bank. As a key member of our technology organization, you will be responsible for ensuring the stability, scalability, and reliability of our production environment.Key ResponsibilitiesProvide day-to-day support for applications and systems, including accurate...


  • Toronto, Ontario, Canada The Toronto-Dominion Bank (Canada) Full time

    Job DescriptionJob SummaryWe are seeking a highly skilled Site Reliability Engineer II to join our team. As a Site Reliability Engineer II, you will be responsible for ensuring the stability, scalability, and reliability of our platforms and applications.Key ResponsibilitiesProvide day-to-day support for applications and systems, including accurate problem...


  • Toronto, Ontario, Canada The Toronto-Dominion Bank (Canada) Full time

    Job DescriptionJob SummaryWe are seeking a highly skilled Site Reliability Engineer II to join our team. As a Site Reliability Engineer II, you will be responsible for ensuring the stability, scalability, and reliability of our platforms and applications.Key ResponsibilitiesProvide day-to-day support for applications and systems, including accurate problem...


  • Toronto, Ontario, Canada The Toronto-Dominion Bank (Canada) Full time

    Job DescriptionAt The Toronto-Dominion Bank (Canada), we're committed to delivering legendary customer experiences to over 27 million households and businesses in Canada, the United States, and around the world. As a Site Reliability Engineer II, you'll play a critical role in ensuring the stability, scalability, and reliability of our platforms and...


  • Old Toronto, Ontario, Canada TD Full time

    Job DescriptionAs a Site Reliability Engineer II at TD, you will play a critical role in ensuring the stability, scalability, and reliability of our platforms. You will work closely with our engineering team to identify and resolve performance bottlenecks, implement process improvements, and develop solutions that promote automation and eliminate toil.Key...


  • Old Toronto, Ontario, Canada TD Full time

    Job DescriptionAs a Site Reliability Engineer II at TD, you will play a critical role in ensuring the stability, scalability, and reliability of our platforms. You will work closely with our engineering team to identify and resolve performance bottlenecks, implement process improvements, and develop solutions that promote automation and eliminate toil.Key...


  • Toronto, Ontario, Canada The Toronto-Dominion Bank (Canada) Full time

    Job SummaryWe are seeking a highly skilled Site Reliability Engineer II to join our team at The Toronto-Dominion Bank (Canada). As a key member of our technical operations team, you will be responsible for ensuring the stability, scalability, and reliability of our production environment.Key ResponsibilitiesProvide day-to-day support for applications/systems...


  • Toronto, Ontario, Canada The Toronto-Dominion Bank (Canada) Full time

    Job SummaryWe are seeking a highly skilled Site Reliability Engineer II to join our team at The Toronto-Dominion Bank (Canada). As a key member of our technical operations team, you will be responsible for ensuring the stability, scalability, and reliability of our production environment.Key ResponsibilitiesProvide day-to-day support for applications/systems...


  • Toronto, Ontario, Canada Bourse de Montreal Inc. Full time

    Job Title: Site Reliability EngineerAt Bourse de Montreal Inc., we're seeking a highly skilled Site Reliability Engineer to join our Global Technology Services (GTS) team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our technology infrastructure.Key Responsibilities:Evaluate new...


  • Toronto, Ontario, Canada Bourse de Montreal Inc. Full time

    Job Title: Site Reliability EngineerAt Bourse de Montreal Inc., we're seeking a highly skilled Site Reliability Engineer to join our Global Technology Services (GTS) team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our technology infrastructure.Key Responsibilities:Evaluate new...


  • Toronto, Ontario, Canada Bourse de Montreal Inc. Full time

    Job Title: Site Reliability EngineerAt Bourse de Montreal Inc., we're seeking a highly skilled Site Reliability Engineer to join our Global Technology Services (GTS) team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our technology infrastructure.Key Responsibilities:Evaluate new...


  • Toronto, Ontario, Canada Bourse de Montreal Inc. Full time

    Job Title: Site Reliability EngineerAt Bourse de Montreal Inc., we're seeking a highly skilled Site Reliability Engineer to join our Global Technology Services (GTS) team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our technology infrastructure.Key Responsibilities:Evaluate new...


  • Toronto, Ontario, Canada Lorven Technologies Full time

    Job Title: Site Reliability EngineerLocation: Toronto, CADuration: Long termWe are seeking a highly skilled Site Reliability Engineer to join our team at Lorven Technologies. As a key member of our infrastructure team, you will be responsible for ensuring the reliability and scalability of our cloud-based systems.Key ResponsibilitiesA Bachelor's degree in...


  • Toronto, Ontario, Canada Lorven Technologies Full time

    Job Title: Site Reliability EngineerLocation: Toronto, CADuration: Long termWe are seeking a highly skilled Site Reliability Engineer to join our team at Lorven Technologies. As a key member of our infrastructure team, you will be responsible for ensuring the reliability and scalability of our cloud-based systems.Key ResponsibilitiesA Bachelor's degree in...


  • Toronto, Ontario, Canada Lorven Technologies Full time

    Job Title: Site Reliability Engineer (SRE)Location: Toronto, CADuration: Long termWe are seeking a highly skilled Site Reliability Engineer to join our team at Lorven Technologies. As a key member of our technical operations team, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key...