Site Reliability Engineer

2 weeks ago


Toronto, Canada Scotiabank Full time

Site Reliability Engineer (SRE) – Scotiabank
Requisition ID: 247129

Join a purpose‑driven winning team, committed to results, in an inclusive and high‑performing culture.

Job Overview

As an SRE, you will implement, measure, and gather insights from Operational Level Indicators (OLI) to identify service improvements covering availability, performance, resilience, incidents, and chronic problems. You will execute jobs per runbooks and automate repetitive tasks to reduce toil. You are expected to lead and perform Disaster Recovery (DR) exercises, engage teams for technical validation, and participate in vulnerability assessments, making recommendations for remediation and overall enhancements.

Responsibilities

  • Challenge yourself with complex problem solving and apply lessons learned for continuous improvement.
  • Support critical systems requiring high trust, resilience, and security.
  • Proactively seek opportunities to automate and solve problems before they occur.
  • Act as a Subject Matter Expert (SME) on Performance, Scalability, Reliability, Audit, Monitoring, and Security following SRE best practices.
  • Manage communication of production releases and their service availability impact to internal and external stakeholders.
  • Bring a passion for metrics, trends, and patterns of availability and serviceability as dictated by defined SLA/SLI.

Qualifications

  • Strong verbal and written communication skills in English.
  • 3+ years of hands‑on experience in real‑time streaming data projects in operations.
  • 7+ years of hands‑on technical experience with production support and in‑depth troubleshooting of major incidents and problem management.
  • 2+ years of technical experience with Apache Kafka for event management.
  • 3+ years of technical experience with Splunk and/or Dynatrace for monitoring and alerts.
  • 5+ years of technical experience building CI/CD pipelines using Jenkins, Gradle/Maven, or Bitbucket.
  • Ability to read Java code for troubleshooting and debugging.
  • Strong technical understanding of RESTful services.
  • Proficiency in SQL queries across relational databases.
  • Knowledge of microservices in Google Cloud Platform (GCP) and/or Azure.
  • Experience with UNIX shell scripting and Python.
  • Excellent organizational skills.
  • Post‑secondary education in Computer Science, Engineering, or Mathematics.
  • Confluent Certified Administrator for Apache Kafka is an asset.

Benefits

  • Commitment to diversity, equity, inclusion, and allyship with employee resource groups.
  • Accessibility and workplace accommodations for all employees.
  • Upskilling through online courses, cross‑functional development, and tuition assistance.
  • Competitive rewards program including bonus, flexible vacation, personal and sick days; benefits start day one.
  • Dynamic ecosystem with free tea & coffee, universal washrooms, and collaborative spaces.
  • Opportunities for community engagement and belonging through various programs.

Location

Canada – Ontario – Toronto

Equal Employment Opportunity Statement

Scotiabank is a leading bank in the Americas. We value the unique skills and experiences of each employee and are committed to an inclusive, accessible environment. If you require accommodation during the recruitment process, please let our Recruitment team know. Candidates must apply directly online to be considered for this role. We thank all applicants for their interest; however, only those selected for an interview will be contacted.



  • Canada : Ontario : Toronto Scotiabank Global Site Full time

    Requisition ID: 245210. Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture. The Team: Global Banking and Markets Engineering (GBME) is the fast-moving, award-winning technology engine that powers Scotiabank's Corporate, Investment Banking and Capital Markets businesses. The Role: GBME is searching for a Site...


  • Canada : Ontario : Toronto Scotiabank Global Site Full time

    Requisition ID: 244026. Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture. Overview: As a Site Reliability Engineer (SRE), you will join the Digital Engineering Operations team, responsible for ensuring the operations and reliability of Scotiabank digital applications. You will have the opportunity to drive...


  • Toronto, Canada Kyndryl Full time

    Position: Site Reliability Engineer Client: Financial Services - Capital Markets Technology Duration: 12-month contract with potential...


  • Canada : Ontario : Toronto Scotiabank Global Site Full time

    Requisition ID: 247129. Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture. As an SRE, you will implement, measure and gather insights from Operational Level Indicators identifying areas for service improvements covering availability, performance, resilience, incidents and chronic problems. You will implement...


  • Toronto, Canada Global Technical Talent Full time

    Primary Job Title: Site Reliability Engineer IV. Alternate / Related Job Titles: Site Reliability Engineer, Senior SRE, IT Reliability Engineer, Systems Integration Engineer. Location & Onsite Flexibility: Toronto, ON (Hybrid, 4 days onsite). Office Address: 66 Wellington Street West, 19th Floor, Toronto, ON. Contract Details: Position Type: Contract. Contract...


  • Toronto, Canada Moneris Full time

    Your Moneris Career - The Opportunity: As the Site Reliability Engineer, you will help ensure the reliability, performance, and scalability of our systems. You will work with development and operations teams to build and maintain robust infrastructure, automate processes, and improve overall system health. Location: You will be based in our Toronto office,...


  • Toronto, Canada Denvr Full time

    Site Reliability Engineer - Platform Infrastructure Team (100% Remote - Canada) Denvr is a vertically integrated AI Platform Services company headquartered in Calgary, Canada. We provide foundational compute infrastructure and services to support the broader AI ecosystem and its end users. The platform includes cloud‑native solutions for training,...


  • Toronto, Ontario, Canada Global Technical Talent, an Inc. 5000 Company Full time

    Primary Job Title: Site Reliability Engineer IV. Alternate / Related Job Titles: Site Reliability Engineer, Senior SRE, IT Reliability Engineer, Systems Integration Engineer. Location & Onsite Flexibility: Toronto, ON (Hybrid, 4 days onsite). Office Address: 66 Wellington Street West, 19th Floor, Toronto, ON. Contract Details: Position Type: Contract. Contract...


  • Toronto, Canada Tecsys Inc. Full time

    Having recognized the advantages of remote work, including improved employee morale and productivity and the benefits of reduced commuting for employee wellbeing and the environment, we are proud to be a digital-first company. The technologies and programs in which we invested have provided a fantastic foundation to this end. Our digital-first work environment, together with our...