Lead Site Reliability Engineer

3 weeks ago


Toronto, Canada SimCorp Full time

Join to apply for the Lead Site Reliability Engineer role at SimCorp. About SimCorp Join some of the most innovative thinkers in FinTech as we lead the evolution of financial technology. If you are an innovative, curious, collaborative person who embraces challenges and wants to grow, learn and pursue outcomes with our prestigious financial clients, say Hello to SimCorp At its foundation, SimCorp is guided by our values – caring, customer success‑driven, collaborative, curious, and courageous. Our people‑centered organization focuses on skills development, relationship building, and client success. We take pride in cultivating an environment where all team members can grow, feel heard, valued, and empowered. Why This Role is Important to Us The Lead Site Reliability Engineer (SRE) will be responsible for leading the efforts to maintain and improve the reliability, scalability, and performance of various SimCorp products and services. This individual will work collaboratively with product development teams across several lines of business and other stakeholders to fulfill their responsibilities effectively. The candidate will need in‑depth expertise and experience in Azure Cloud and associated technologies to address infrastructure challenges, implement automation solutions, and boost overall operational effectiveness. Responsibilities Lead the development of SRE solutions, including monitoring and alerting, machine learning‑based anomaly detection, self‑healing mechanisms, and reliability testing strategies. Design and implement reliability, scalability, and performance strategies, while leading the development of initiatives for capacity planning, resource management, and automation opportunities across systems and onboarding pipelines. Collaborate with product development teams to optimize application performance and infrastructure, applying design‑thinking and agile methodologies in cross‑functional environments. Manage incident response and root cause analysis, ensuring timely resolution of outages and performance issues, and maintaining high‑quality documentation for operational processes and system configurations. Drive continuous improvement and adoption of best practices, including change management, observability, and operational excellence, while staying current with technology trends through formal and self‑directed learning. Mentor and guide junior SREs, promoting a culture of collaboration and knowledge‑sharing, and participate in on‑call rotations (including weekends) as needed. Qualifications Bachelor’s or Master’s degree in Computer Science or a related field, with 5–8+ years in SRE or Cloud Infrastructure leadership. Extensive expertise in Microsoft Azure, including production‑grade design, operations, and cloud‑native reliability practices. Proficiency in Infrastructure‑as‑Code tools such as Bicep, Terraform, ARM, Ansible, and comprehensive understanding of monitoring and incident frameworks. Direct experience leading incident response, platform‑wide reliability improvements, and applying ITIL practices (problem, change, incident management). Broad technical knowledge across Azure DevOps, Kubernetes, Docker, CI/CD, APIs, scripting, SQL, Cosmos DB, and MongoDB Atlas. Established expertise in mentoring engineers, steering architectural decisions, and synchronizing long‑term strategies with actionable implementation. Financial industry experience is a plus. Compensation For Toronto only: The salary range for this position is $97,000 to $120,000 CAD. Base pay may vary based on factors such as years of experience, skills and qualifications. Employees are eligible for an annual discretionary bonus and benefits including health and dental care, time off and Group RRSP/TFSA. Benefits SimCorp offers several benefits that might play a significant factor in considering whether to accept a job offer. As a global company with 30+ offices worldwide, the benefits package may vary by country. SimCorp follows a global hybrid policy, requiring employees to work from the office two days each week while allowing remote work on other days. Application Process Please submit your application in English via our career site as soon as possible. Applications sent through our system will be processed; other submissions will not be considered. We strive to mitigate bias in the recruitment process by requesting that candidates exclude personal data such as photos, age, or non‑professional information from their application. Equal Opportunity & Accessibility SimCorp is an equal opportunity employer. We are committed to building a culture where diverse perspectives and expertise are integrated in our everyday work. SimCorp Canada welcomes and encourages applications from people with disabilities. Accommodations are available upon request for candidates during the recruitment process. Candidates requiring accommodation should contact the People & Culture team at HumanResourcesNA@simcorp.com. #J-18808-Ljbffr



  • (s): Canada : Ontario : Toronto Scotiabank Global Site Full time $105,000 - $170,000 per year

    Requisition ID: 244026Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture.Overview: As a Site Reliability Engineer (SRE), you will join the Digital Engineering Operations team, responsible for ensuring the operations and reliability of Scotiabank digital applications. You will have the opportunity to drive...


  • (s): Canada : Ontario : Toronto Scotiabank Global Site Full time US$80,000 - US$140,000 per year

    Requisition ID: 244027Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture.Overview: As a Site Reliability Engineer (SRE), you will join the Digital Engineering Operations team, responsible for ensuring the operations and reliability of Scotiabank digital applications. You will have the opportunity to drive...


  • Toronto, Canada Masabi Full time

    Introducing Masabi At Masabi, we’re driving the fare payment revolution, powering the journeys of millions all over the world. We build fare collection platforms that allow riders to seamlessly buy and present tickets for public transport either on their mobile phones, from a ticket machine, or even by tapping their bank card to travel. Our Justride...


  • Toronto, Canada Masabi Full time

    Introducing Masabi At Masabi, we’re driving the fare payment revolution, powering the journeys of millions all over the world. We build fare collection platforms that allow riders to seamlessly buy and present tickets for public transport either on their mobile phones, from a ticket machine, or even by tapping their bank card to travel. Our Justride...


  • Toronto, Canada S&P Global Full time

    About the Role Lead Site Reliability Engineer The Team The Cloud Engineering team at S&P Global is responsible for designing, building, and maintaining the cloud infrastructure that supports our business operations. The team works closely with development, testing, and product groups to understand their requirements and deliver scalable, reliable cloud...


  • Toronto, Canada S&P Global Full time

    About the Role Lead Site Reliability Engineer The Team The Cloud Engineering team at S&P Global is responsible for designing, building, and maintaining the cloud infrastructure that supports our business operations. The team works closely with development, testing, and product groups to understand their requirements and deliver scalable, reliable cloud...


  • Toronto, Canada S&P Global Full time

    About the Role Lead Site Reliability Engineer The Team The Cloud Engineering team at S&P Global is responsible for designing, building, and maintaining the cloud infrastructure that supports our business operations. The team works closely with development, testing, and product groups to understand their requirements and deliver scalable, reliable cloud...


  • Toronto, Canada S&P Global Full time

    About the Role Lead Site Reliability Engineer The Team The Cloud Engineering team at S&P Global is responsible for designing, building, and maintaining the cloud infrastructure that supports our business operations. The team works closely with development, testing, and product groups to understand their requirements and deliver scalable, reliable cloud...


  • Toronto, Ontario, Canada Procom Full time $80,000 - $120,000 per year

    Site Reliability Engineer (SRE)/ Ingénieur Fiabilité des SitesOn behalf of our banking client, Procom is seeking a Site Reliability Engineer (SRE) for a 12-month contract role. This position is a hybrid role, 3 days a week onsite at our client's Montréal, Quebec office.Site Reliability Engineer - Job Description:The Site Reliability Engineer is...


  • Toronto, Canada RBC Full time

    Opportunity Join RBC as a Lead Site Reliability Engineer and take the lead in ensuring the reliability, scalability, and performance of our critical production systems and infrastructure. This is your chance to drive innovation through cutting‑edge engineering practices, automation, and process optimization. Collaborate with cross‑functional teams,...