Manager, Site Reliability Engineering

4 days ago


Toronto, Canada Northbridge Financial Corporation Full time

What it’s like to be a Site Reliability Engineer Manager at Northbridge Financial

The Manager, Site Reliability Engineering is a critical role responsible for ensuring the reliability, performance, and availability of our core insurance platforms. You will be required to work very closely with our application and infrastructure teams to prevent incidents, manage infrastructure, and build effective monitoring systems. In this cross functional role, you must communicate effectively with various stakeholders while maintaining smooth operations and improving overall system reliability.

We want your talent

If you’re great at:

Strong knowledge of, and experience with, cloud technologies. Documented experience with infrastructure modelling tools to troubleshoot, resolve issues. Demonstrated experience providing over-the-phone support to clients and other departments. Demonstrated experience in developing process improvement initiatives. Transform difficult and complex technical ideas and topics into clear and relevant business ideas and concepts. An expansive understanding of industry practices and how to best leverage them. Exceptional analytical, conceptual, and problem-solving abilities and an enhanced capability of "thinking outside the box" to deliver innovative strategies that move the business forward and respond to changing market conditions. Proficiency in one or more of the following: C, C++, Java, Python, Go, Perl or Ruby. Documented experience with algorithms, data structures, complexity analysis and software design. Documented experience working with configuration management and deployment automation tools like Chef, Terraform, Puppet or Ansible.

If you have:

Bachelors Degree (Computer Science or Computer Engineering degree is preferred) and/or equivalent on the job experience. Strong understanding of cloud computing platforms such as Azure, GCP, IBM Cloud or AWS, as well as cloud automation and orchestration tools. Including items such as: SCPs, landing zones, Control tower, Azure Policy, Azure Security Centre, Azure monitor, Azure Advisor.  Technical hands-on leader with a deep understanding of Azure cloud architecture, DevOps, Site Reliability Engineering and Cybersecurity Experience in designing, creating and supporting Automation (PowerShell, Python, Ruby, AWK, SED, etc.) to run health-checks and self-healing capabilities. Act as a Subject Matter Expert on SE Practices. Agile methodologies and a product mindset approach. Outstanding communication, interpersonal, and leadership skills An understanding of mission critical applications and be able to assess the risk and impact associated with change. Demonstrated experience monitoring systems and components. Experience operating in ITIL Framework for Change, incident, and problem management. 5+ years as a technology leader managing a team of 5+ individuals DBs – MongoDB, Redis, MS SQL Sever, Oracle, Mysql, PostgresSQL Architectures & Patterns – BFF, SOA, MDA, OOA, Event Driven, TDD, BDD, CI/CD, etc Configuration Management - Ansible, Puppet, Chef Other languages ( Node.js, C#, Perl) Windows & Linux skills

We really mean it when we say we put you first. Here are a few ways how:

Hybrid work you get to work from the office and at home 50/50, allowing you to manage both worlds with the ease and flexibility you need. We offer competitive salaries and support your financial health through our employee share purchase plan, pension plans, RRSP, discounts on staff insurance, and more We help you prioritize your well-being from day one through flexible health benefits, early leave days, wellness programs, rewards, and recognition programs. We are invested in helping you grow in your career through education assistance, internal mobility, and mentoring programs. NBFC cares about the community and supports the causes you believe in with donation matching and team volunteer days.

#LI-TS1

Qui nous sommes

Nous sommes la Financière Northbridge. Nous sommes fiers d’être une société canadienne à 100 %, détenue en propriété exclusive par Fairfax Financial. Nous offrons nos services par l’entremise de nos marques Northbridge Assurance, Les assurances Federated et TruShield Assurance. Nous sommes reconnus comme étant l’une des plus importantes sociétés d’assurance de dommages des entreprises au Canada. Nos employés sont engagés à comprendre les besoins de nos clients, et nous faisons tout en notre pouvoir pour aider les entreprises canadiennes à connaître un avenir meilleur et plus sécuritaire. Nous sommes une entreprise formée de personnes passionnées qui placent les gens au cœur de leurs préoccupations. Souhaitez-vous vous joindre à une équipe qui croit en l’importance de travailler fort, et d’avoir du plaisir au travail, tout en améliorant les choses? Ne cherchez pas plus loin que Northbridge.

À la Financière Northbridge, nous avons à cœur de créer un milieu de travail inclusif où nous célébrons les employés et les accueillons comme ils sont. Peu importe qui vous êtes ou ce qui vous rend unique, nous vous accueillons à bras ouverts. Veuillez simplement nous indiquer comment nous pouvons vous aider ou vous accommoder au cours du processus de sélection.



  • Toronto, Canada CB Canada Full time

    Site Reliability Engineer On behalf of our client in the Banking Sector, PROCOM is looking for a Site Reliability Engineer. Site Reliability Engineer – Job Description Azure cloud Jira and confluence CICD Experience with automating (provisioning, configuration management, deployment) and integrating Azure PaaS solutions (Azure App services, Azure...


  • Old Toronto, Ontario, Canada CB Canada Full time

    Site Reliability EngineerOn behalf of our client in the Banking Sector, PROCOM is looking for a Site Reliability Engineer.Site Reliability Engineer – Job DescriptionAzure cloudJira and confluenceCICD Experience with automating (provisioning, configuration management, deployment) and integrating Azure PaaS solutions (Azure App services, Azure Kubernetes...


  • Old Toronto, Canada CB Canada Full time

    Site Reliability Engineer On behalf of our client in the Banking Sector, PROCOM is looking for a Site Reliability Engineer. Site Reliability Engineer – Job Description Azure cloud Jira and confluence CICD Experience with automating (provisioning, configuration management, deployment) and integrating Azure PaaS solutions (Azure App services, Azure...


  • Old Toronto, Canada CB Canada Full time

    Site Reliability Engineer On behalf of our client in the Banking Sector, PROCOM is looking for a Site Reliability Engineer. Site Reliability Engineer – Job Description Azure cloud Jira and confluence CICD Experience with automating (provisioning, configuration management, deployment) and integrating Azure PaaS solutions (Azure App services, Azure...


  • Old Toronto, Canada Reperio Human Capital Full time

    Site Reliability Engineer 100421 Desired skills: Site Reliability Engineer, SRE, Cloud, Permanent, Remote Site Reliability Engineer Location: Ireland/UK Salary: €70K+ Type: Permanent, Full-time We're seeking experienced Site Reliability Engineers who excel at ensuring the reliability and scalability of production systems, and possess extensive experience...


  • Old Toronto, Canada Reperio Human Capital Full time

    Site Reliability Engineer 100421 Desired skills: Site Reliability Engineer, SRE, Cloud, Permanent, Remote Location: Ireland/UK Salary: €70K+ Type: Permanent, Full-time We're seeking experienced Site Reliability Engineers who excel at ensuring the reliability and scalability of production systems, and possess extensive experience with monitoring and...


  • Toronto, Canada Capgemini Full time

    Role: Site Reliability Engineer - Production SupportLocation: Toronto, ONFTEJob Description:7+ Years of ExperienceExcellent Communication.Engineering:Develop SRE solutions (monitoring and alerting, machine learning anomaly detection, self-healing and reliability testing)Simplifies development by building repeatable solutions to manual tasks.Enable SRE...


  • Toronto, Canada Capgemini Full time

    Role: Site Reliability Engineer - Production SupportLocation: Toronto, ONFTEJob Description:7+ Years of ExperienceExcellent Communication.Engineering:Develop SRE solutions (monitoring and alerting, machine learning anomaly detection, self-healing and reliability testing)Simplifies development by building repeatable solutions to manual tasks.Enable SRE...


  • Old Toronto, Canada Northbridge Financial Corporation Full time

    Manager, Site Reliability Engineering page is loaded Manager, Site Reliability Engineering Apply locations Toronto, ON time type Full time posted on Posted 2 Days Ago job requisition id R3417 What it’s like to be a Site Reliability Engineer Manager at Northbridge Financial The Manager, Site Reliability Engineering is a...


  • Old Toronto, Canada Northbridge Financial Corporation Full time

    Manager, Site Reliability Engineering page is loaded Manager, Site Reliability Engineering Apply locations Toronto, ON time type Full time posted on Posted 2 Days Ago job requisition id R3417 What it’s like to be a Site Reliability Engineer Manager at Northbridge Financial The Manager, Site Reliability Engineering is a...


  • Toronto, Canada Capgemini Full time

    Role: Site Reliability Engineer - Production Support Location: Toronto, ON FTE Job Description: 7+ Years of Experience Excellent Communication. Engineering: Develop SRE solutions (monitoring and alerting, machine learning anomaly detection, self-healing and reliability testing) Simplifies development by building repeatable solutions to manual tasks....


  • Toronto, Canada Capgemini Full time

    Role: Site Reliability Engineer - Production SupportLocation: Toronto, ONFTEJob Description:7+ Years of ExperienceExcellent Communication.Engineering:Develop SRE solutions (monitoring and alerting, machine learning anomaly detection, self-healing and reliability testing)Simplifies development by building repeatable solutions to manual tasks.Enable SRE...


  • Toronto, Canada Capgemini Full time

    Role: Site Reliability Engineer - Production SupportLocation: Toronto, ONFTEJob Description:7+ Years of ExperienceExcellent Communication.Engineering:Develop SRE solutions (monitoring and alerting, machine learning anomaly detection, self-healing and reliability testing)Simplifies development by building repeatable solutions to manual tasks.Enable SRE...


  • Toronto, Ontario, Canada Zortech Solutions Full time

    Hi,Hope you are doing GreatThis side Priya Rajput from Zortech Solutions trying to reach you for an exciting job opening, kindly have a look to job description and revert me with your positive feedback. My mail ID is or call me on .Role: Site Reliability EngineerLocation: Toronto, ON-OnsiteDuration: Fulltime PermanentSkills and Responsibilities:...


  • Toronto, Canada Capgemini Full time

    Role: Site Reliability Engineer - Production Support Location: Toronto, ON FTE Job Description: 7+ Years of Experience Excellent Communication. Engineering: Develop SRE solutions (monitoring and alerting, machine learning anomaly detection, self-healing and reliability testing) Simplifies develo


  • Toronto, Ontario, Canada eTeam Full time

    Remote work Duration - 4 months - Preference is to find candidates who are willing to be converted to full time employee . The conversion decision will be made based on performance. Job description - ::: Role Desc : Defining and measuring reliability goals—SLIs, SLOs, and error budgets for user journey Designing for and implementing observability...


  • Old Toronto, Canada Thomson Reuters Full time

    (Canada) Site Reliability Engineer (Contract) Contract (9 months 4 days) Published 3 days ago New Relic Data Dog Site Reliability Engineer - in the Service Management OrganizationDo you have experience in IT Service Management, working with cloud providers, software development, and technology infrastructure?The Site Reliability Engineer will...


  • Old Toronto, Canada Thomson Reuters Full time

    (Canada) Site Reliability Engineer (Contract) Contract (9 months 4 days) Published 3 days ago New Relic Data Dog Site Reliability Engineer - in the Service Management OrganizationDo you have experience in IT Service Management, working with cloud providers, software development, and technology infrastructure?The Site Reliability Engineer will...


  • Toronto, Canada Autodesk Full time

    Position Overview Autodesk, the leading Design and Make Software Company, is looking for a Principal Site Reliability Engineer to join the Autodesk Platform Services Engineering team in Toronto, Canada. On this position, you will help build trusted services of APS (Autodesk Platform Services) as measured by Service Level Objectives (SLOs) and Mean...


  • Old Toronto, Canada Skillfinder Full time

    SITE RELIABILITY ENGINEER - WARSAW, POLAND Contract (hybrid working) - 12 months + Role Overview My client serves a variety of world class financial services clients with their state of the art integrated investment management system. For their office in Warsaw, they are seeking a team of Site Reliability Engineers to assist them with a major client...