Senior Site Reliability Engineering Manager

4 weeks ago


Toronto, Ontario, Canada Thomson Reuters Full time

Overview of the Position

In this role as Senior Site Reliability Engineering Manager, you will: Leadership and Mentorship: Inspire and guide a team of Site Reliability Engineers, offering technical direction, coaching, and support to cultivate a collaborative, innovative, and continuously improving environment. Excellence in Operations: Spearhead the adoption of best practices for reliability, scalability, and performance across our systems and services. Establish and track essential metrics to guarantee uptime, availability, and response times that meet or surpass service level agreements. Lead initiatives to enhance efficiencies and mitigate operational risks. Conduct research on new capabilities, evaluate new solutions, and recommend and implement advanced technologies to enhance customer experience and reduce expenses. Architectural Design: Work alongside cross-functional teams to architect, develop, and sustain scalable and resilient infrastructures for our cloud-based applications. Identify areas for optimization and efficiency enhancements. Tackle complex challenges and devise solutions to elevate the products and services we provide to our clients. DevOps Methodologies: Advocate for and implement DevOps principles to streamline software delivery, automate infrastructure setup, and enhance deployment processes. Collaborate with development teams to weave SRE practices into the software development lifecycle. Automation and Tools: Promote the utilization of automation and tools to optimize operational workflows, boost efficiency, and minimize manual tasks. Lead the creation of monitoring, alerting, and automation solutions to proactively detect and resolve issues. Culture of Continuous Improvement: Foster a culture of ongoing enhancement by encouraging innovation, experimentation, and learning within the team. Support knowledge sharing and professional growth to advance technical skills and expertise.

Ideal Candidate Profile

You are an ideal candidate for the Senior Site Reliability Engineering Manager position if you possess:


• Demonstrated experience in a leadership capacity, overseeing a team of DevOps engineers and/or Site Reliability engineers or similar technical professionals.


• Proficiency in Observability tools such as Data Dog or New Relic.


• 3-5 years of relevant experience in software development and/or technology platforms, infrastructure, or operations.


• Strong analytical and problem-solving abilities, with a proactive approach to identifying and addressing complex technical challenges.


• Experience with cloud technologies, services, and their APIs, including AWS, Azure, and GCP.


• Proficiency in DevOps practices and methodologies, with hands-on experience in CI/CD pipelines, configuration management, and infrastructure as code.


• A Bachelor's degree or equivalent, preferably in Computer Science or a related technical field.

• Familiarity with AI/ML tools to enhance service delivery, reduce costs, and experience with AI-Operations solutions.

• Knowledge of programming languages such as Python, Java, or C#.

• Experience in designing and supporting scalable systems and services.

• Proficiency in Networking, Windows, Linux, Containers, PostgreSQL, or related infrastructure services at scale.

• Ability to automate tasks to enhance service operations and support.

• Proficiency in data analysis from sources such as SQL, S3, Athena, etc.

What We Offer
Join our inclusive culture of exceptional talent, where we are dedicated to your personal and professional development through: Flexible Work Environment: We have embraced a hybrid working model (2-3 days a week in the office depending on the role) for our office-based positions while ensuring a seamless experience that is both digitally and physically connected. Wellbeing Initiatives: Comprehensive benefits plans; flexible and supportive benefits for work-life balance: flexible vacation, company-wide Mental Health Days Off; work from another location for up to 8 weeks annually, with 4 of those weeks being out of the country; Headspace app subscription; retirement savings, tuition reimbursement, and employee incentive programs; resources for mental, physical, and financial wellbeing. Culture of Inclusion: Globally recognized and award-winning reputation for equality, diversity, and inclusion, flexibility, work-life balance, and more. Learning & Development Opportunities: Access to LinkedIn Learning; internal Talent Marketplace with opportunities to engage in cross-company projects; networking through Ten Thousand Coffees Thomson Reuters café. Social Responsibility: Ten employee-driven Business Resource Groups; two paid volunteer days each year; Environmental, Social, and Governance (ESG) initiatives for local and global impact. Purpose-Driven Work: We take pride in being one of the few companies globally that assists its clients in pursuing justice, truth, and transparency. Together with the professionals and institutions we serve, we uphold the rule of law, facilitate commerce, identify wrongdoers, report facts, and provide trusted, unbiased information worldwide. Do you aspire to be part of a team that is reinventing the way knowledge professionals operate? Join us at Thomson Reuters, where we have been committed to this mission for nearly 160 years. Our industry-leading products and services encompass specialized information-enabled software and tools for legal, tax, accounting, and compliance professionals, combined with the world's most comprehensive news services – Reuters. We empower these professionals to perform their roles more effectively, allowing them to focus on what matters most: advising, advocating, negotiating, governing, and informing. We are powered by the talents of 26,000 employees across more than 70 countries, where everyone has the opportunity to contribute and grow professionally in flexible work environments that celebrate diversity and inclusion. In a time when objectivity, accuracy, fairness, and transparency are under threat, we consider it our responsibility to champion these values.

  • Toronto, Ontario, Canada Thomson Reuters Full time

    About the Opportunity In this position as Senior Site Reliability Engineering Manager, you will: Leadership and Mentorship: Inspire and guide a team of Site Reliability Engineers, offering technical expertise, coaching, and support to cultivate a collaborative, innovative, and continuously improving environment. Operational Excellence: Champion the...


  • Toronto, Ontario, Canada mccainfood Full time

    Job SummaryWe are seeking a highly skilled Senior Engineering Manager to lead our Site Reliability Engineering (SRE) and Observability team at McCain Foods. As a key member of our Global Technology department, you will be responsible for designing, implementing, and monitoring enterprise-grade secure fault-tolerant SRE and Observability infrastructure.Key...


  • Toronto, Ontario, Canada mccainfood Full time

    Job SummaryWe are seeking a highly skilled Senior Engineering Manager to lead our Site Reliability Engineering (SRE) and Observability team at McCain Foods. As a key member of our Global Technology department, you will be responsible for designing, implementing, and monitoring enterprise-grade secure fault-tolerant SRE and Observability infrastructure.Key...


  • Toronto, Ontario, Canada Lightspeed Full time

    Welcome to Lightspeed Are you exploring new career avenues? You may find an exciting opportunity here. We are seeking a Senior Site Reliability Engineer to enhance our operations at Lightspeed. Our team is dedicated to developing software solutions that empower merchants to expand their business effectively. In this role, you will be instrumental in...


  • Toronto, Ontario, Canada Lightspeed Full time

    Welcome to Lightspeed! Are you exploring new career paths or simply assessing the job market? You may find the opportunity you're looking for here. We are in search of a Senior Site Reliability Engineer to enhance our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed develops innovative software solutions that empower merchants to...


  • Toronto, Ontario, Canada Lightspeed Full time

    Welcome to Lightspeed Are you exploring new career paths or simply surveying the job market? You may find an exciting opportunity here. We are in search of a Senior Site Reliability Engineer to enhance our NuOrder by Lightspeed division in North America. NuORDER by Lightspeed develops innovative software solutions aimed at empowering merchants to...


  • Toronto, Ontario, Canada Behavox Full time

    About the PositionThe Behavox Platform is a robust, resilient, and high-performance system designed for the storage and processing of extensive data sets. We provide a comprehensive suite of APIs that facilitate the development of solutions enabling clients to effectively manage and analyze large volumes of information. As a Senior Site Reliability Engineer,...


  • Toronto, Ontario, Canada Northbridge Financial Corporation Full time

    Overview of the Senior Site Reliability Engineer Role at Northbridge Financial Corporation The Senior Site Reliability Engineer is responsible for the development and execution of Service Level Objectives (SLOs). This role involves managing complex service reliability solutions and processes, as well as mentoring and guiding junior SREs. Key...


  • Toronto, Ontario, Canada Northbridge Financial Corporation Full time

    Overview of the Senior Site Reliability Engineer Role at Northbridge Financial Corporation The Senior Site Reliability Engineer is responsible for the establishment and execution of Service Level Objectives (SLOs). This role involves managing complex service reliability solutions and processes, while also providing mentorship and guidance to junior...


  • Toronto, Ontario, Canada CIRCLE Full time

    About Circle: Circle is a pioneering financial technology firm positioned at the forefront of the evolving digital economy, where value can traverse globally, almost instantaneously, and at a lower cost compared to traditional settlement systems. This innovative layer of the internet unveils extraordinary opportunities for transactions, commerce, and...


  • Old Toronto, Ontario, Canada Akamai Full time

    Are you driven by the desire to enhance operational processes? Do you thrive in a multicultural team of engineering professionals? Join our elite Site Reliability team at Akamai. We focus on designing, developing, and managing applications and infrastructure that underpin Akamai's Compute offerings. Our expertise lies in creating and sustaining rapid,...


  • Toronto, Ontario, Canada Northbridge Financial Corporation Full time

    Overview of the Senior Site Reliability Engineer Role at Northbridge Financial Corporation The Senior Site Reliability Engineer is responsible for the establishment and execution of Service Level Objectives (SLOs). This role involves managing service reliability solutions and processes of increasing intricacy, along with mentoring and guiding junior...


  • Toronto, Ontario, Canada CIRCLE Full time

    About Circle: Circle operates at the forefront of financial technology, revolutionizing the way value is exchanged globally. Our innovative platform enables transactions to occur swiftly and cost-effectively, paving the way for a new era in commerce and finance. We are dedicated to enhancing economic prosperity and promoting inclusivity through our...


  • Toronto, Ontario, Canada Northbridge Financial Corporation Full time

    Job SummaryThe Senior Site Reliability Engineer at Northbridge Financial Corporation is responsible for overseeing the creation and implementation of Service Level Objectives (SLOs) to ensure the reliability and efficiency of our cloud-based solutions.Key ResponsibilitiesDesign, develop, test, and document advanced site reliability solutions within a complex...


  • Toronto, Ontario, Canada Northbridge Financial Corporation Full time

    Job SummaryThe Senior Site Reliability Engineer at Northbridge Financial Corporation is responsible for overseeing the creation and implementation of Service Level Objectives (SLOs) to ensure the reliability and efficiency of our cloud-based solutions.Key ResponsibilitiesDesign, develop, test, and document advanced site reliability solutions within a complex...


  • Toronto, Ontario, Canada Northbridge Financial Corporation Full time

    About the RoleThe Senior Site Reliability Engineer at Northbridge Financial Corporation is responsible for overseeing the creation and implementation of Service Level Objectives (SLOs) to ensure the reliability and efficiency of our cloud-based solutions.Key ResponsibilitiesDesign, develop, test, and document advanced site reliability solutions within a...


  • Toronto, Ontario, Canada Northbridge Financial Corporation Full time

    About the RoleThe Senior Site Reliability Engineer at Northbridge Financial Corporation is responsible for overseeing the creation and implementation of Service Level Objectives (SLOs) to ensure the reliability and efficiency of our cloud-based solutions.Key ResponsibilitiesDesign, develop, test, and document advanced site reliability solutions within a...


  • Toronto, Ontario, Canada CIRCLE Full time

    Circle operates at the forefront of financial technology, revolutionizing the way value is transferred across the globe. Our innovative infrastructure, including USDC, a blockchain-based dollar, empowers businesses and developers to leverage groundbreaking advancements in payments and commerce, ultimately enhancing global economic prosperity and inclusion. ...


  • Toronto, Ontario, Canada mccainfood Full time

    Job SummaryWe are seeking a highly skilled Senior Engineering Manager to lead our Site Reliability Engineering (SRE) and Observability team at McCain Foods. As a key member of our Global Technology department, you will be responsible for designing, implementing, and monitoring enterprise-grade secure fault-tolerant SRE and Observability infrastructure.Key...


  • Toronto, Ontario, Canada mccainfood Full time

    Job SummaryWe are seeking a highly skilled Senior Engineering Manager to lead our Site Reliability Engineering (SRE) and Observability team at McCain Foods. As a key member of our Global Technology department, you will be responsible for designing, implementing, and monitoring enterprise-grade secure fault-tolerant SRE and Observability infrastructure.Key...