Senior Service Reliability Engineer

4 weeks ago


Old Toronto, Canada State Street Corporation Full time
p>Senior Site Reliability Engineer

Who we are looking for
The Senior Site Reliability Engineer role is a hands-on, technology-focused role that is integral to the Charles River platform. The candidate must have excellent communication, analytical and technical skills.

Working with SaaS, product and engineering teams, the SRE will help design and implement monitoring, alerting and automation systems. The SRE's direct role pertaining to Observability involves implementing various tooling to enable proactive monitoring across the organization. The SRE will have a driving hand in addressing the technical needs of Observability to ensure our clients can be monitored, observed, and proactively supported by SaaS and engineering teams across Charles River products. p>

Why this role is important to us
The team you will be joining is part of the Charles River Investment Management Solution (CRIMS), a market leader in providing a comprehensive end-to-end investment management platform covering front, middle and back office. The Charles River IMS platform offers portfolio management, compliance, order and execution management, post-trade processing, data provisioning and management, performance measurement, as well as other key capabilities important to the investment lifecycle. SRE plays a key role in ensuring performance, stability and availability of the platform.

Key Responsibilities

  • Implement SaaS and Engineering Observability requirements in Dynatrace and other tools for proactive monitoring of client issues
  • Collaborate with Product, Engineering, and SaaS Ops teams
  • Identify and implement required configurations to meet proactive monitoring needs across the organization
  • Contribute to deployment and configuration management automation
  • Build, own, and maintain SRE and Operational dashboards, SLOs, Alerts, Synthetic Monitors, and other Observability configurations that ensure client and application health and performance
  • Integrate Observability and other monitoring alerts into incident management systems such as ServiceNow and OpsRamp
  • Identify required infrastructure to support Observability according to the growth of client environments and monitoring sources
  • Administrates SRE tooling and track/manage licenses and renewals
  • Works with functional and platform engineering for new Observability requirements
  • Triage and respond to errors in monitoring systems and integrations
  • Actively monitor Observability issues and identify potential areas of improvement across the product, while working with respective teams
  • Participate in vendor support calls
  • Document Observability best practices and learnings for engineering and SaaS Ops teams to follow

Qualifications

  • 10+ years of site reliability experience supporting high availability SaaS platforms
  • Bachelor's degree in Computer Science or IT
  • Experience with full stack and cloud monitoring solutions
  • Extensive experience implementing Dynatrace and APM within complex architectures with high-load transactions
  • Familiarity with different technology stacks across desktop, cloud, and web
  • Log analysis experience
  • Experience with Ansible, CICD pipelines, Git and other automation technologies
  • Experience with Java or similar programming languages
  • Experience with workload automation
  • Knowledge and experience of scrum/Agile principles
  • Preferred technical background in financial trading systems
  • Excellent 'soft skills' including leadership, mentoring, and technology evangelism

About State Street
What we do. State Street is one of the largest custodian banks, asset managers and asset intelligence companies in the world. From technology to product innovation we're making our mark on the financial services industry. For more than two centuries, we've been helping our clients safeguard and steward the investments of millions of people. We provide investment servicing, data & analytics, investment research & trading and investment management to institutional clients.



  • Old Toronto, Canada Data Engineer Jobs Full time

    As a Senior Data Engineer at Mozilla, you will play a pivotal role in shaping the company's data strategy and driving business growth through informed decision-making.About the RoleWe are seeking an experienced data engineer to join our Analytics Engineering team. In this role, you will work closely with data scientists to design and implement scalable data...


  • Toronto, Canada Flinks Full time

    About Flinks At Flinks, we’re not just building data infrastructure; we’re shaping the future of finance. Our mission is to empower consumers with control over their financial data and unlock its full potential. We equip fintechs and banks with cutting-edge data tools, enabling them to create innovative, client-centric products that are transforming the...


  • Old Toronto, Canada Scotiabank Full time

    Title: Senior Service Reliability ManagerRequisition ID: 205554Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture.The Role:As a member of the Global Payments and Core Banking Systems (PCBE) Systems Reliability team, the Senior Manager, Systems Reliability will lead and collaborate with a team that will work...


  • Old Toronto, Canada Scotiabank Full time

    Requisition ID: 205554Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture. The Role: As a member of the Global Payments and Core Banking Systems (PCBE) Systems Reliability team, the Senior Manager, Systems Reliability will lead and collaborate with a team that will work with engineering teams, infrastructure...


  • Toronto, Canada Northbridge Financial Corporation Full time

    What is it like to be a Senior Site Reliability Engineer at Northbridge Financial The Senior Site Reliability Engineer oversees the creation and implementation of Service Level Objectives (SLOs). The Senior SRE handles service reliability solutions and processes of increasing complexity, and are responsible for mentoring and leading less experienced...


  • Old Toronto, Canada Scotiabank Full time

    About the RoleThe Senior Service Reliability Manager will lead and collaborate with a team to continuously improve the stability and reliability of PCBE systems through Site Reliability Engineering (SRE) based practices.This includes continuous people, process, and technology enhancements in support of our rapidly changing technology product portfolio.You...


  • Toronto, Canada Thomson Reuters Full time

    Are you passionate about the chance to bring your technical experience to a digital organization? The Onesource team is looking to add a Senior Site Reliability Engineer to a well-established global digital team. This position requires someone who is a passionate learner, an independent thinker, wor


  • Toronto, Ontario, Canada Interac Corp. Full time

    Senior System Reliability EngineerWe are seeking a skilled Senior System Reliability Engineer to join our team at Interac Corp. in Canada.About the Role:This is an exciting opportunity to work on high-performance payment systems, focusing on Site (Application) Reliability Engineering activities, including proactive monitoring, responding to alerts and...


  • Old Toronto, Canada Soda Full time

    Job Description Job Title: Site Reliability Engineer Location: Poland - Fully Remote Salary: 324K PLN or 27.3K monthly Start: ASAP Stack: AWS, Docker, Kubernetes, Terraform, Jenkins, Ansible, Linux, JavaScript, and Lambda. Are you a seasoned DevOps/SRE professional passionate about building high-performance, scalable systems? I am working with a Media/IT...


  • Greater Toronto Area, Canada GlossGenius Full time

    About GlossGenius GlossGenius is building an ecosystem enabling entrepreneurs to succeed. We empower small business owners to focus on being creators, not admins, by offering a range of business management tools including booking and scheduling, marketing, analytics, payment processing and much more.  Over 75,000 small business owners have chosen to...


  • Old Toronto, Canada Chelsea Avondale Full time

    Chelsea Avondale is the world’s most cutting-edge home insurance group. We have developed sophisticated risk modeling and insurance pricing technologies for home insurance and deploy that technology through our own insurance company. Our team consists of some of the brightest minds in insurance, software development, finance, and operations. Our group...


  • Toronto, Ontario, Canada Vantage Full time

    About the Role:We are seeking a highly skilled Senior Site Reliability Engineer to join our team at Vantage. As a key member of our engineering team, you will play a pivotal role in ensuring the seamless operation of our large-scale, distributed systems. Your expertise in software and systems engineering will be instrumental in building, maintaining, and...


  • Old Toronto, Canada Lorien Full time

    Hybrid - Manchester We are currently working with a leading gambling company dedicated to providing exceptional gaming experiences. They are looking for an experienced Site Reliability Engineer with a strong skill set in system reliability to join its world-class technology team. This role is ideal for someone who has 4+ years of experience within the...


  • Toronto, Ontario, Canada Randstad Canada Full time

    Job Title: Senior Electrical Engineer, Maintenance and ReliabilityWe are seeking an experienced Senior Electrical Engineer, Maintenance and Reliability to join our team at Randstad Canada. This role will be responsible for providing technical support and guidance to the engineering department, ensuring the technical soundness, reliability, safety, and...


  • Toronto, Canada Vantage Full time

    Senior Site Reliability Engineer / DevOps Engineer Are you passionate about ensuring the seamless operation of large-scale, distributed, and robust systems? Do you thrive on optimizing performance, increasing reliability, and automating tasks to create more efficient processes? Are you hungry for learning? If so, we would want to chat to you! As a...


  • Toronto, Canada Thomson Reuters Full time

    Description Thomson Reuters is seeking a Senior Site Reliability Engineer to join our Service Management, Technology team. This role calls for an individual who is capable of analyzing customer problems of high complexity and assessing the scope of impact, while mitigating customer impact of issues and executing work arounds. Willingness to learn is...


  • Old Toronto, Canada Ascend Fundraising Solutions Full time

    We are seeking a skilled Cloud Reliability Engineer to collaborate with our IT team in Toronto. In this role, you will work closely with the client services team to diagnose, troubleshoot, and resolve system reliability issues.Responsibilities:Take ownership of customer-reported issues and drive them to resolution.Develop proactive measures to prevent...


  • Old Toronto, Canada TD Bank Full time

    Site Reliability Engineer Site Reliability Engineer Work Location: Canada Hours: 37.5 Line of Business: Technology Solutions Pay Details: We’re committed to providing fair and equitable compensation to all our colleagues. As a candidate, we encourage you to have an open dialogue with a member of


  • Old Toronto, Canada Sentry Full time

    About the role The Site Reliability Engineering team is responsible for the deployment, configuration, maintenance, and monitoring of Sentry's hosted platform. We do this by leveraging automation tools to automatically spin up and scale services to meet the traffic demands of 1,000,000+ developers.


  • Old Toronto, Canada Data Engineer Jobs Full time

    To learn the Hiring Ranges for this position, please select your location from the Apply Now dropdown menu.The Mozilla Corporation is wholly owned by the non-profit 501(c) Mozilla Foundation. This means we aren't beholden to any shareholders --- only to our mission. Along with thousands of volunteer contributors and collaborators all over the world,...