Cloud Site Reliability Engineer

2 months ago


Toronto, Ontario, Canada Magnet Forensics Full time
About Magnet Forensics

Magnet Forensics is a leading provider of digital investigative software that empowers law enforcement agencies, government organizations, and private sector companies to acquire, analyze, and share evidence from computers, smartphones, tablets, and IoT-related devices.

Job Summary

We are seeking a highly skilled Cloud Site Reliability Engineer to join our team. As a Cloud Site Reliability Engineer, you will be responsible for designing and implementing our cloud deployment strategy, creating and maintaining our CI/CD pipeline, and leading the implementation of elements of our React websites.

Key Responsibilities
  • Design and implement our cloud deployment strategy to ensure scalability, reliability, and security.
  • Create and maintain our CI/CD pipeline to ensure seamless deployment and testing of our applications.
  • Lead the implementation of elements of our React websites to deliver cross-cutting business features.
  • Create and maintain monitoring and observability dashboards to help the team understand and debug production issues.
  • Assist the team in creating a scalable, fault-tolerant, and highly available application while keeping costs as low as possible.
  • Contribute to the development of performant, lean, and thorough test suites to ensure minimal issues escape to production.
  • Work with other Magnet teams to ensure our application is highly secure and help fix critical vulnerabilities as they are discovered.
  • Provide thought leadership, support, and coaching within the immediate team and across Engineering.
Requirements
  • Bachelor's degree in a Computer Science-related field or equivalent practical experience.
  • Significant experience operating a production SaaS application running on one or more major cloud providers (AWS, Azure, GCP).
  • Significant experience implementing CI/CD pipelines for SaaS products.
  • Experience working with web applications, specifically React.
  • Experience working with Azure DevOps and Jenkins or similar tools to implement CI/CD pipelines.
  • Experience troubleshooting and recovering from SaaS disaster scenarios while staying calm under pressure, with excellent listening and communication skills.
  • Experience using Datadog, CloudWatch, or similar tools to troubleshoot production issues and create observability dashboards.
  • Experience writing and maintaining IaC (CDK, CloudFormation, Terraform) that provisions elastically scalable infrastructure.
  • Experience with performance and cost optimization of cloud infrastructure.
  • Experience implementing secure cloud solutions.
  • Experience writing and maintaining various types of system test suites (load tests, chaos tests).
  • Experience working with one or more general-purpose programming languages (C#, Python, JavaScript).
  • Proactivity around planning, organizing, and implementing large pieces of work efficiently.
  • Effectiveness at leadership, mentorship, and coaching – you encourage joint ownership of ops.
What We Offer
  • Competitive compensation package.
  • Generous time off policies.
  • Volunteer opportunities.
  • Reward and recognition programs.
  • Employee committees and resource groups.
  • Healthcare and retirement benefits.


  • Toronto, Ontario, Canada State Street Full time

    At State Street, we are seeking a Cloud Platform/Site Reliability Engineer to join our team. Key Responsibilities: Design and implement scalable cloud infrastructure solutions. Ensure high availability and reliability of cloud-based systems. Collaborate with cross-functional teams to drive cloud adoption and innovation. Requirements: Strong background in cloud...


  • Toronto, Ontario, Canada Thomson Reuters Full time

    About the Role: We are seeking a skilled Site Reliability Engineer - Cloud Expert to join our team at Thomson Reuters. In this role, you will be responsible for designing, implementing, and maintaining scalable cloud-based systems and services. As a Site Reliability Engineer, you will work closely with cross-functional teams to identify and resolve technical...


  • Toronto, Ontario, Canada KPMG Canada Full time

    About the Role: We are seeking a highly skilled Site Reliability Engineer to join our team at KPMG Canada. As a key member of our Operations team, you will play a critical role in ensuring the smooth operation of our Managed Service. Key Responsibilities: Design and implement scalable and reliable cloud infrastructure solutions. Collaborate with cross-functional...


  • Old Toronto, Ontario, Canada Thomson Reuters Full time

    Site Reliability Engineer (Contract). Contract (5 months 29 days). Closed Opportunity. Thomson Reuters is seeking a skilled Site Reliability Engineer to join our Service Management Organization. The ideal candidate will have experience in IT Service Management, working with cloud providers, software development, and technology infrastructure. As a Site Reliability...


  • Old Toronto, Ontario, Canada Ascend Fundraising Solutions Full time

    Job Title: Site Reliability Engineer - Automation. We are seeking a highly skilled Site Reliability Engineer to join our IT team at Ascend Fundraising Solutions. As a key member of our team, you will collaborate closely with our client services team to diagnose, troubleshoot, and resolve issues related to system reliability. Responsibilities: Take ownership of...


  • Old Toronto, Ontario, Canada Thomson Reuters Full time

    Site Reliability Engineer. We are seeking a highly skilled Site Reliability Engineer to join our team at Thomson Reuters. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and efficiency of our cloud-based infrastructure. About the Role: In this position, you will be responsible for: Designing and implementing scalable...


  • Toronto, Ontario, Canada Thomson Reuters Full time

    We are seeking an experienced Senior SRE to join our Shared Capabilities, Service Reliability and Operation team in Toronto. As a Cloud Native Site Reliability Engineer, you will be responsible for implementing site reliability engineering and DevOps best practices, building and maintaining monitoring for all aspects of infrastructure, micro-services, usage...

  • Cloud Engineer

    1 month ago


    Toronto, Ontario, Canada Royal Bank of Canada Full time

    Job Summary: We are seeking a highly skilled Senior Site Reliability Engineer to join our Client360 Advisor Platform team at Royal Bank of Canada. As a key member of our team, you will be responsible for ensuring the availability, scalability, and performance of our cloud-based applications built on the Salesforce platform. Key Responsibilities: Monitor and...


  • Toronto, Ontario, Canada Lorven Technologies Full time

    Job Title: Site Reliability Engineer. Location: Toronto, CA. Duration: Long term. We are seeking a highly skilled Site Reliability Engineer to join our team at Lorven Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud infrastructure. Key Responsibilities: A Bachelor's degree in Computer...


  • Toronto, Ontario, Canada Northbridge Financial Corporation Full time

    Site Reliability Engineer Role Overview: The Senior Site Reliability Engineer at Northbridge Financial Corporation is responsible for overseeing the creation and implementation of Service Level Objectives (SLOs). This role involves handling service reliability solutions and processes of increasing complexity, as well as mentoring and leading less experienced...


  • Old Toronto, Ontario, Canada TD Bank Full time

    Job Title: AWS Site Reliability Engineer. TD Bank is seeking a highly skilled AWS Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems. Key Responsibilities: Design and implement scalable and reliable cloud-based systems using AWS...


  • Toronto, Ontario, Canada Thomson Reuters Full time

    About the Role: We are seeking an experienced Senior Site Reliability Engineer to join our Shared Capabilities, Service Reliability and Operation team in Toronto. As a key member of our team, you will be responsible for implementing site reliability engineering and DevOps best practices, building and maintaining monitoring for all aspects of infrastructure,...


  • Toronto, Ontario, Canada The Home Depot Canada Full time

    Unlock Your Potential at The Home Depot Canada. As a Site Reliability Engineering Manager, you will lead a team of Site Reliability Engineers to ensure the reliability, performance, and operational support of our eCommerce systems, with a focus on Google Cloud Platform (GCP) environments. Key Responsibilities: Lead and mentor a team of Site Reliability Engineers...


  • Toronto, Ontario, Canada The Home Depot Canada Full time

    About The Home Depot Canada: The Home Depot Canada is a leading retailer of home improvement products and services, committed to delivering exceptional customer experiences and driving business growth. We are seeking a highly skilled Cloud Reliability Engineering Manager to join our team and lead our Site Reliability Engineers in ensuring the reliability,...


  • Old Toronto, Ontario, Canada https://www.energyjobline.com/sitemap Full time

    Product: Global Platform Engineering. Your Role: As a key member of our Global Platform Engineering team, you will be responsible for overseeing a team of Site Reliability Engineers and ensuring the smooth operation of our cloud-based infrastructure. Lead a team of Site Reliability Engineers to ensure the reliability and scalability of our cloud-based...

