Cloud Site Reliability Engineer

2 months ago


Toronto, Ontario, Canada Magnet Forensics Full time
About Magnet Forensics

Magnet Forensics is a leading provider of digital investigative software that empowers law enforcement agencies, government organizations, and private sector companies to acquire, analyze, and share evidence from computers, smartphones, tablets, and IoT-related devices.

Job Summary

We are seeking a highly skilled Cloud Site Reliability Engineer to join our team. As a Cloud Site Reliability Engineer, you will be responsible for designing and implementing our cloud deployment strategy, creating and maintaining our CI/CD pipeline, and leading the implementation of elements of our React websites.

Key Responsibilities
  • Design and implement our cloud deployment strategy to ensure scalability, reliability, and security.
  • Create and maintain our CI/CD pipeline to ensure seamless deployment and testing of our applications.
  • Lead the implementation of elements of our React websites to deliver cross-cutting business features.
  • Create and maintain monitoring and observability dashboards to help the team understand and debug production issues.
  • Assist the team in creating a scalable, fault-tolerant, and highly available application while keeping costs as low as possible.
  • Contribute to the development of performant, lean, and thorough test suites to ensure minimal issues escape to production.
  • Work with other Magnet teams to ensure our application is highly secure and help fix critical vulnerabilities as they are discovered.
  • Provide thought leadership, support, and coaching within the immediate team and across Engineering.
Requirements
  • Bachelor's degree in a Computer Science-related field or equivalent practical experience.
  • Significant experience operating a production SaaS application running on one or more major cloud providers (AWS, Azure, GCP).
  • Significant experience implementing CI/CD pipelines for SaaS products.
  • Experience working with Web Applications specifically React.
  • Experience working with Azure DevOps and Jenkins or similar tools to implement CI/CD pipelines.
  • Experience troubleshooting and recovering from SaaS disaster scenarios while maintaining calmness under pressure and excellent listening and communication skills.
  • Experience using Datadog, CloudWatch, or similar tools to troubleshoot production issues and create observability dashboards.
  • Experience writing and maintaining IaC (CDK, CloudFormation, Terraform) that provisions elastically scalable infrastructure.
  • Experience with performance and cost optimization of cloud infrastructure.
  • Experience implementing secure cloud solutions.
  • Experience writing and maintaining various types of system test suites (load tests, chaos tests).
  • Experience working with one or more general-purpose programming languages (C#, Python, JavaScript).
  • Proactivity around planning, organizing, and implementing large pieces of work efficiently.
  • Effectiveness at leadership, mentorship, and coaching – you encourage joint ownership of ops.
What We Offer
  • Competitive compensation package.
  • Generous time off policies.
  • Volunteer opportunities.
  • Reward and recognition programs.
  • Employee committees and resource groups.
  • Healthcare and retirement benefits.


  • Toronto, Ontario, Canada State Street Full time

    At State Street, we are seeking a Cloud Platform/Site Reliability Engineer to join our team.Key Responsibilities:Design and implement scalable cloud infrastructure solutions.Ensure high availability and reliability of cloud-based systems.Collaborate with cross-functional teams to drive cloud adoption and innovation.Requirements:Strong background in cloud...


  • Toronto, Ontario, Canada KPMG Canada Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at KPMG Canada. As a key member of our Managed Services team, you will be responsible for ensuring the smooth operation of our cloud-based services.Key ResponsibilitiesDesign, implement, and maintain scalable and reliable cloud-based systemsCollaborate with...


  • Toronto, Ontario, Canada Thomson Reuters Full time

    About the RoleWe are seeking a skilled Site Reliability Engineer - Cloud Expert to join our team at Thomson Reuters. In this role, you will be responsible for designing, implementing, and maintaining scalable cloud-based systems and services.As a Site Reliability Engineer, you will work closely with cross-functional teams to identify and resolve technical...


  • Toronto, Ontario, Canada KPMG Canada Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at KPMG Canada. As a key member of our Operations team, you will play a critical role in ensuring the smooth operation of our Managed Service.Key ResponsibilitiesDesign and implement scalable and reliable cloud infrastructure solutionsCollaborate with cross-functional...


  • Toronto, Ontario, Canada Northbridge Financial Corporation Full time

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Northbridge Financial Corporation. As a key member of our technical team, you will be responsible for designing, developing, and implementing advanced site reliability solutions to ensure the stability and performance of our systems.Key...


  • Toronto, Ontario, Canada KPMG-Canada Full time

    OverviewKPMG Canada is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the smooth operation of our cloud-based services.Key ResponsibilitiesDesign, implement, and maintain scalable and reliable cloud infrastructureCollaborate with development teams to automate...


  • Toronto, Ontario, Canada KPMG-Canada Full time

    OverviewKPMG Canada is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the smooth operation of our cloud-based services.Key ResponsibilitiesDesign, implement, and maintain scalable and reliable cloud infrastructureCollaborate with development teams to automate...


  • Toronto, Ontario, Canada KPMG-Canada Full time

    OverviewKPMG Canada is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the smooth operation of our cloud-based services.Key ResponsibilitiesDesign, implement, and maintain scalable and reliable cloud infrastructureCollaborate with development teams to automate...


  • Toronto, Ontario, Canada KPMG-Canada Full time

    OverviewKPMG Canada is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the smooth operation of our cloud-based services.Key ResponsibilitiesDesign, implement, and maintain scalable and reliable cloud infrastructureCollaborate with development teams to automate...


  • Old Toronto, Ontario, Canada Thomson Reuters Full time

    Site Reliability Engineer (Contract)Contract (5 months 29 days)Closed OpportunityThomson Reuters is seeking a skilled Site Reliability Engineer to join our Service Management Organization.The ideal candidate will have experience in IT Service Management, working with cloud providers, software development, and technology infrastructure.As a Site Reliability...


  • Toronto, Ontario, Canada Lorven Technologies Full time

    Job Title: Site Reliability Engineer (SRE)Location: Toronto, CADuration: Long termWe are seeking a highly skilled Site Reliability Engineer to join our team at Lorven Technologies. As a key member of our technical operations team, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key...


  • Toronto, Ontario, Canada Lorven Technologies Full time

    Job Title: Site Reliability Engineer (SRE)Location: Toronto, CADuration: Long termWe are seeking a highly skilled Site Reliability Engineer to join our team at Lorven Technologies. As a key member of our technical operations team, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key...


  • Old Toronto, Ontario, Canada Ascend Fundraising Solutions Full time

    Job Title: Site Reliability Engineer - AutomationWe are seeking a highly skilled Site Reliability Engineer to join our IT team at Ascend Fundraising Solutions. As a key member of our team, you will collaborate closely with our client services team to diagnose, troubleshoot, and resolve issues related to system reliability.Responsibilities:Take ownership of...


  • Old Toronto, Ontario, Canada Thomson Reuters Full time

    Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Thomson Reuters. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and efficiency of our cloud-based infrastructure.About the RoleIn this position, you will be responsible for:Designing and implementing scalable...


  • Old Toronto, Ontario, Canada Thomson Reuters Full time

    Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Thomson Reuters. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and efficiency of our cloud-based infrastructure.About the RoleIn this position, you will be responsible for:Designing and implementing scalable...


  • Old Toronto, Ontario, Canada Thomson Reuters Full time

    Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Thomson Reuters. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud-based infrastructure.About the RoleIn this role, you will be responsible for:Designing and implementing scalable systems and...


  • Old Toronto, Ontario, Canada Thomson Reuters Full time

    Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Thomson Reuters. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud-based infrastructure.About the RoleIn this role, you will be responsible for:Designing and implementing scalable systems and...


  • Toronto, Ontario, Canada Lorven Technologies Full time

    Job Title: Site Reliability EngineerLocation: Toronto, CADuration: Long termWe are seeking a highly skilled Site Reliability Engineer to join our team at Lorven Technologies. As a key member of our infrastructure team, you will be responsible for ensuring the reliability and scalability of our cloud-based systems.Key ResponsibilitiesA Bachelor's degree in...

  • Cloud Engineer

    4 weeks ago


    Toronto, Ontario, Canada Royal Bank of Canada Full time

    Job SummaryWe are seeking a highly skilled Senior Site Reliability Engineer to join our Client360 Advisor Platform team at Royal Bank of Canada. As a key member of our team, you will be responsible for ensuring the availability, scalability, and performance of our cloud-based applications built on the Salesforce platform.Key ResponsibilitiesMonitor and...

  • Cloud Engineer

    4 weeks ago


    Toronto, Ontario, Canada Royal Bank of Canada Full time

    Job SummaryWe are seeking a highly skilled Senior Site Reliability Engineer to join our Client360 Advisor Platform team at Royal Bank of Canada. As a key member of our team, you will be responsible for ensuring the availability, scalability, and performance of our cloud-based applications built on the Salesforce platform.Key ResponsibilitiesMonitor and...