Cloud Site Reliability Engineer

6 days ago


Toronto, Ontario, Canada Magnet Forensics Full time
About Magnet Forensics

Magnet Forensics is a leading provider of digital investigative software that empowers law enforcement agencies, government organizations, and private sector companies to acquire, analyze, and share evidence from computers, smartphones, tablets, and IoT-related devices.

Job Summary

We are seeking a highly skilled Cloud Site Reliability Engineer to join our team. As a Cloud Site Reliability Engineer, you will be responsible for designing and implementing our cloud deployment strategy, creating and maintaining our CI/CD pipeline, and leading the implementation of elements of our React websites.

Key Responsibilities
  • Design and implement our cloud deployment strategy to ensure scalability, reliability, and security.
  • Create and maintain our CI/CD pipeline to ensure seamless deployment and testing of our applications.
  • Lead the implementation of elements of our React websites to deliver cross-cutting business features.
  • Create and maintain monitoring and observability dashboards to help the team understand and debug production issues.
  • Assist the team in creating a scalable, fault-tolerant, and highly available application while keeping costs as low as possible.
  • Contribute to the development of performant, lean, and thorough test suites to ensure minimal issues escape to production.
  • Work with other Magnet teams to ensure our application is highly secure and help fix critical vulnerabilities as they are discovered.
  • Provide thought leadership, support, and coaching within the immediate team and across Engineering.
Requirements
  • Bachelor's degree in a Computer Science-related field or equivalent practical experience.
  • Significant experience operating a production SaaS application running on one or more major cloud providers (AWS, Azure, GCP).
  • Significant experience implementing CI/CD pipelines for SaaS products.
  • Experience working with Web Applications specifically React.
  • Experience working with Azure DevOps and Jenkins or similar tools to implement CI/CD pipelines.
  • Experience troubleshooting and recovering from SaaS disaster scenarios while maintaining calmness under pressure and excellent listening and communication skills.
  • Experience using Datadog, CloudWatch, or similar tools to troubleshoot production issues and create observability dashboards.
  • Experience writing and maintaining IaC (CDK, CloudFormation, Terraform) that provisions elastically scalable infrastructure.
  • Experience with performance and cost optimization of cloud infrastructure.
  • Experience implementing secure cloud solutions.
  • Experience writing and maintaining various types of system test suites (load tests, chaos tests).
  • Experience working with one or more general-purpose programming languages (C#, Python, JavaScript).
  • Proactivity around planning, organizing, and implementing large pieces of work efficiently.
  • Effectiveness at leadership, mentorship, and coaching – you encourage joint ownership of ops.
What We Offer
  • Competitive compensation package.
  • Generous time off policies.
  • Volunteer opportunities.
  • Reward and recognition programs.
  • Employee committees and resource groups.
  • Healthcare and retirement benefits.


  • Toronto, Ontario, Canada Lorven Technologies Full time

    Job Title: Site Reliability EngineerLocation: RemoteDuration: Long-termAbout the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Lorven Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud-based infrastructure.Key ResponsibilitiesDesign and implement...


  • Toronto, Ontario, Canada Lorven Technologies Full time

    Job Title: Site Reliability EngineerLocation: RemoteDuration: Long-termAbout the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Lorven Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud-based infrastructure.Key ResponsibilitiesDesign and implement...


  • Toronto, Ontario, Canada KPMG Canada Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at KPMG Canada. As a key member of our Managed Services team, you will be responsible for ensuring the smooth operation of our cloud-based services.Key ResponsibilitiesDesign, implement, and maintain scalable and reliable cloud-based systemsCollaborate with...


  • Toronto, Ontario, Canada KPMG Canada Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at KPMG Canada. As a key member of our Managed Services team, you will be responsible for ensuring the smooth operation of our cloud-based services.Key ResponsibilitiesDesign, implement, and maintain scalable and reliable cloud-based systemsCollaborate with...


  • Toronto, Ontario, Canada Lorven Technologies Full time

    Job Title: Site Reliability Engineer (SRE)Location: Remote Duration: Long-termAbout the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Lorven Technologies. The ideal candidate will have a strong background in cloud computing and a passion for ensuring the reliability and scalability of our systems.The successful candidate...


  • Toronto, Ontario, Canada Lorven Technologies Full time

    Job Title: Site Reliability Engineer (SRE)Location: Remote Duration: Long-termAbout the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Lorven Technologies. The ideal candidate will have a strong background in cloud computing and a passion for ensuring the reliability and scalability of our systems.The successful candidate...


  • Toronto, Ontario, Canada Northbridge Financial Corporation Full time

    Job SummaryThe Senior Site Reliability Engineer at Northbridge Financial Corporation is responsible for overseeing the creation and implementation of Service Level Objectives (SLOs) to ensure the reliability and efficiency of our cloud-based solutions.Key ResponsibilitiesDesign, develop, test, and document advanced site reliability solutions within a complex...


  • Toronto, Ontario, Canada Northbridge Financial Corporation Full time

    Job SummaryThe Senior Site Reliability Engineer at Northbridge Financial Corporation is responsible for overseeing the creation and implementation of Service Level Objectives (SLOs) to ensure the reliability and efficiency of our cloud-based solutions.Key ResponsibilitiesDesign, develop, test, and document advanced site reliability solutions within a complex...


  • Toronto, Ontario, Canada Northbridge Financial Corporation Full time

    About the RoleThe Senior Site Reliability Engineer at Northbridge Financial Corporation is responsible for overseeing the creation and implementation of Service Level Objectives (SLOs) to ensure the reliability and efficiency of our cloud-based solutions.Key ResponsibilitiesDesign, develop, test, and document advanced site reliability solutions within a...


  • Toronto, Ontario, Canada Northbridge Financial Corporation Full time

    About the RoleThe Senior Site Reliability Engineer at Northbridge Financial Corporation is responsible for overseeing the creation and implementation of Service Level Objectives (SLOs) to ensure the reliability and efficiency of our cloud-based solutions.Key ResponsibilitiesDesign, develop, test, and document advanced site reliability solutions within a...


  • Toronto, Ontario, Canada Forhyre Full time

    We are looking for someone that is generalist at heart, one who is curious, appreciates complexity, knows or wants to learn when to step back and when to dive deep. We call this role a Cloud Service Reliability Engineer. The Cloud Service Reliability Engineer will be responsible for effective design, execution, and maintenance of systems implemented on...


  • Toronto, Ontario, Canada CIRCLE Full time

    About CircleCircle is a pioneering financial technology company at the forefront of the emerging internet of money, where value can flow freely like other digital data - globally, nearly instantly and less expensively than traditional financial systems. This groundbreaking new internet layer opens up previously unimaginable possibilities for payments,...


  • Toronto, Ontario, Canada CIRCLE Full time

    About CircleCircle is a pioneering financial technology company at the forefront of the emerging internet of money, where value can flow freely like other digital data - globally, nearly instantly and less expensively than traditional financial systems. This groundbreaking new internet layer opens up previously unimaginable possibilities for payments,...


  • Toronto, Ontario, Canada Thomson Reuters Full time

    About the RoleWe are seeking an experienced Senior Cloud Reliability Engineer to join our Shared Capabilities, Service Reliability and Operation team in Toronto. As a key member of our team, you will be responsible for implementing site reliability engineering and DevOps best practices, ensuring the scalability, reliability, and security of our cloud-based...


  • Toronto, Ontario, Canada Thomson Reuters Full time

    About the RoleWe are seeking an experienced Senior Cloud Reliability Engineer to join our Shared Capabilities, Service Reliability and Operation team in Toronto. As a key member of our team, you will be responsible for implementing site reliability engineering and DevOps best practices, ensuring the scalability, reliability, and security of our cloud-based...


  • Toronto, Ontario, Canada Thomson Reuters Full time

    About the RoleWe are seeking an experienced Senior Cloud Reliability Engineer to join our Shared Capabilities, Service Reliability and Operation team in Toronto. As a key member of our team, you will be responsible for implementing site reliability engineering and DevOps best practices, ensuring the scalability, reliability, and security of our cloud-based...


  • Toronto, Ontario, Canada Thomson Reuters Full time

    About the RoleWe are seeking an experienced Senior Cloud Reliability Engineer to join our Shared Capabilities, Service Reliability and Operation team in Toronto. As a key member of our team, you will be responsible for implementing site reliability engineering and DevOps best practices, ensuring the scalability, reliability, and security of our cloud-based...


  • Toronto, Ontario, Canada Criteo Full time

    About the Role:Criteo is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our Product Reliability Engineering (PRE) group, you will play a critical role in ensuring the reliability and scalability of our applications and systems.Key Responsibilities:Collaborate with product engineering teams to design, develop,...


  • Toronto, Ontario, Canada Criteo Full time

    About the Role:Criteo is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our Product Reliability Engineering (PRE) group, you will play a critical role in ensuring the reliability and scalability of our applications and systems.Key Responsibilities:Collaborate with product engineering teams to design, develop,...

  • Cloud Engineer

    6 days ago


    Toronto, Ontario, Canada Thomson Reuters Full time

    About the RoleThomson Reuters is seeking a Senior Site Reliability Engineer to join our Service Management, Technology team. This role calls for an individual who is capable of analyzing complex customer problems and assessing the scope of impact, while mitigating customer impact of issues and executing workarounds.Key ResponsibilitiesIdentify options for...