Cloud Site Reliability Engineer
6 days ago
Magnet Forensics is a leading provider of digital investigative software that empowers law enforcement agencies, government organizations, and private sector companies to acquire, analyze, and share evidence from computers, smartphones, tablets, and IoT-related devices.
Job Summary
We are seeking a highly skilled Cloud Site Reliability Engineer to join our team. As a Cloud Site Reliability Engineer, you will be responsible for designing and implementing our cloud deployment strategy, creating and maintaining our CI/CD pipeline, and leading the implementation of elements of our React websites.
Key Responsibilities
- Design and implement our cloud deployment strategy to ensure scalability, reliability, and security.
- Create and maintain our CI/CD pipeline to ensure seamless deployment and testing of our applications.
- Lead the implementation of elements of our React websites to deliver cross-cutting business features.
- Create and maintain monitoring and observability dashboards to help the team understand and debug production issues.
- Assist the team in creating a scalable, fault-tolerant, and highly available application while keeping costs as low as possible.
- Contribute to the development of performant, lean, and thorough test suites to ensure minimal issues escape to production.
- Work with other Magnet teams to ensure our application is highly secure and help fix critical vulnerabilities as they are discovered.
- Provide thought leadership, support, and coaching within the immediate team and across Engineering.
Qualifications
- Bachelor's degree in a Computer Science-related field or equivalent practical experience.
- Significant experience operating a production SaaS application running on one or more major cloud providers (AWS, Azure, GCP).
- Significant experience implementing CI/CD pipelines for SaaS products.
- Experience working with web applications, specifically React.
- Experience working with Azure DevOps and Jenkins or similar tools to implement CI/CD pipelines.
- Experience troubleshooting and recovering from SaaS disaster scenarios while staying calm under pressure, with excellent listening and communication skills.
- Experience using Datadog, CloudWatch, or similar tools to troubleshoot production issues and create observability dashboards.
- Experience writing and maintaining IaC (CDK, CloudFormation, Terraform) that provisions elastically scalable infrastructure.
- Experience with performance and cost optimization of cloud infrastructure.
- Experience implementing secure cloud solutions.
- Experience writing and maintaining various types of system test suites (load tests, chaos tests).
- Experience working with one or more general-purpose programming languages (C#, Python, JavaScript).
- Proactivity around planning, organizing, and implementing large pieces of work efficiently.
- Effectiveness at leadership, mentorship, and coaching – you encourage joint ownership of ops.
Benefits
- Competitive compensation package.
- Generous time off policies.
- Volunteer opportunities.
- Reward and recognition programs.
- Employee committees and resource groups.
- Healthcare and retirement benefits.
-
Site Reliability Engineer
2 days ago
Toronto, Ontario, Canada Lorven Technologies Full time
Job Title: Site Reliability Engineer. Location: Remote. Duration: Long-term. About the Role: We are seeking a highly skilled Site Reliability Engineer to join our team at Lorven Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud-based infrastructure. Key Responsibilities: Design and implement...
-
Site Reliability Engineer
3 days ago
Toronto, Ontario, Canada KPMG Canada Full time
About the Role: We are seeking a highly skilled Site Reliability Engineer to join our team at KPMG Canada. As a key member of our Managed Services team, you will be responsible for ensuring the smooth operation of our cloud-based services. Key Responsibilities: Design, implement, and maintain scalable and reliable cloud-based systems. Collaborate with...
-
Site Reliability Engineer
7 days ago
Toronto, Ontario, Canada Lorven Technologies Full time
Job Title: Site Reliability Engineer (SRE). Location: Remote. Duration: Long-term. About the Role: We are seeking a highly skilled Site Reliability Engineer to join our team at Lorven Technologies. The ideal candidate will have a strong background in cloud computing and a passion for ensuring the reliability and scalability of our systems. The successful candidate...
-
Site Reliability Engineer
6 days ago
Toronto, Ontario, Canada Lorven Technologies Full time
Job Title: Site Reliability Engineer (SRE). Location: Remote. Duration: Long-term. About the Role: We are seeking a highly skilled Site Reliability Engineer to join our team at Lorven Technologies. The ideal candidate will have a strong background in cloud computing and a passion for ensuring the reliability and scalability of our systems. The successful candidate...
-
Senior Site Reliability Engineer
6 days ago
Toronto, Ontario, Canada Northbridge Financial Corporation Full time
Job Summary: The Senior Site Reliability Engineer at Northbridge Financial Corporation is responsible for overseeing the creation and implementation of Service Level Objectives (SLOs) to ensure the reliability and efficiency of our cloud-based solutions. Key Responsibilities: Design, develop, test, and document advanced site reliability solutions within a complex...
-
Senior Site Reliability Engineer
7 days ago
Toronto, Ontario, Canada Northbridge Financial Corporation Full time
Job Summary: The Senior Site Reliability Engineer at Northbridge Financial Corporation is responsible for overseeing the creation and implementation of Service Level Objectives (SLOs) to ensure the reliability and efficiency of our cloud-based solutions. Key Responsibilities: Design, develop, test, and document advanced site reliability solutions within a complex...
-
Senior Site Reliability Engineer
2 days ago
Toronto, Ontario, Canada Northbridge Financial Corporation Full time
About the Role: The Senior Site Reliability Engineer at Northbridge Financial Corporation is responsible for overseeing the creation and implementation of Service Level Objectives (SLOs) to ensure the reliability and efficiency of our cloud-based solutions. Key Responsibilities: Design, develop, test, and document advanced site reliability solutions within a...
-
Cloud Service Reliability Engineer
3 months ago
Toronto, Ontario, Canada Forhyre Full time
We are looking for someone who is a generalist at heart: curious, appreciative of complexity, and who knows, or wants to learn, when to step back and when to dive deep. We call this role a Cloud Service Reliability Engineer. The Cloud Service Reliability Engineer will be responsible for effective design, execution, and maintenance of systems implemented on...
-
Toronto, Ontario, Canada CIRCLE Full time
About Circle: Circle is a pioneering financial technology company at the forefront of the emerging internet of money, where value can flow freely like other digital data: globally, nearly instantly, and less expensively than traditional financial systems. This groundbreaking new internet layer opens up previously unimaginable possibilities for payments,...
-
Senior Cloud Reliability Engineer
2 days ago
Toronto, Ontario, Canada Thomson Reuters Full time
About the Role: We are seeking an experienced Senior Cloud Reliability Engineer to join our Shared Capabilities, Service Reliability and Operation team in Toronto. As a key member of our team, you will be responsible for implementing site reliability engineering and DevOps best practices, ensuring the scalability, reliability, and security of our cloud-based...
-
Senior Cloud Reliability Engineer
3 days ago
Toronto, Ontario, Canada Thomson Reuters Full time
About the Role: We are seeking an experienced Senior Cloud Reliability Engineer to join our Shared Capabilities, Service Reliability and Operation team in Toronto. As a key member of our team, you will be responsible for implementing site reliability engineering and DevOps best practices, ensuring the scalability, reliability, and security of our cloud-based...
-
Senior Cloud Reliability Engineer
6 days ago
Toronto, Ontario, Canada Thomson Reuters Full time
About the Role: We are seeking an experienced Senior Cloud Reliability Engineer to join our Shared Capabilities, Service Reliability and Operation team in Toronto. As a key member of our team, you will be responsible for implementing site reliability engineering and DevOps best practices, ensuring the scalability, reliability, and security of our cloud-based...
-
Senior Site Reliability Engineer
7 days ago
Toronto, Ontario, Canada Criteo Full time
About the Role: Criteo is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our Product Reliability Engineering (PRE) group, you will play a critical role in ensuring the reliability and scalability of our applications and systems. Key Responsibilities: Collaborate with product engineering teams to design, develop,...
-
Senior Site Reliability Engineer
6 days ago
Toronto, Ontario, Canada Criteo Full time
About the Role: Criteo is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our Product Reliability Engineering (PRE) group, you will play a critical role in ensuring the reliability and scalability of our applications and systems. Key Responsibilities: Collaborate with product engineering teams to design, develop,...
-
Cloud Engineer
6 days ago
Toronto, Ontario, Canada Thomson Reuters Full time
About the Role: Thomson Reuters is seeking a Senior Site Reliability Engineer to join our Service Management, Technology team. This role calls for an individual who is capable of analyzing complex customer problems and assessing the scope of impact, while mitigating customer impact of issues and executing workarounds. Key Responsibilities: Identify options for...