Cloud Site Reliability Engineer
2 months ago
Magnet Forensics is a leading provider of digital investigative software that empowers law enforcement agencies, government organizations, and private sector companies to acquire, analyze, and share evidence from computers, smartphones, tablets, and IoT-related devices.
Job Summary
We are seeking a highly skilled Cloud Site Reliability Engineer to join our team. As a Cloud Site Reliability Engineer, you will be responsible for designing and implementing our cloud deployment strategy, creating and maintaining our CI/CD pipeline, and leading the implementation of elements of our React websites.
Key Responsibilities
- Design and implement our cloud deployment strategy to ensure scalability, reliability, and security.
- Create and maintain our CI/CD pipeline to ensure seamless deployment and testing of our applications.
- Lead the implementation of elements of our React websites to deliver cross-cutting business features.
- Create and maintain monitoring and observability dashboards to help the team understand and debug production issues.
- Assist the team in creating a scalable, fault-tolerant, and highly available application while keeping costs as low as possible.
- Contribute to the development of performant, lean, and thorough test suites to ensure minimal issues escape to production.
- Work with other Magnet teams to ensure our application is highly secure and help fix critical vulnerabilities as they are discovered.
- Provide thought leadership, support, and coaching within the immediate team and across Engineering.
- Bachelor's degree in a Computer Science-related field or equivalent practical experience.
- Significant experience operating a production SaaS application running on one or more major cloud providers (AWS, Azure, GCP).
- Significant experience implementing CI/CD pipelines for SaaS products.
- Experience working with web applications, specifically React.
- Experience working with Azure DevOps and Jenkins or similar tools to implement CI/CD pipelines.
- Experience troubleshooting and recovering from SaaS disaster scenarios while staying calm under pressure, with excellent listening and communication skills.
- Experience using Datadog, CloudWatch, or similar tools to troubleshoot production issues and create observability dashboards.
- Experience writing and maintaining IaC (CDK, CloudFormation, Terraform) that provisions elastically scalable infrastructure.
- Experience with performance and cost optimization of cloud infrastructure.
- Experience implementing secure cloud solutions.
- Experience writing and maintaining various types of system test suites (load tests, chaos tests).
- Experience working with one or more general-purpose programming languages (C#, Python, JavaScript).
- Proactivity around planning, organizing, and implementing large pieces of work efficiently.
- Effectiveness at leadership, mentorship, and coaching – you encourage joint ownership of ops.
- Competitive compensation package.
- Generous time off policies.
- Volunteer opportunities.
- Reward and recognition programs.
- Employee committees and resource groups.
- Healthcare and retirement benefits.
-
Cloud Platform/Site Reliability Engineer
3 weeks ago
Toronto, Ontario, Canada State Street Full time
At State Street, we are seeking a Cloud Platform/Site Reliability Engineer to join our team.
Key Responsibilities:
- Design and implement scalable cloud infrastructure solutions.
- Ensure high availability and reliability of cloud-based systems.
- Collaborate with cross-functional teams to drive cloud adoption and innovation.
Requirements:
- Strong background in cloud...
-
Site Reliability Engineer
3 weeks ago
Toronto, Ontario, Canada Thomson Reuters Full time
About the Role
We are seeking a skilled Site Reliability Engineer - Cloud Expert to join our team at Thomson Reuters. In this role, you will be responsible for designing, implementing, and maintaining scalable cloud-based systems and services. As a Site Reliability Engineer, you will work closely with cross-functional teams to identify and resolve technical...
-
Site Reliability Engineer
4 weeks ago
Toronto, Ontario, Canada KPMG Canada Full time
About the Role
We are seeking a highly skilled Site Reliability Engineer to join our team at KPMG Canada. As a key member of our Operations team, you will play a critical role in ensuring the smooth operation of our Managed Service.
Key Responsibilities
- Design and implement scalable and reliable cloud infrastructure solutions
- Collaborate with cross-functional...
-
Site Reliability Engineer
4 weeks ago
Old Toronto, Ontario, Canada Thomson Reuters Full time
Site Reliability Engineer (Contract)
Contract (5 months 29 days)
Closed Opportunity
Thomson Reuters is seeking a skilled Site Reliability Engineer to join our Service Management Organization. The ideal candidate will have experience in IT Service Management, working with cloud providers, software development, and technology infrastructure. As a Site Reliability...
-
Site Reliability Engineer
4 weeks ago
Old Toronto, Ontario, Canada Ascend Fundraising Solutions Full time
Job Title: Site Reliability Engineer - Automation
We are seeking a highly skilled Site Reliability Engineer to join our IT team at Ascend Fundraising Solutions. As a key member of our team, you will collaborate closely with our client services team to diagnose, troubleshoot, and resolve issues related to system reliability.
Responsibilities:
- Take ownership of...
-
Site Reliability Engineer
1 month ago
Old Toronto, Ontario, Canada Thomson Reuters Full time
Site Reliability Engineer
We are seeking a highly skilled Site Reliability Engineer to join our team at Thomson Reuters. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and efficiency of our cloud-based infrastructure.
About the Role
In this position, you will be responsible for:
- Designing and implementing scalable...
-
Cloud Native Site Reliability Engineer
3 days ago
Toronto, Ontario, Canada Thomson Reuters Full time
We are seeking an experienced Senior SRE to join our Shared Capabilities, Service Reliability and Operation team in Toronto. As a Cloud Native Site Reliability Engineer, you will be responsible for implementing site reliability engineering and DevOps best practices, building and maintaining monitoring for all aspects of infrastructure, micro-services, usage...
-
Cloud Engineer
1 month ago
Toronto, Ontario, Canada Royal Bank of Canada Full time
Job Summary
We are seeking a highly skilled Senior Site Reliability Engineer to join our Client360 Advisor Platform team at Royal Bank of Canada. As a key member of our team, you will be responsible for ensuring the availability, scalability, and performance of our cloud-based applications built on the Salesforce platform.
Key Responsibilities
- Monitor and...
-
Site Reliability Engineer
2 months ago
Toronto, Ontario, Canada Lorven Technologies Full time
Job Title: Site Reliability Engineer
Location: Toronto, CA
Duration: Long term
We are seeking a highly skilled Site Reliability Engineer to join our team at Lorven Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud infrastructure.
Key Responsibilities
- A Bachelor's degree in Computer...
-
Senior Site Reliability Engineer
3 weeks ago
Toronto, Ontario, Canada Northbridge Financial Corporation Full time
Site Reliability Engineer Role Overview
The Senior Site Reliability Engineer at Northbridge Financial Corporation is responsible for overseeing the creation and implementation of Service Level Objectives (SLOs). This role involves handling service reliability solutions and processes of increasing complexity, as well as mentoring and leading less experienced...
-
AWS Site Reliability Engineer
1 month ago
Old Toronto, Ontario, Canada TD Bank Full time
Job Title: AWS Site Reliability Engineer
TD Bank is seeking a highly skilled AWS Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems.
Key Responsibilities:
- Design and implement scalable and reliable cloud-based systems using AWS...
-
Senior Cloud Reliability Engineer
4 weeks ago
Toronto, Ontario, Canada Thomson Reuters Full time
About the Role
We are seeking an experienced Senior Site Reliability Engineer to join our Shared Capabilities, Service Reliability and Operation team in Toronto. As a key member of our team, you will be responsible for implementing site reliability engineering and DevOps best practices, building and maintaining monitoring for all aspects of infrastructure,...
-
Site Reliability Engineering Manager
3 weeks ago
Toronto, Ontario, Canada The Home Depot Canada Full time
Unlock Your Potential at The Home Depot Canada
As a Site Reliability Engineering Manager, you will lead a team of Site Reliability Engineers to ensure the reliability, performance, and operational support of our eCommerce systems, with a focus on Google Cloud Platform (GCP) environments.
Key Responsibilities:
- Lead and mentor a team of Site Reliability Engineers...
-
Cloud Reliability Engineering Manager
3 weeks ago
Toronto, Ontario, Canada The Home Depot Canada Full time
About The Home Depot Canada
The Home Depot Canada is a leading retailer of home improvement products and services, committed to delivering exceptional customer experiences and driving business growth. We are seeking a highly skilled Cloud Reliability Engineering Manager to join our team and lead our Site Reliability Engineers in ensuring the reliability,...
-
Senior Cloud Reliability Engineer
3 weeks ago
Old Toronto, Ontario, Canada https:www.energyjobline.comsitemap Full time
Product: Global Platform Engineering
Your Role:
As a key member of our Global Platform Engineering team, you will be responsible for overseeing a team of Site Reliability Engineers and ensuring the smooth operation of our cloud-based infrastructure.
- Lead a team of Site Reliability Engineers to ensure the reliability and scalability of our cloud-based...