Reliability Engineer

2 weeks ago

Canada Chelsea Avondale Full time

Chelsea Avondale is the world’s most cutting-edge home insurance group. We have developed sophisticated risk modeling and insurance pricing technologies for home insurance and deploy that technology through our own insurance company. Our team consists of some of the brightest minds in insurance, software development, finance, and operations. Our group includes our scientific research & engineering division (Skynet Software) and Canadian property & casualty insurance company (Max Insurance). Together, our group is transforming the Canadian and global insurance landscape.

JOB DESCRIPTION:

Chelsea Avondale is looking for a Reliability Engineer with a background in infrastructure system engineering to support the growth of a secure, dynamic, and scalable IT environment across the group. Our business is going through rapid growth, and it is essential that our systems infrastructure keeps pace. The Reliability Engineer will play a crucial role in ensuring the reliability, scalability, and performance of our systems, enabling the continuous delivery of our products and services. They will be accountable for ensuring overall availability, as well as enhancing Engineering teams’ capability to design, build and operate robust systems at scale.

This position is ideal for candidates who have an extraordinary sense of responsibility and are not afraid to roll up their sleeves. Our IT environment is not toolkit rich. What we are NOT looking for is someone who wants to take months installing a large number of tools from their preferred toolkit. We take pride in maintaining a fundamental stack of technologies, much of it in Python, and we are looking for someone who shares this mentality.

If you are someone who thrives in a high-performance culture and is eager for work that is both challenging and constantly evolving, this role is perfect for you. We strongly encourage and help our team members to improve and enhance their personal skill sets within our organization. On your journey with us, you will have the ability to learn and grow rapidly, taking on more responsibilities.

RESPONSIBILITIES:

- Play an integral role in the design, implementation & maintenance of AWS cloud server environments.
- Design, implement, and maintain robust monitoring and alerting systems in Python to detect and respond to incidents in a timely manner.
- Collaborate with cross-functional teams to enhance reliability of our systems and services.
- Design, configure, deploy, and maintain infrastructure on AWS using best practices and industry standards.
- Conduct post-incident analysis to identify root causes, implement corrective actions, and prevent similar issues in the future.
- Assist in capacity planning & optimize services to provide scalable, stable, & secure systems.
- Implement high availability and disaster recovery solutions to provide data redundancy, resilience, and data loss prevention.
- Assist with the implementation of select network engineering solutions including firewalls, load balancing, VPNs & LANs, where necessary.

PREFERRED EXPERIENCE & SKILLS:

- Bachelor’s degree in Computer Science, Computer Engineering, Electrical Engineering, or related field.
- 1+ years of experience as a Reliability Engineer or similar role, with a focus on maintaining high-performance, scalable, and reliable web systems. We also encourage highly motivated new grads to apply.
- Hands-on experience with AWS cloud environments – instances, CloudWatch, EFS, etc.
- Proficiency in Python is a must.
- Experience using NGINX for reverse proxy, load balancing, and caching.
- Experience with Unix / Windows server configuration, administration, performance tuning and troubleshooting.
- Working knowledge of web development processes (source control, deployment, etc.).
- Experience load testing, pen testing, and providing security for cloud resources is beneficial.

Reliability Engineer

2 weeks ago

Canada Graymont Full time

Full-Time, Permanent. Canada - Remote. Graymont is seeking a Reliability Engineer to join our team and provide guidance and support to our network of facilities across North America to ensure the highest levels of performance and reliability for our plant equipment. Reporting to the Maintenance...
Reliability Engineer

1 week ago

Canada Graymont Limited Full time

Function: Technical Services, incl. engineering/geology/mining/remote ops/central lab. Any province in which Graymont has operations. Graymont is seeking a Reliability Engineer to join our team and provide guidance and support to our network of facilities across North America to ensure the highest levels of performance and reliability for our plant equipment....
Reliability Engineer

3 days ago

Remote, Canada Chelsea Avondale Full time

Chelsea Avondale is the world's most cutting-edge home insurance group. We have developed sophisticated risk modeling and insurance pricing technologies for home insurance and deploy that technology through our own insurance company. Our team consists of some of the brightest minds in insurance, software development, finance, and operations. Our group...
Site Reliability Engineer

6 days ago

Canada Dayforce Full time

About the Opportunity: As a Site Reliability Engineer at Dayforce, you will be part of a pioneering team responsible for ensuring our industry-leading HCM platform delivers exceptional scalability, availability, and reliability. Dayforce is a global HCM technology company with operations across North America, EMEA, and APJ, and our award-winning cloud platform...
Area Reliability Engineer

3 weeks ago

Canada AV Group Full time

Posted Tuesday, November 11, 2025 at 4:00 AM. Area Reliability Engineer – Maintenance & Engineering. AV Group NB Inc. is an exciting and innovative company operating two pulp mills in NB. Both mills produce dissolving grade pulp for the unique global market of viscose fibre. AV Group NB Inc. is part of the Aditya Birla Group (ABG), a US$60 billion...
Reliability Engineer

2 weeks ago

Canada Mondelez International Full time

- Manages change/transformation amongst the Operating teams in the implementation of the IL6S (Integrated Lean 6 Sigma) phase journey, FoF (Factory of Future) Line-centric organization & roles (AM: Autonomous Maintenance, PM: Progressive Maintenance, and an integrated 6-star model within operating Line teams) to progress into self-sufficient...
Site Reliability Engineer

2 weeks ago

Canada Dayforce Full time

Base pay range: CA$67,700.00/yr - CA$120,900.00/yr. Dayforce is a global human capital management (HCM) company headquartered in Toronto, Ontario, and Minneapolis, Minnesota, with operations across North America, Europe, Middle East, Africa (EMEA), and the Asia Pacific Japan (APJ) region. Our award‑winning Cloud HCM platform offers a unified solution...
Regional Reliability Engineer

7 days ago

Canada Carmeuse Full time

Responsibilities: Analyzing equipment performance, failure data, and corrective maintenance history to develop and deploy engineering solutions, improved maintenance strategies, preventative and predictive maintenance optimization, and other reliability techniques. Conducting analyses and investigations to determine relative reliability with regard to factors...
Performance Reliability Engineer

2 weeks ago

Canada Cerebras Full time

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer‑scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry‑leading training and inference speeds and empowers machine learning users...
Senior Site Reliability Engineer

2 weeks ago

Canada Thinkific Full time

Are you an experienced Site Reliability Engineer looking for a new challenge? We’re looking for a Senior Site Reliability Engineer to join us at Thinkific. We’re looking for a Senior Site Reliability Engineer...
