Reliability Systems Engineer

1 month ago


Old Toronto, Ontario, Canada Chelsea Avondale Full time
Job Title: Asset Reliability Engineer

At Chelsea Avondale, we're pushing the boundaries of home insurance innovation. Our team of experts has developed cutting-edge risk modeling and insurance pricing technologies, which we deploy through our own insurance company.

We're a group of talented individuals from diverse backgrounds, including insurance, software development, finance, and operations. Our team includes Skynet Software, our scientific research & engineering division, and Max Insurance, our Canadian property & casualty insurance company.

We're transforming the Canadian and global insurance landscape, and we need a Reliability Engineer to support our growth. This role is crucial in ensuring our systems infrastructure keeps pace with our rapid expansion.

Key Responsibilities:
  • Design, implement, and maintain AWS cloud server environments to ensure high availability and scalability.
  • Develop robust monitoring and alerting systems in Python to detect and respond to incidents promptly.
  • Collaborate with cross-functional teams to enhance the reliability of our systems and services.
  • Design, configure, deploy, and maintain infrastructure on AWS using best practices and industry standards.
  • Conduct post-incident analysis to identify root causes, implement corrective actions, and prevent similar issues in the future.
  • Assist in capacity planning and optimize services to provide scalable, stable, and secure systems.
  • Implement high availability and disaster recovery solutions to provide data redundancy, resilience, and data loss prevention.
  • Assist with the implementation of select network engineering solutions, including firewalls, load balancing, VPNs, and LANs, where necessary.
Requirements:
  • Bachelor's degree in Computer Science, Computer Engineering, Electrical Engineering, or a related field.
  • 1+ years of experience as a Reliability Engineer or a similar role, with a focus on maintaining high-performance, scalable, and reliable web systems.
  • Hands-on experience with AWS cloud environments, including instances, CloudWatch, EFS, etc.
  • Proficiency in Python is a must.
  • Experience using NGINX for reverse proxy, load balancing, and caching.
  • Experience with Unix/Windows server configuration, administration, performance tuning, and troubleshooting.
  • Working knowledge of web technologies, including web servers, DNS, SSL, and browsers.
  • Working knowledge of web development processes, including source control, deployment, etc.
  • Experience with load testing, pen testing, and providing security for cloud resources is beneficial.


  • Toronto, Ontario, Canada Lorven Technologies Full time

    Job Title : Reliability Systems Engineer Location : Remote Duration : Long term A Bachelor's degree in Computer Science or related technical field, or equivalent practical experience. Advanced knowledge of SRE practices and technologies including Azure, Linux, and scripting languages. Expertise in various SRE tools such as Ansible, Azure Automation,...


  • Toronto, Ontario, Canada Scotiabank Full time

    As a key member of our team at Scotiabank, you will play a critical role in ensuring the reliability and performance of our production systems.Key Responsibilities:Contribute to in-depth data analysis to gauge service trends and drive improvements to production systems.Collaborate closely with SREs, Development, and Operations teams to assist in...

  • Reliability Engineer

    4 weeks ago


    Toronto, Ontario, Canada Scotiabank Full time

    About the Role:We are seeking a highly skilled Systems Reliability Engineer to join our team at Scotiabank. As a key member of our Systems Reliability Office, you will be responsible for ensuring the stability and reliability of our technology portfolio.Key Responsibilities:Champion a customer-focused culture to deepen client relationships and leverage...


  • Toronto, Ontario, Canada Vantage Full time

    About the Role:We are seeking a highly skilled Senior Site Reliability Engineer to join our team at Vantage. As a key member of our engineering team, you will play a pivotal role in ensuring the seamless operation of our large-scale, distributed systems. Your expertise in software and systems engineering will be instrumental in building, maintaining, and...


  • Toronto, Ontario, Canada Interac Corp. Full time

    Senior System Reliability EngineerWe are seeking a skilled Senior System Reliability Engineer to join our team at Interac Corp. in Canada.About the Role:This is an exciting opportunity to work on high-performance payment systems, focusing on Site (Application) Reliability Engineering activities, including proactive monitoring, responding to alerts and...


  • Old Toronto, Ontario, Canada Thomson Reuters Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineering Specialist to join our team at Thomson Reuters. As a Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining scalable systems and services that meet the needs of our customers.Key ResponsibilitiesDesign and implement scalable systems and...


  • Old Toronto, Ontario, Canada Sentry Full time

    About the RoleSentry is on a mission to help developers write better software faster. As a Cloud Reliability Engineer, you will play a critical role in ensuring the uptime and reliability of our hosted platform.You will work with a multitude of technologies, including cloud providers, to architect and automate services and systems to meet the demand of...


  • Toronto, Ontario, Canada Safran Landing Systems Full time

    Job Description Assist in the development and certification of the landing gear system, including hydro-mechanical, electrical, and control systems designed per software and complex hardware (DO-178/DO-254). Liaise with customers and airworthiness authorities on matters pertaining to certification and system development. Define requirements applicable to the...

  • Reliability Engineer

    4 weeks ago


    Toronto, Ontario, Canada Scotiabank Full time

    About the Role:We are seeking a highly skilled System Reliability Engineer to join our team at Scotiabank. As a key member of our Systems Reliability Office, you will play a critical role in ensuring the stability and reliability of our technology portfolio.Key Responsibilities:Champion a customer-focused culture to deepen client relationships and leverage...


  • Toronto, Ontario, Canada Safran Landing Systems Full time

    Job DescriptionWe are seeking a highly skilled Senior Systems Engineer to join our team at Safran Landing Systems. As a key member of our team, you will be responsible for the development and certification of the landing gear system, including hydro-mechanical, electrical, and control systems designed per software and complex hardware (DO-178/DO-254).Key...


  • Toronto, Ontario, Canada Scotiabank Full time

    About the RoleAs a key member of the Systems Reliability Office, you will work collaboratively with various teams to deliver high-quality results. Your contributions will directly impact the success of our stakeholders.Key ResponsibilitiesContribute to cross-functional teams to drive significant deliverables.Work closely with stakeholders to understand and...


  • Toronto, Ontario, Canada Safran Landing Systems Full time

    Job DescriptionAs a Senior Systems Engineer at Safran Landing Systems, you will play a key role in the development and certification of the Landing Gear System. This includes working on hydro-mechanical, electrical, and control systems designed per Software and Complex Hardware (DO-178/DO-254). You will liaise with customers and airworthiness authorities on...

  • Senior Data Engineer

    1 month ago


    Old Toronto, Ontario, Canada Data Engineer Jobs Full time

    About This RoleWe are seeking a highly skilled Senior Data Engineer to join our Analytics Engineering team. As a key member of this team, you will be responsible for designing and building scalable data models and ETL pipelines to support business decisions.Key Responsibilities:Collaborate with data scientists to design data models and answer questions.Work...


  • Old Toronto, Ontario, Canada Cerebras Systems Full time

    About the RoleCerebras Systems is revolutionizing the field of artificial intelligence with its cutting-edge technology. As an ML Integration and Ops Engineer, you will play a crucial role in bringing together software and hardware components to make large-scale LLM model training simple and easy to use.Key ResponsibilitiesDrive technical projects involving...


  • Toronto, Ontario, Canada The Toronto-Dominion Bank (Canada) Full time

    Job SummaryWe are seeking a highly skilled Site Reliability Engineer to join our team at The Toronto-Dominion Bank (Canada). As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our systems and applications.Key ResponsibilitiesProvide technical leadership and expertise in designing and...


  • Old Toronto, Ontario, Canada Teranet Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our DevOps team at Teranet. As a key member of our team, you will be responsible for applying software engineering principles to infrastructure and operations problems, with the goal of creating highly automated, scalable, and reliable systems.Key ResponsibilitiesDesign and...


  • Toronto, Ontario, Canada Criteo Full time

    About the Role:This is a challenging opportunity for an experienced engineer to join Criteo's PRE team as a Site Reliability Engineer. The role involves working closely with product engineering to improve the reliability of our apps, systems, and pipelines, assessing where optimization is needed most, and telling stories with meaningful monitoring.Key...


  • Old Toronto, Ontario, Canada Thomson Reuters Full time

    Site Reliability Engineer (Contract)Contract (5 months 29 days)Closed OpportunityThomson Reuters is seeking a skilled Site Reliability Engineer to join our Service Management Organization.The ideal candidate will have experience in IT Service Management, working with cloud providers, software development, and technology infrastructure.As a Site Reliability...


  • Toronto, Ontario, Canada Lorven Technologies Full time

    Job Title : Reliability Systems SpecialistLocation : RemoteDuration : Long termA Bachelor's degree in Computer Science or related technical field, or equivalent practical experience.Advanced knowledge of reliability engineering practices and technologies.Hands-on experience in reliability tools (Ansible, Azure Automation, Catchpoint).Azure, Linux.Dynatrace,...


  • Old Toronto, Ontario, Canada TD Bank Full time

    Job Summary:We are seeking a highly skilled AWS Cloud Reliability Engineer to join our team at TD Bank. As a key member of our technology organization, you will be responsible for designing and operating large, complex systems that meet the highest standards of reliability, scalability, and efficiency.Key Responsibilities:Provide technical leadership to...