AWS Cloud Reliability Specialist

3 weeks ago


Old Toronto, Ontario, Canada Reperio Human Capital Full time
Cloud Reliability Engineer

Type: Permanent, Full-time

We're seeking experienced Cloud Reliability Engineers who excel at ensuring the reliability and scalability of production systems, and possess extensive experience with monitoring and automation tools.

Key Responsibilities:

  • Design, implement, and maintain monitoring and alerting systems to ensure the reliability and availability of production systems.
  • Collaborate with development teams to improve system architecture and deployment processes, ensuring seamless integration and scalability.
  • Conduct root cause analysis of incidents and implement corrective measures to prevent future occurrences.
  • Develop and maintain automation scripts using tools like Ansible and Terraform to streamline processes and improve efficiency.
  • Stay up-to-date with the latest cloud technologies and best practices to ensure the company remains competitive and secure.

Requirements:

  • 5+ years of experience as a Cloud Reliability Engineer or similar role.
  • Expertise in monitoring tools (Prometheus, Grafana) and automation tools (Ansible, Terraform).
  • Strong understanding of networking and security best practices.
  • Experience with both Windows and Linux operating systems.

About Reperio Human Capital:

We're a leading provider of human capital solutions, dedicated to helping businesses find the best talent to drive their success. Our team of experts is passionate about matching the right candidates with the right opportunities, and we're committed to delivering exceptional results.



  • Old Toronto, Ontario, Canada eTeam Full time

    Job OverviewWe are seeking a highly skilled AWS Site Reliability Engineer to join our team at eTeam. As a key member of our cloud operations team, you will be responsible for ensuring the reliability and performance of our cloud-based systems.Key ResponsibilitiesDefining and measuring reliability goals, including SLIs, SLOs, and error budgets for user...


  • Old Toronto, Ontario, Canada eTeam Full time

    Job OverviewWe are seeking a highly skilled AWS Site Reliability Engineer to join our team at eTeam. As a key member of our cloud operations team, you will be responsible for ensuring the reliability and performance of our cloud-based systems.Key ResponsibilitiesDefining and measuring reliability goals, including SLIs, SLOs, and error budgets for user...


  • Old Toronto, Ontario, Canada Rogers Communications Full time

    Job Title: AWS Site Reliability EngineerWe are seeking a highly skilled AWS Site Reliability Engineer to join our team at Rogers Sports & Media. As a key player in delivering Canadian audiences a diverse content portfolio, you will be at the forefront of technology innovation in the media space.Key Responsibilities:Design, develop, and implement monitoring...


  • Old Toronto, Ontario, Canada Rogers Communications Full time

    Job Title: AWS Site Reliability EngineerWe are seeking a highly skilled AWS Site Reliability Engineer to join our team at Rogers Sports & Media. As a key player in delivering Canadian audiences a diverse content portfolio, you will be at the forefront of technology innovation in the media space.Key Responsibilities:Design, develop, and implement monitoring...


  • Old Toronto, Ontario, Canada eTeam Full time

    Job DescriptionWe are seeking a skilled AWS Site Reliability Engineer to join our team at eTeam. As a key member of our cloud operations team, you will be responsible for defining and measuring reliability goals, designing for and implementing observability, and defining, testing, and running an incident management process.Key responsibilities...


  • Old Toronto, Ontario, Canada eTeam Full time

    Job DescriptionWe are seeking a skilled AWS Site Reliability Engineer to join our team at eTeam. As a key member of our cloud operations team, you will be responsible for defining and measuring reliability goals, designing for and implementing observability, and defining, testing, and running an incident management process.Key responsibilities...


  • Old Toronto, Ontario, Canada Guidewire Full time

    Job SummaryWe are seeking a highly skilled AWS Site Reliability Engineer to join our team at Guidewire. As a key member of our engineering team, you will be responsible for designing, implementing, and maintaining scalable and highly available cloud infrastructure on Amazon Web Services (AWS).Key ResponsibilitiesDesign and implement scalable and highly...


  • Old Toronto, Ontario, Canada Guidewire Full time

    Job SummaryWe are seeking a highly skilled AWS Site Reliability Engineer to join our team at Guidewire. As a key member of our engineering team, you will be responsible for designing, implementing, and maintaining scalable and highly available cloud infrastructure on Amazon Web Services (AWS).Key ResponsibilitiesDesign and implement scalable and highly...


  • Old Toronto, Ontario, Canada Rogers Full time

    Cloud Reliability EngineerRogers Sports & Media is seeking a Cloud Reliability Engineer to build and maintain a robust monitoring system for its cloud and on-prem systems. As a key player in delivering Canadian audiences a diverse content portfolio, from the thrilling Stanley Cup playoffs to the latest Bachelor episode, you'll be at the forefront of...


  • Old Toronto, Ontario, Canada Rogers Full time

    Cloud Reliability EngineerRogers Sports & Media is seeking a Cloud Reliability Engineer to build and maintain a robust monitoring system for its cloud and on-prem systems. As a key player in delivering Canadian audiences a diverse content portfolio, from the thrilling Stanley Cup playoffs to the latest Bachelor episode, you'll be at the forefront of...


  • Old Toronto, Ontario, Canada Guidewire Full time

    Job SummaryWe are seeking a highly skilled AWS Site Reliability Engineer to join our team at Guidewire. As a key member of our cloud infrastructure team, you will be responsible for designing, implementing, and maintaining scalable and highly available cloud-based systems.Key ResponsibilitiesDesign and implement cloud-based infrastructure using AWS services...


  • Old Toronto, Ontario, Canada Guidewire Full time

    Job SummaryWe are seeking a highly skilled AWS Site Reliability Engineer to join our team at Guidewire. As a key member of our cloud infrastructure team, you will be responsible for designing, implementing, and maintaining scalable and highly available cloud-based systems.Key ResponsibilitiesDesign and implement cloud-based infrastructure using AWS services...


  • Old Toronto, Ontario, Canada TD Bank Full time

    Job Title: AWS Site Reliability EngineerTD Bank is seeking a highly skilled AWS Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems.Key Responsibilities:Design and implement scalable and reliable cloud-based systems using AWS...


  • Old Toronto, Ontario, Canada TD Bank Full time

    Job Title: AWS Site Reliability EngineerTD Bank is seeking a highly skilled AWS Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems.Key Responsibilities:Design and implement scalable and reliable cloud-based systems using AWS...


  • Old Toronto, Ontario, Canada TD Bank Full time

    Job Title: AWS Site Reliability EngineerTD Bank is seeking a highly skilled AWS Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems.Key Responsibilities:Design and implement scalable and reliable cloud-based systems using AWS...


  • Old Toronto, Ontario, Canada TD Bank Full time

    Job Title: AWS Site Reliability EngineerTD Bank is seeking a highly skilled AWS Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems.Key Responsibilities:Design and implement scalable and reliable cloud-based systems using AWS...


  • Old Toronto, Ontario, Canada CB Canada Full time

    Site Reliability EngineerCB Canada is seeking a skilled Site Reliability Engineer to join our team.Job SummaryWe are looking for a highly motivated and experienced Site Reliability Engineer to design, build, and maintain modern cloud infrastructure and data pipelines. The ideal candidate will have a strong background in cloud computing, automation, and...


  • Old Toronto, Ontario, Canada CB Canada Full time

    Site Reliability EngineerCB Canada is seeking a skilled Site Reliability Engineer to join our team.Job SummaryWe are looking for a highly motivated and experienced Site Reliability Engineer to design, build, and maintain modern cloud infrastructure and data pipelines. The ideal candidate will have a strong background in cloud computing, automation, and...


  • Old Toronto, Ontario, Canada Reperio Human Capital Full time

    Cloud Reliability EngineerType: Permanent, Full-timeWe're seeking experienced Cloud Reliability Engineers who excel at ensuring the reliability and scalability of production systems, and possess extensive experience with monitoring and automation tools.Key Responsibilities:Design, implement, and maintain monitoring and alerting systems to ensure the...


  • Old Toronto, Ontario, Canada Reperio Human Capital Full time

    Cloud Reliability EngineerType: Permanent, Full-timeWe're seeking experienced Cloud Reliability Engineers who excel at ensuring the reliability and scalability of production systems, and possess extensive experience with monitoring and automation tools.Key Responsibilities:Design, implement, and maintain monitoring and alerting systems to ensure the...