Reliability Engineer

1 week ago


Vancouver, British Columbia, Canada Perlego Full time
About the Role

We are seeking an experienced Site Reliability Engineer to join our cloud operations team at Perlego. This is a unique opportunity to work on a high-profile platform, ensuring the availability and performance of our services. As a Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining our cloud infrastructure, ensuring seamless user experiences and minimizing downtime. Our team operates across multiple time zones, and you will be part of a global team that supports cloud infrastructure and platform initiatives.

Key Responsibilities
  • Design and implement scalable, secure, and reliable cloud infrastructure using AWS services.
  • Monitor and manage platform activity using tools like Datadog, Prometheus, Grafana, or AWS CloudWatch.
  • Respond quickly to alerts and incidents, independently resolving issues and ensuring service uptime during off-peak hours.
  • Collaborate with cross-functional teams to implement platform improvements and ensure effective backup, recovery, and disaster recovery strategies.
  • Automate manual processes to reduce human error and improve efficiency, and continuously enhance monitoring systems to ensure robust early detection and resolution capabilities.
Requirements
  • Experience in Site Reliability Engineering, DevOps, or a similar field.
  • Strong experience with AWS services.
  • Expertise in using monitoring tools (Prometheus, Grafana, CloudWatch) for real-time platform performance insights.
  • Hands-on experience with CI/CD pipeline management for deploying containerized (Docker) and serverless applications.
  • Proficiency in Linux-based operating systems and shell scripting.
  • Familiarity with Infrastructure as Code tools (Terraform, CloudFormation).
  • Experience with incident management, troubleshooting, and platform recovery in high-pressure environments.
  • Strong communication skills with a proven ability to work both independently and collaboratively across time zones.
What We Offer

We offer a competitive salary, share options, and a comprehensive benefits package, including private medical insurance, a personal L&D budget, unlimited coaching opportunities, and a generous holiday allowance. Our company culture champions self-empowerment, personal development, direct communication, and mutual support.

We are an equal opportunity employer and value diversity of thought and background. We are actively building a diverse team, so we strongly encourage applications from people of colour, the LGBTQ+ community, people with disabilities, neurodivergent people, parents, carers, and people from lower socio-economic backgrounds. To enable an equitable experience for all and give you the best chance of success, if you have any specific requirements for any stage of the interview process, please let us know in your application.



  • Vancouver, British Columbia, Canada Azad Technology Partners Full time

    Site Reliability Engineer/DevOps Engineer. AZAD Technology Partners is seeking a skilled Site Reliability Engineer/DevOps Engineer to join our team in Chicago, IL. About the Role: We are looking for a candidate with extensive experience in Site Reliability Engineering principles and culture. The ideal candidate will have a strong background in designing and...


  • Vancouver, British Columbia, Canada Azad Technology Partners Full time

    About the Role: AZAD Technology Partners is seeking a skilled Site Reliability Engineer/DevOps Engineer to join our team in Chicago, IL. As a key member of our team, you will be responsible for ensuring the reliability and performance of our production environment. Key Responsibilities: Design and implement observability solutions for comprehensive monitoring and...


  • Vancouver, British Columbia, Canada Halliwell Consulting Corp Full time

    Job Title: Reliability Engineer Specialist. About the Role: We are seeking a skilled Reliability Engineer Specialist to join our team at Halliwell Consulting Corp. As a key member of our team, you will be responsible for supporting and implementing our Root Cause analysis program. Key Responsibilities: Analyze equipment histories to identify and address specific...


  • Vancouver, British Columbia, Canada Halliwell Consulting Corp Full time

    Job Title: Reliability Engineer Specialist. Job Summary: We are seeking a skilled Reliability Engineer Specialist to join our team at Halliwell Consulting Corp. The successful candidate will be responsible for supporting and implementing the Root Cause analysis program, analyzing equipment histories to identify and address specific repetitive failures, and...


  • Vancouver, British Columbia, Canada Royal Bank of Canada Full time

    Job Summary: The Application Support SRE will be responsible for the support, development, and implementation of Site Reliability Engineering solutions for all applications within Royal Bank of Canada. This team will work collaboratively with teams across several lines of business to ensure the stability and scalability of our systems. Key...


  • Vancouver, British Columbia, Canada Azad Technology Partners Full time

    Site Reliability Engineer/DevOps Engineer. AZAD Technology Partners is seeking a skilled Site Reliability Engineer/DevOps Engineer for a full-time, W2 Contract position based in Chicago, IL. About the Role: We are looking for a candidate who has extensive experience in Site Reliability Engineering principles and culture, and who enjoys collaborating with a...


  • Vancouver, British Columbia, Canada Electronic Arts Full time

    Job Title: Site Reliability Engineer. Electronic Arts is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based systems. Responsibilities: Collaborate with development teams to identify and resolve build issues...


  • Vancouver, British Columbia, Canada https:www.energyjobline.comsitemap Full time

    Job Summary: We are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud infrastructure. You will work closely with our engineering teams to design, develop, and maintain our cloud-based systems. Key Responsibilities: Design and...


  • Vancouver, British Columbia, Canada Microsoft Full time

    Unlock Your Potential as a Site Reliability Engineer II at Microsoft. Are you a skilled engineer looking for a challenging role that combines technical expertise with business acumen? Do you want to work on large-scale projects that drive innovation and customer satisfaction? We're seeking a talented Site Reliability Engineer II to join our ES365 team, where...


  • Vancouver, British Columbia, Canada Tbwa ChiatDay Inc Full time

    Unlock Your Potential as a Site Reliability Engineer. At Visier, we're on a mission to revolutionize the way organizations make decisions by harnessing the power of people analytics. As a Site Reliability Engineer, you'll play a critical role in ensuring the smooth operation of our systems, leveraging cutting-edge technology to drive business outcomes. Key...


  • Vancouver, British Columbia, Canada Electronic Arts Full time

    Job Title: Site Reliability Engineer. We are seeking a highly skilled Site Reliability Engineer to join our team at Electronic Arts. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based systems. Responsibilities: Collaborate with development teams to identify and resolve build...


  • Vancouver, British Columbia, Canada Royal Bank of Canada Full time

    Job Summary: The Director of Site Reliability Engineering will be responsible for the strategic direction, design, development, and implementation of Site Reliability Engineering solutions for all applications across a line of business within Royal Bank of Canada. Key Responsibilities: Develop and maintain a comprehensive SRE strategy aligned with business...


  • Vancouver, British Columbia, Canada Tbwa ChiatDay Inc Full time

    Unlock Your Potential as a Site Reliability Engineer. At Visier, we're on a mission to revolutionize the way organizations make decisions by harnessing the power of people analytics. As a Site Reliability Engineer, you'll play a critical role in helping us achieve this vision. What You'll Do: Collaborate with our DevOps scrum team to develop and implement...