Lead Site Reliability Engineering(SRE) @ Canada Remote

3 weeks ago


Canada Ampstek Full time

Lead Site Reliability Engineering(SRE) @ Canada Remote

Min exp : 10+ Years

Location: Canada (Remote)

Note : Candidates should be located in Canada ONLY.


Job Description:-

Candidate will go through a client interview and we are looking for SRE Lead who is already good at DevOps/Cloud (AWS) with a strong background in implementing Site Reliability Engineering (SRE) practices. The ideal candidate should demonstrate expertise in the following areas: (In this priority Order)


1. SRE Implementations: Look for candidates who have experience implementing SRE principles, including the establishment of Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Error Budgets to ensure system reliability and availability.

2. Observability: Search for keywords related to observability, including familiarity with concepts such as full-stack observability and distributed tracing,

3. Tool Proficiency: Datadog, CloudWatch, Synthetic Monitoring tools

4. Building SRE Culture: Evaluate candidates based on their ability to develop SRE frameworks within organizations, such as creating SRE charters and fostering a culture of reliability and accountability across teams.

5. Automation: Look for candidates with extensive experience in automation, including the automation of repetitive tasks, infrastructure provisioning, and deployment processes, to streamline operations and enhance efficiency.

6. Chaos Engineering: Consider candidates who have experience in Chaos Engineering practices and related tools, demonstrating their ability to proactively identify system weaknesses and improve resilience through controlled experiments.

• Lead and mentor a team of SREs to ensure operational excellence and maximize the reliability and availability of client systems.

• Minimum 10 years of work experience in DevOps/SRE, including leadership roles.

• Architect and design highly scalable and available infrastructure solutions, integrating best practices in reliability engineering and automation.

• Collaborate with cross-functional teams (DevOps, Development, IT) to implement SRE principles throughout the software development life cycle.

• Establish and manage Service Level Objectives (SLOs) and Service Level Indicators (SLIs) for critical services, monitoring and maintaining performance against defined targets.

• Implement and enhance observability, alerting, and incident response processes to proactively address issues and minimize downtime.

• Drive continuous improvement initiatives, identifying bottlenecks and optimizing within the infrastructure and application stack.

• Develop and maintain documentation related to system architecture, configuration, and procedures.

• Stay current with industry trends, recommending and adopting new tools and practices to enhance system reliability.

• Qualifications:

• Strong background in designing and implementing highly available and scalable infrastructure.

• Proficiency in scripting and automation using Python or Shell

• Experience with container orchestration platforms, serverless architectures, CI/CD pipelines, and IaC implementations. (Ansible & Terraform)

• Experience with Observability tools (preferred: Datadog, CloudWatch).

• In-depth knowledge of cloud computing platforms (preferred: AWS).

• Solid understanding of SRE/DevOps principles and practices.

• Excellent problem-solving skills with the ability to troubleshoot complex issues in production environments.

• Strong communication and leadership skills, fostering effective collaboration with cross-functional teams.

• Relevant certifications in SRE, DevOps, Cloud, etc., are a plus.

Thank Youu



  • Canada Cribl, Inc. Full time

    As a remote-first company we believe in empowering our employees to do their best work, wherever they are.  As the data engine for IT and Security many of the biggest names in the most demanding industries trust Cribl to solve their most pressing data needs. Cribl Inc is seeking a Senior Site Reliability Engineer to join our mission to unlock the value of...


  • Canada Boundlessfellows Full time

    Clover is reinventing health insurance by working to keep people healthier. We value diversity — in backgrounds and in experiences. Healthcare is a universal concern, and we need people from all backgrounds and swaths of life to help build the future of healthcare. Clover's engineering team is empathetic, caring, and supportive. We are deliberate and...


  • Canada Boundlessfellows Full time

    Clover is reinventing health insurance by working to keep people healthier. We value diversity — in backgrounds and in experiences. Healthcare is a universal concern, and we need people from all backgrounds and swaths of life to help build the future of healthcare. Clover's engineering team is empathetic, caring, and supportive. We are deliberate and...


  • Canada Dapper Labs Full time

    We’re looking for a Senior Site Reliability Engineer who wants to be at the technical core of an organization that’s completely reshaping how distributed applications on blockchains can reach massive audiences. You will join a Site Reliability Engineering team that has the ability to architect, build, and iterate on resilient, scalable systems. SRE...

  • DevOps Engineer

    7 days ago


    Canada Flowmentum, Inc. Full time

    Job Title: DevOps Engineer Company Overview: Join our pioneering team at the forefront of technology and innovation. We are seeking talented Full Stack Engineers with a strong background in System Architecture and a specialization in Site Reliability Engineering (SRE). This role involves engaging with complex DevOps solutions across our SaaS products...

  • Engineering Team Lead

    3 weeks ago


    Canada StudentUniverse Full time

    Full time 1 Remote 1 Brand ~ WhereTo - Engineering Team Lead - Canada Brand: WhereTo Full time, Remote Location: Virtual - Canada Categories: Information & Technology WhereTo provides an AI-powered travel platform for corporate travel. Their platform uses machine learning algorithms to recommend personalized travel options based on a...


  • Canada Zapier Full time

    With millions of people benefiting from automation, we're excited to keep expanding our product and team. As we continue to scale our product and grow our team, we’re looking for an experienced Engineering Manager. This role will lead a new team of Site Reliability Engineers focused on enabling developers to build and run products that are not only...


  • Canada Rally Engineering Full time

    E&I Engineer will report to the Lead Project Engineer at a client based Western Canada site. This role will be a Project Manager for several projects, such as Safety Instrument System – procurement of SIS equipment, replacement of electrical protection relays and upgrade of low voltage switchgear. The position is in a secondment role working directly...


  • Canada Page Mechanical Group, Inc. Full time

    ​​​​​​ Lead Software Engineer - Remote - Canada - Permanent POSITION: Join our product development team to provide thought leadership and innovation. This role offers the opportunity to develop a deep understanding of our business and work closely with customers, sales, professional services, and product management to architect, design, and...

  • Reliability Engineer

    4 weeks ago


    Canada The Mosaic Company Full time

    Reliability Engineer (all levels) page is loaded Reliability Engineer (all levels) Apply locations CA-Colonsay, SK time type Full time posted on Posted 2 Days Ago job requisition id 52419 The Mosaic Company (NYSE: MOS) is the world’s leading integrated producer of concentrated phosphate and potash—two of the three most important nutrients in...


  • Canada Catena Media plc Full time

    As a Senior Site Reliability Engineer at Catena, you will play a crucial role in maintaining optimal system performance and upholding high standards of availability, security, and resilience. Working at the intersection of software development and operations, you will collaborate closely with cross-functional teams to deliver high-quality services to our...

  • Site Manager

    3 weeks ago


    Canada ICDS (UK) Ltd Full time

    Site Manager - Civil Engineering - Canada, Alberta and Ontario Are you interested in relocating and working in Canada? If so, then we want to hear from you! ICDS have a number of vacancies for experienced Construction Site Managers/Site Agents with one of Canada’s leading Civil Engineering Contractors (part of a circa $5billion turnover group of...

  • Senior C++ Engineer

    2 weeks ago


    Canada Devengine Full time

    Senior C++ Engineer (Remote, anywhere in Canada) Remote - Canada | Permanent / Full Time Our publicly traded infrastructure software engineering client in Ontario is looking for a Senior C++ Software Engineer to join their team on a full-time permanent basis. The successful candidate will have experience with modern C++, v11 at the very least of higher ,...

  • Lead Process Engineer

    2 weeks ago


    Canada Equinox Engineering Ltd Full time

    Lead Process Engineer Application Deadline: 30 June 2024 Department: Process Engineering Employment Type: Full Time Location: Calgary, AB, Canada Description Equinox Engineering Ltd., a leading EPCM firm headquartered in Calgary, specializes in oil and gas processing, providing comprehensive services in facilities design, implementation, and...

  • Account Lead

    3 weeks ago


    Canada Inworld AI Full time

    Inworld is the best-funded startup in AI and games with a $500 million valuation and backing from top tier investors like Intel, Microsoft, Lightspeed, Bitkraft, Founders Fund, Kleiner Perkins, and more. Inworld is the leading AI engine for games, enabling developers to build groundbreaking game mechanics, dynamic NPCs and worlds that evolve with each...

  • DevOps Engineer

    7 days ago


    Canada Flowmentum, Inc. Full time

    Job Title: DevOps Engineer Company Overview: Join our pioneering team at the forefront of technology and innovation. We are seeking talented Full Stack Engineers with a strong background in System Architecture and a specialization in Site Reliability Engineering (SRE). This role involves engaging with complex DevOps solutions across our SaaS products...

  • DevOps Engineer

    7 days ago


    Canada Flowmentum, Inc. Full time

    Job Title: DevOps Engineer Company Overview: Join our pioneering team at the forefront of technology and innovation. We are seeking talented Full Stack Engineers with a strong background in System Architecture and a specialization in Site Reliability Engineering (SRE). This role involves engaging with complex DevOps solutions across our SaaS products...

  • DevOps Engineer

    6 days ago


    Canada Flowmentum, Inc. Full time

    Job Title: DevOps Engineer Company Overview: Join our pioneering team at the forefront of technology and innovation. We are seeking talented Full Stack Engineers with a strong background in System Architecture and a specialization in Site Reliability Engineering (SRE). This role involves engaging with complex DevOps solutions across our SaaS products...

  • DevOps Engineer

    6 days ago


    Canada Flowmentum, Inc. Full time

    Job Title: DevOps Engineer Company Overview: Join our pioneering team at the forefront of technology and innovation. We are seeking talented Full Stack Engineers with a strong background in System Architecture and a specialization in Site Reliability Engineering (SRE). This role involves engaging with complex DevOps solutions across our SaaS products...


  • Canada Rally Engineering Full time

    The Int. E&I Engineer will report to the Lead Project Engineer at a client based Western Canada site. This role will be a Project Manager for several projects, such as Safety Instrument System – procurement of SIS equipment, replacement of electrical protection relays and upgrade of low voltage switchgear. The position is a seconded role, based in...