Site Reliability Engineer

4 weeks ago


Greater Vancouver Metropolitan Area, Canada Altis Technology Full time

Duration: 12 Months

Location: Principally remote, with at least one day per month in office for applicants in the lower mainland. Local candidates are given preference.

Monday – Friday, 9:00 am – 5:00 pm PST

Reference: RITM0091997

Specific Responsibilities and Deliverables:

  • Serve as the subject matter expert (SME) for Dynatrace, responsible for configuring, optimizing, and managing Dynatrace monitoring solutions.
  • Design and implement monitoring strategies using Dynatrace to ensure comprehensive visibility into system performance, availability, and reliability
  • Collaborate with our Engineering & Platform teams to ensure our services, platforms and infrastructure are emitting the right metrics
  • Lead the rollout and adoption of Observability practices, tools, and frameworks across teams and projects.
  • Collaborate with Incident Management teams to resolve critical incidents, conduct post-incident reviews, and implement preventive measures.
  • Communicate complex information clearly and concisely, to explain various business and technical information
  • Proactively identify and mitigate potential issues, bottlenecks, and performance degradation to ensure system reliability and uptime
  • Drive automation initiatives using tools like Ansible, Terraform, or Kubernetes to streamline deployment, configuration, and management of infrastructure.
  • Conduct capacity planning assessments, analyze resource utilization trends, and forecast capacity requirements to support business growth and scalability.

Mandatory Requirements:

  • Bachelor's degree in Computer Science, Engineering, or related field; Master's degree preferred.
  • Extensive and recent experience as a Site Reliability Engineer (SRE) with a focus on Dynatrace and Observability practices.
  • Strong proficiency in Dynatrace monitoring solutions, including configuration, customization, and optimization.
  • Hands-on experience with Observability tools and practices such as distributed tracing, logging, metrics collection, and anomaly detection.
  • Experience with automation tools (Ansible, Terraform, Kubernetes) and Infrastructure as Code (IaC) principles.
  • Solid understanding of cloud platforms (AWS, Azure, GCP) and containerization technologies (Docker, Kubernetes).
  • Excellent problem-solving skills, analytical thinking, and the ability to troubleshoot complex technical issues.
  • Strong communication and collaboration skills, with the ability to work effectively in cross-functional teams and drive initiatives to completion.
  • Relevant certifications (Dynatrace, AWS, Kubernetes, etc.) are a plus.
  • Previous experience in the company is a plus.
  • Local candidates or candidates willing to attend occasional on-site meetings are preferred.

What's In It For You?

  • Meaningful Impact: Contribute to impactful public sector projects.
  • Career Growth: Advance professionally in a dynamic team environment.
  • Collaboration: Foster relationships within and outside the organization.
  • Professional Development: Gain exposure to best practices.
  • Referral program:



  • Greater Toronto Area, Canada OnX Canada Full time

    OnX is looking for a Site Reliability Engineer for one our clients in Toronto. Client: Financial Services Location: Toronto, mostly remote Duration: 6 months with potential extension JBoss in middleware experience is super important Responsibilities: Following the senior technicians plans to build out lower environments with functioning software stacks...


  • Greater Toronto Area, Canada OnX Canada Full time

    OnX is looking for a Site Reliability Engineer for one our clients in Toronto. Client: Financial ServicesLocation: Toronto, mostly remoteDuration: 6 months with potential extensionJBoss in middleware experience is super importantResponsibilities:Following the senior technicians plans to build out lower environments with functioning software stacks including...


  • Greater Toronto Area, Canada OnX Canada Full time

    OnX is looking for a Site Reliability Engineer for one our clients in Toronto. Client: Financial ServicesLocation: Toronto, mostly remoteDuration: 6 months with potential extensionJBoss in middleware experience is super importantResponsibilities:Following the senior technicians plans to build out lower environments with functioning software stacks including...


  • Greater Toronto Area, Canada, Ontario OnX Canada Full time

    OnX is looking for a Site Reliability Engineer for one our clients in Toronto. Client: Financial ServicesLocation: Toronto, mostly remoteDuration: 6 months with potential extensionJBoss in middleware experience is super importantResponsibilities:Following the senior technicians plans to build out lower environments with functioning software stacks including...


  • Vancouver, Canada Dapper Labs Full time

    We’re looking for a Site Reliability Engineer who wants to be at the technical core of an organization that’s completely reshaping how distributed applications on blockchains can reach massive audiences. You will join a Site Reliability Engineering team that has the ability to architect, build, and iterate on resilient, scalable systems. SRE also...


  • Vancouver, Canada LayerZero Full time

    LayerZero The Future is Omnichain. Founded in 2021, LayerZero’s vision is to create a community of cross-chain developers, building dApps that are no longer constrained by individual blockchain capabilities. With LayerZero's simple, generic messaging protocol, builders will develop cross-chain dApps designed to unify the power of individual blockchains. We...


  • Vancouver, Canada Sigmaways Inc Full time

    We're seeking a Site Reliability Engineer to join our team with expertise in Kubernetes and troubleshooting. Responsibilities: Monitor, measure, and report alerts, overall health, performance, and capacity of one or more services. Gain deep knowledge and learn the application stack. Ability to deb


  • Vancouver, British Columbia, Canada Axiom Zen Full time

    We're looking for a Site Reliability Engineer who wants to be at the technical core of an organization that's completely reshaping how distributed applications on blockchains can reach massive audiences.You will join a Site Reliability Engineering team that has the ability to architect, build, and iterate on resilient, scalable systems.SRE also guides the...


  • Greater Toronto Area, Canada Paymentus Full time

    Summary Paymentus leads the North American marketplace in electronic bill payment solutions and is looking for high performers to join our development team building SaaS Fintech solutions across a range of industries. You will contribute to a massively scalable data platform, that is built on top of a world class enterprise platform, supporting thousands...


  • Greater Toronto Area, Canada Paymentus Full time

    Summary Paymentus leads the North American marketplace in electronic bill payment solutions and is looking for high performers to join our development team building SaaS Fintech solutions across a range of industries. You will contribute to a massively scalable data platform, that is built on top of a world class enterprise platform, supporting thousands of...


  • Vancouver, Canada Sigmaways Inc Full time

    We're seeking a Site Reliability Engineer to join our team with expertise in Kubernetes and troubleshooting.Responsibilities:Monitor, measure, and report alerts, overall health, performance, and capacity of one or more services.Gain deep knowledge and learn the application stack.Ability to debug and optimize code and automate routine tasks.Function well in a...


  • Vancouver, Canada Sigmaways Inc Full time

    We're seeking a Site Reliability Engineer to join our team with expertise in Kubernetes and troubleshooting. Responsibilities: Monitor, measure, and report alerts, overall health, performance, and capacity of one or more services. Gain deep knowledge and learn the application stack. Ability to debug and optimize code and automate routine tasks. Function...


  • Vancouver, Canada Sigmaways Inc Full time

    We're seeking a Site Reliability Engineer to join our team with expertise in Kubernetes and troubleshooting.Responsibilities:Monitor, measure, and report alerts, overall health, performance, and capacity of one or more services.Gain deep knowledge and learn the application stack.Ability to debug and optimize code and automate routine tasks.Function well in a...


  • Vancouver, Canada Sigmaways Inc Full time

    We're seeking a Site Reliability Engineer to join our team with expertise in Kubernetes and troubleshooting.Responsibilities:Monitor, measure, and report alerts, overall health, performance, and capacity of one or more services.Gain deep knowledge and learn the application stack.Ability to debug and optimize code and automate routine tasks.Function well in a...


  • Vancouver, Canada Dapper Labs Full time

    We’re looking for a Site Reliability Engineer who wants to be at the technical core of an organization that’s completely reshaping how distributed applications on blockchains can reach massive audiences.You will join a Site Reliability Engineering team that has the ability to architect, build, and iterate on resilient, scalable systems. SRE also guides...


  • Vancouver, BC, Canada Sigmaways Inc Full time

    We're seeking a Site Reliability Engineer to join our team with expertise in Kubernetes and troubleshooting.Responsibilities:Monitor, measure, and report alerts, overall health, performance, and capacity of one or more services.Gain deep knowledge and learn the application stack.Ability to debug and optimize code and automate routine tasks.Function well in a...


  • Vancouver, BC, Canada LayerZero Full time

    LayerZero The Future is Omnichain. Founded in 2021, LayerZero’s vision is to create a community of cross-chain developers, building dApps that are no longer constrained by individual blockchain capabilities. With LayerZero's simple, generic messaging protocol, builders will develop cross-chain dApps designed to unify the power of individual...


  • Vancouver, Canada Targeted Talent Full time

    We are looking for an experienced Senior Site Reliability Engineer for our client. This is a permanent position that is remote to start with later relocation to Calgary or Winnipeg. Our client is a global enterprise company with a product that you've likely used. Experience with coding/software development, along with Site Reliability will be the key to...


  • Vancouver, BC, Canada Dapper Labs Full time

    We’re looking for a Site Reliability Engineer who wants to be at the technical core of an organization that’s completely reshaping how distributed applications on blockchains can reach massive audiences. You will join a Site Reliability Engineering team that has the ability to architect, build, and iterate on resilient, scalable systems. SRE also...


  • Vancouver, Canada Sentry Full time

    About the role The Site Reliability Engineering team is responsible for the deployment, configuration, maintenance and monitoring of Sentry's hosted platform. We do this by leveraging automation tools to automatically spin up and scale services to meet the traffic demands of 1,000,000+ developers. Sentry receives over a billion events a day, and processes...