Site Reliability Engineer

4 weeks ago


Vancouver, Canada Electronic Arts Full time

We are a global team of creators, storytellers, technologists, experience originators, innovators and so much more. We believe amazing games and experiences start with teams as diverse as the players and communities we serve. At Electronic Arts, the only limit is your imagination.

Production Infrastructure & Engineering (PI&E) organization provides the essential platforms and infrastructure hosting solutions that power EA's live services. Our charter is to make EA's games and services available to all players anytime and anywhere. To do this, we focus on the high availability of infrastructure, primary services, and studio services. We aim to help developers to experiment and build new games quickly with infrastructure services on-demand and workflows that promote rapid development in the cloud. In all of this, we focus on being there for players where and when they want to play.

As a Site Reliability Engineer, your role covers the entire life-cycle of a product-- from helping developers with architecture and delivery to on-call incident response and triage. Your primary focus will be automation and continuous integration/delivery with an emphasis on solving operations issues using software. You will report to the Senior SRE Manager.

Responsibilities:

  • You will build and operate distributed, large-scale, cloud-based infrastructure using modern open-source software solutions.
  • You will use automation technologies to ensure repeatability, eliminate toil, reduce mean time to detection and resolution (MTTD & MTTR) and repair services.
  • You will perform root cause analysis and post-mortems with an eye towards future prevention.
  • You will design and build CI/CD pipelines.
  • You will create monitoring, alerting and dashboarding solutions that improve visibility into EA's application performance and business metrics.
  • You will produce documentation and support tooling for online support teams.

Qualifications:

  • 5+ years of experience with Virtualization, Containerization, Cloud Computing (AWS preferred), VMWare ecosystems, Kubernetes, or Docker.
  • 5+ years of experience supporting high-availability production-grade infrastructure and applications with defined SLIs and SLOs.
  • Systems Administration experience, including a strong understanding of Linux / Unix.
  • Network experience, including an understanding of standard protocols/components.
  • Automation and orchestration experience including Terraform, Helm, Chef, Puppet, Packer.
  • Experience writing code in Python, Golang, or Java.
  • Experience working with distributed systems.


  • Vancouver, Canada Dapper Labs Full time

    We’re looking for a Site Reliability Engineer who wants to be at the technical core of an organization that’s completely reshaping how distributed applications on blockchains can reach massive audiences. You will join a Site Reliability Engineering team that has the ability to architect, build, and iterate on resilient, scalable systems. SRE also...


  • Vancouver, Canada LayerZero Full time

    LayerZero The Future is Omnichain. Founded in 2021, LayerZero’s vision is to create a community of cross-chain developers, building dApps that are no longer constrained by individual blockchain capabilities. With LayerZero's simple, generic messaging protocol, builders will develop cross-chain dApps designed to unify the power of individual blockchains. We...


  • Vancouver, Canada Axiom Zen Full time

    We’re looking for a Site Reliability Engineer who wants to be at the technical core of an organization that’s completely reshaping how distributed applications on blockchains can reach massive audiences. You will join a Site Reliability Engineering team that has the ability to architect, build, and iterate on resilient, scalable systems. SRE also...


  • Vancouver, British Columbia, Canada Axiom Zen Full time

    We're looking for a Site Reliability Engineer who wants to be at the technical core of an organization that's completely reshaping how distributed applications on blockchains can reach massive audiences.You will join a Site Reliability Engineering team that has the ability to architect, build, and iterate on resilient, scalable systems.SRE also guides the...


  • Vancouver, Canada Dapper Labs Full time

    We’re looking for a Site Reliability Engineer who wants to be at the technical core of an organization that’s completely reshaping how distributed applications on blockchains can reach massive audiences.You will join a Site Reliability Engineering team that has the ability to architect, build, and iterate on resilient, scalable systems. SRE also guides...


  • Vancouver, BC, Canada Stafflink Full time

    Job Description Position: Site Reliability Engineer Duration: 12 Months Location: Principally remote, with at least one day per month in office for applicants in the lower mainland. Local candidates are given preference. Work hours: Monday – Friday, 9:00 am – 5:00 pm PST Reference: RITM0091997 Specific Responsibilities and Deliverables: Serve...


  • Vancouver, BC, Canada LayerZero Full time

    LayerZero The Future is Omnichain. Founded in 2021, LayerZero’s vision is to create a community of cross-chain developers, building dApps that are no longer constrained by individual blockchain capabilities. With LayerZero's simple, generic messaging protocol, builders will develop cross-chain dApps designed to unify the power of individual...


  • Vancouver, BC, Canada Dapper Labs Full time

    We’re looking for a Site Reliability Engineer who wants to be at the technical core of an organization that’s completely reshaping how distributed applications on blockchains can reach massive audiences. You will join a Site Reliability Engineering team that has the ability to architect, build, and iterate on resilient, scalable systems. SRE also...


  • Vancouver, British Columbia, Canada Electronic Arts Full time

    We are a global team of creators, storytellers, technologists, experience originators, innovators and so much more. We believe amazing games and experiences start with teams as diverse as the players and communities we serve. At Electronic Arts, the only limit is your imagination.Production Infrastructure & Engineering (PI&E) organization provides the...


  • Vancouver, BC, Canada Dapper Labs Full time

    We’re looking for a Site Reliability Engineer who wants to be at the technical core of an organization that’s completely reshaping how distributed applications on blockchains can reach massive audiences. You will join a Site Reliability Engineering team that has the ability to architect, build, and iterate on resilient, scalable systems. The support we...


  • Vancouver, Canada Taurus SA Full time

    Are you ready to take on an entrepreneurial challenge in the digital asset industry? Taurus, a global leader in digital asset infrastructure, has an exciting opportunity for you. Founded in April 2018, Taurus provides enterprise-grade solutions to issue, custody, and trade digital assets, including cryptocurrencies, tokenized assets, NFTs, and digital...

  • Senior SRE

    3 weeks ago


    Vancouver, BC, Canada RAZR Marketing, Inc. Full time

    You will be required to be in our office In Vancouver, BC three times per week. These values have made RAZR what it is for years, and today, they are more important than ever. You can't wait to get out of bed in the morning & get on with your day We are seeking a skilled and motivated Site Reliability Engineer (SRE) to join our dynamic team at RAZR...


  • Vancouver, BC, Canada S I Systems Full time

    Senior Site Reliability Engineer to design and implement Dynatrace and rollout adoption of Observability practices, tools and frameworks. -0091997 Our Vancouver Client is seeking a Senior Site Reliability Engineer to design and implement Dynatrace and rollout adoption of Observability practices, tools and frameworks. -009199712 months contract, Vancouver -...


  • Vancouver, Canada T-Net British Columbia Full time

    Site Reliability Engineer Co-op (Sept 2024 - May 2025)Job OverviewVisier is the leader in people analytics and we believe in a 'people-first' approach to business strategy. Our innovative technology transforms the way that organisations make decisions, allowing them to elevate their employees and drive better business outcomes. Embarking on an exciting new...


  • Vancouver, BC, Canada T-Net British Columbia Full time

    Site Reliability Engineer Co-op (Sept 2024 - May 2025) Job Overview Visier is the leader in people analytics and we believe in a 'people-first' approach to business strategy. Our innovative technology transforms the way that organisations make decisions, allowing them to elevate their employees and drive better business outcomes. Embarking on an exciting...


  • Vancouver, BC, Canada Goldbeck Recruiting Full time

    Our client is an engineering company specializing in fire, life safety and building code consulting. Their corporate mission is to carry on and expand our business as a consulting engineering firm specializing in building code consulting and fire protection engineering design and to be fully trained and qualified to interpret and apply the principles of...


  • Vancouver, Canada Ballard Power Systems Full time

    Position: Junior Reliability Applied Scientist/Engineer Location:  Vancouver, British Columbia Job Id: 2072 # of Openings: 1 This is an excellent opportunity for an entry-level Engineer/Scientist to join the Ballard Product Reliability Team. The new hire will be responsible for providing reliability engineering support for Ballard’s fuel...


  • Vancouver, BC, Canada Goldbeck Recruiting Inc. Full time

    Extended Health, Cell phone allowance and Car mileage Our client is an engineering company specializing in fire, life safety and building code consulting. Their corporate mission is to carry on and expand our business as a consulting engineering firm specializing in building code consulting and fire protection engineering design and to be fully trained and...


  • Vancouver, Canada Ballard Full time

    This is an excellent opportunity for an entry-level Engineer/Scientist to join the Ballard Product Reliability Team. The new hire will be responsible for providing reliability engineering support for Ballard’s fuel cell products. The individual will use standard reliability engineering principles and technical skills to define, measure, model, and improve...


  • Vancouver, BC, Canada Ballard Power Systems Full time

    Position:  Junior Reliability Applied Scientist/Engineer Location: Vancouver, British Columbia Job Id:  2072 # of Openings:  1 This is an excellent opportunity for an entry-level Engineer/Scientist to join the Ballard Product Reliability Team. The new hire will be responsible for providing reliability engineering support for Ballard’s fuel...