Site Reliability Engineer

4 weeks ago


Vancouver, British Columbia, Canada Arista Full time
Site Reliability Engineer - Cloud Vision

Arista Networks is a leader in data-driven, client-to-cloud networking for large data center, campus, and routing environments. Our innovative approach leverages the latest advancements in cloud computing, artificial intelligence, and software-defined networking to provide our clients with a competitive edge in an increasingly interconnected world.

We value diversity of thought and perspectives, fostering an inclusive environment where individuals from various backgrounds and experiences feel welcome. This diversity drives creativity and innovation, essential for our success.

Our commitment to excellence has earned us prestigious awards, including Best Engineering Team, Best Company for Diversity, Compensation, and Work-Life Balance. We take pride in our track record of success and strive to maintain the highest standards of quality and performance in everything we do.

What You'll Do
  • Design and implement the CI/CD lifecycle for services, from inception to deployment and scaling
  • Improve operational processes through automation and innovation
  • Identify key service indicators for capacity planning and optimization
  • Owning disaster recovery and management
  • Drive infrastructure and cloud-based application security design
  • Lead sustainable incident response and blameless postmortems
  • Be an active member of our globally distributed on-call team

Arista's CloudVision is an enterprise network management and streaming telemetry SaaS offering. CloudVision is deployed on Kubernetes across global regions using Spinnaker for our CI/CD pipeline. Our tech stack runs on GKE, using HBase/Hadoop as the main distributed database and storage layer, ElasticSearch for powering search data, ClickHouse for fast real-time queries of flow data, our own Kafka-based distributed real-time stream processing layer for analytics, and TensorFlow for ML analysis. Our monitoring system is built on top of Prometheus, Grafana, Loki, and other OSS tools.

Requirements
  • BS/MS degree in Computer Science or a relevant experience subject
  • 4+ years software engineering experience
  • Experience developing or managing deployments of distributed database systems or scale-out applications for a SaaS environment

Compensation Information:

The new hire base pay for this role has a salary range of CAD 95,000 to 145,000. US-based employees are also entitled to benefits including medical, dental, vision, wellbeing, tax savings, and income protection.



  • Vancouver, British Columbia, Canada Azad Technology Partners Full time

    Site Reliability Engineer/DevOps EngineerAZAD Technology Partners is seeking a skilled Site Reliability Engineer/DevOps Engineer to join our team in Chicago, IL.About the RoleWe are looking for a candidate with extensive experience in Site Reliability Engineering principles and culture. The ideal candidate will have a strong background in designing and...


  • Vancouver, British Columbia, Canada Azad Technology Partners Full time

    About the RoleAZAD Technology Partners is seeking a skilled Site Reliability Engineer/DevOps Engineer to join our team in Chicago, IL. As a key member of our team, you will be responsible for ensuring the reliability and performance of our production environment.Key ResponsibilitiesDesign and implement observability solutions for comprehensive monitoring and...


  • Vancouver, British Columbia, Canada Azad Technology Partners Full time

    About the RoleAZAD Technology Partners is seeking a skilled Site Reliability Engineer/DevOps Engineer to join our team in Chicago, IL. As a key member of our team, you will be responsible for ensuring the reliability and performance of our production environment.Key ResponsibilitiesDesign and implement observability solutions for comprehensive monitoring and...


  • Vancouver, British Columbia, Canada Royal Bank of Canada Full time

    Job SummaryThe Application Support SRE will be responsible for the support, development, and implementation of Site Reliability Engineering solutions for all applications within Royal Bank of Canada. This team will work collaboratively with teams across several lines of business to ensure the stability and scalability of our systems.Key...


  • Vancouver, British Columbia, Canada Royal Bank of Canada Full time

    Job SummaryThe Application Support SRE will be responsible for the support, development, and implementation of Site Reliability Engineering solutions for all applications within Royal Bank of Canada. This team will work collaboratively with teams across several lines of business to ensure the stability and scalability of our systems.Key...


  • Vancouver, British Columbia, Canada Azad Technology Partners Full time

    Site Reliability Engineer/DevOps EngineerAZAD Technology Partners is seeking a skilled Site Reliability Engineer/DevOps Engineer for a full-time, W2 Contract position based in Chicago, IL.About the RoleWe are looking for a candidate who has extensive experience in Site Reliability Engineering principles and culture, and who enjoys collaborating with a...


  • Vancouver, British Columbia, Canada Azad Technology Partners Full time

    Site Reliability Engineer/DevOps EngineerAZAD Technology Partners is seeking a skilled Site Reliability Engineer/DevOps Engineer for a full-time, W2 Contract position based in Chicago, IL.About the RoleWe are looking for a candidate who has extensive experience in Site Reliability Engineering principles and culture, and who enjoys collaborating with a...


  • Vancouver, British Columbia, Canada Electronic Arts Full time

    Job Title: Site Reliability EngineerElectronic Arts is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based systems.Responsibilities:Collaborate with development teams to identify and resolve build issues...


  • Vancouver, British Columbia, Canada Electronic Arts Full time

    Job Title: Site Reliability EngineerElectronic Arts is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based systems.Responsibilities:Collaborate with development teams to identify and resolve build issues...


  • Vancouver, British Columbia, Canada Tbwa ChiatDay Inc Full time

    Unlock Your Potential as a Site Reliability EngineerAt Visier, we're on a mission to revolutionize the way organizations make decisions by harnessing the power of people analytics. As a Site Reliability Engineer, you'll play a critical role in ensuring the smooth operation of our systems, leveraging cutting-edge technology to drive business outcomes.Key...


  • Vancouver, British Columbia, Canada Tbwa ChiatDay Inc Full time

    Unlock Your Potential as a Site Reliability EngineerAt Visier, we're on a mission to revolutionize the way organizations make decisions by harnessing the power of people analytics. As a Site Reliability Engineer, you'll play a critical role in helping us achieve this vision.What You'll DoCollaborate with our DevOps scrum team to develop and implement...


  • Vancouver, British Columbia, Canada Tbwa ChiatDay Inc Full time

    Unlock Your Potential as a Site Reliability EngineerAt Visier, we're on a mission to revolutionize the way organizations make decisions by harnessing the power of people analytics. As a Site Reliability Engineer, you'll play a critical role in helping us achieve this vision.What You'll DoCollaborate with our DevOps scrum team to develop and implement...


  • Vancouver, British Columbia, Canada Electronic Arts Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Electronic Arts. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based systems.Responsibilities:Collaborate with development teams to identify and resolve build...


  • Vancouver, British Columbia, Canada Electronic Arts Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Electronic Arts. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based systems.Responsibilities:Collaborate with development teams to identify and resolve build...


  • Vancouver, British Columbia, Canada Microsoft Full time

    Unlock Your Potential as a Site Reliability Engineer II at MicrosoftAre you a skilled engineer looking for a challenging role that combines technical expertise with business acumen? Do you want to work on large-scale projects that drive innovation and customer satisfaction? We're seeking a talented Site Reliability Engineer II to join our ES365 team, where...


  • Vancouver, British Columbia, Canada Microsoft Full time

    Unlock Your Potential as a Site Reliability Engineer II at MicrosoftAre you a skilled engineer looking for a challenging role that combines technical expertise with business acumen? Do you want to work on large-scale projects that drive innovation and customer satisfaction? We're seeking a talented Site Reliability Engineer II to join our ES365 team, where...


  • Vancouver, British Columbia, Canada Electronic Arts Full time

    About the RoleWe are seeking a skilled Site Reliability Engineer to join our team at Electronic Arts. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our systems and services.ResponsibilitiesCollaborate with development teams to address build issues and improvementsCreate, modify, and maintain...


  • Vancouver, British Columbia, Canada Electronic Arts Full time

    ResponsibilitiesWe are seeking a skilled Site Reliability Engineer to join our team at Electronic Arts. As a Site Reliability Engineer, you will work closely with our development teams to address build issues and improve our systems.Key ResponsibilitiesCollaborate with development teams to identify and resolve build issuesCreate and maintain pipelines and...


  • Vancouver, British Columbia, Canada https:www.energyjobline.comsitemap Full time

    Job SummaryWe are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud infrastructure. You will work closely with our engineering teams to design, develop, and maintain our cloud-based systems.Key ResponsibilitiesDesign and...


  • Vancouver, British Columbia, Canada Electronic Arts Full time

    About the RoleWe are seeking a skilled Site Reliability Engineer to join our team at Electronic Arts. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our systems and services.ResponsibilitiesCollaborate with development teams to address build issues and improve our systemsCreate, modify, and...