Site Reliability Engineer - Automation

2 months ago


Vancouver, Canada Arista Full time
Site Reliability Engineer (SRE) - CloudVision
  • Full-time

Arista Networks is an industry leader in data-driven, client-to-cloud networking for large data center, campus and routing environments. What sets us apart is our relentless pursuit of innovation. We leverage the latest advancements in cloud computing, artificial intelligence, and software-defined networking to provide our clients with a competitive edge in an increasingly interconnected world. Our solutions are designed to not only meet the current demands of the digital landscape but to also anticipate and adapt to future challenges.

At Arista we value the diversity of thought and perspectives that each employee brings to the table. We believe that fostering an inclusive environment, where individuals from various backgrounds and experiences feel welcome, is essential for driving creativity and innovation.

Our commitment to excellence has earned us several prestigious awards, such as Best Engineering Team, Best Company for Diversity, Compensation, and Work-Life Balance. At Arista, we take pride in our track record of success and strive to maintain the highest standards of quality and performance in everything we do.

Who You’ll Work With

SREs at Arista combine strong software and systems engineering with a passion for operating production systems at scale. As an SRE you’ll be part of the team responsible for our global service fleet.

What You’ll Do
As an SRE, you’ll be responsible for our global CloudVision service fleet, including:

  • Building the CI/CD lifecycle for services, from inception and design to deployment and scaling
  • Improving operational processes through automation
  • Identifying key service indicators to be used in capacity planning
  • Owning disaster recovery and management
  • Driving infrastructure and cloud-based application security design
  • Leading sustainable incident response and blameless postmortems
  • Being an active member of our globally distributed on-call team

Arista’s CloudVision is an enterprise network management and streaming telemetry SaaS offering. CloudVision is deployed on Kubernetes across global regions, using Spinnaker for our CI/CD pipeline. Our tech stack runs on GKE, using HBase/Hadoop as the main distributed database and storage layer, Elasticsearch for powering search, ClickHouse for fast real-time queries of flow data, our own Kafka-based distributed real-time stream processing layer for analytics, and TensorFlow for ML analysis. Our monitoring system is built on top of Prometheus, Grafana, Loki, and other OSS tools.

Minimum Qualifications:

  • BS/MS degree in Computer Science or a related subject, or relevant experience.
  • 4+ years of software engineering experience.
  • Experience developing or managing deployments of distributed database systems or scale-out applications for a SaaS environment.

Compensation Information:

The new hire base pay for this role has a salary range of CAD 95,000 to 145,000. US-based employees are also entitled to benefits including medical, dental, vision, wellbeing, tax savings and income protection.


