Current jobs related to Site Reliability Engineer - Vancouver - NetApp


  • Vancouver, British Columbia, Canada Azad Technology Partners Full time

    Azad Technology Partners seeks a Site Reliability Engineer/DevOps Engineer for a full-time role in Chicago, IL. The ideal candidate has experience in Site Reliability Engineering principles and culture, collaborating with cross-functional teams to develop real-world solutions.Key Responsibilities:Design and maintain observability solutions for comprehensive...


  • Vancouver, British Columbia, Canada Azad Technology Partners Full time

    Azad Technology Partners is seeking an experienced Site Reliability Engineer/DevOps Engineer for a full-time, W2 contract position in Chicago, IL.Schedule: Full-time, 40 hours/week, HybridAssignment Duration: 10 Months.Job Summary:We are looking for a skilled engineer who can run the production environment by monitoring availability and taking a holistic...


  • Vancouver, British Columbia, Canada Royal Bank of Canada Full time

    Job OverviewWe are seeking a skilled Site Reliability Engineering Director to lead our team in delivering high-quality, scalable, and secure software solutions. As a trusted advisor, you will spearhead the development of SRE solutions, drive incident management, and ensure compliance with service level objectives.


  • Vancouver, British Columbia, Canada Royal Bank of Canada> Full time

    Job SummaryWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Royal Bank of Canada. The successful candidate will be responsible for leading the development and implementation of Site Reliability Engineering solutions for all applications within our organization.About the RoleThis is a full-time position that requires a...


  • Vancouver, Canada Arista Full time

    h3>Site Reliability Engineer (SRE) - CloudvisionFull-timeArista Networks is an industry leader in data-driven, client-to-cloud networking for large data center, campus and routing environments. What sets us apart is our relentless pursuit of innovation. We leverage the latest advancements in cloud computing, artificial intelligence, and software-defined...


  • Vancouver, Canada Tbwa ChiatDay Inc Full time

    Launchpad, a people-first technology company, is a leader in North America´s rapidly growing tech sector. Through two solutions, Launchpad supports its clients with digital transformation:PaasportTM, our iPaaS solution, streamlines software integration and automates workflows.Nearshore Staff Augmentation, our managed IT staffing service, connects top IT...


  • Vancouver, Canada Tbwa ChiatDay Inc Full time

    Launchpad, a people-first technology company, is a leader in North America's rapidly growing tech sector. Through two solutions, Launchpad supports its clients with digital transformation: PaasportTM, our iPaaS solution, streamlines software integration and automates workflows. Nearshore Staff Augmentation, our managed IT staffing service, connects top IT...


  • Vancouver, Canada Tbwa ChiatDay Inc Full time

    Launchpad, a people-first technology company, is a leader in North America's rapidly growing tech sector. Through two solutions, Launchpad supports its clients with digital transformation: PaasportTM, our iPaaS solution, streamlines software integration and automates workflows. Nearshore Staff Augmentation, our managed IT staffing service, connects top IT...


  • Vancouver, British Columbia, Canada T-Net British Columbia Full time

    Unleash Your Potential as a Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Global Relay Communications Inc.About Us:Global Relay has set the standard in enterprise information archiving for over 20 years. Our cloud archiving, surveillance, eDiscovery, and analytics solutions securely capture and...


  • Vancouver, Canada TrustFlight Full time

    p>TrustFlight is at the forefront of digitizing the aviation industry with the creation of intelligent workflow applications that automate operating and maintenance processes, enabling our customers to focus on the data and insights that matter. We continue to build an amazing group of people who are all here to make our products, services and culture the...


  • Vancouver, British Columbia, Canada Royal Bank of Canada Full time

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Royal Bank of Canada (RBC). As a key member of our organization, you will play a critical role in ensuring the reliability and performance of our applications.The ideal candidate will have a strong background in application support, software development, and...

  • DevOps Engineer

    3 months ago


    Vancouver, Canada Azad Technology Partners Full time

    p>AZAD Technology Partners is seeking a Site Reliability Engineer/ Devops Engineer for a full-time, W2 Contract position based in Chicago, IL.Schedule: Full-time, 40 hours/week, HybridAssignment Duration: 10 Months.AZAD Technology Partners is committed to Diversity, Equity & Inclusion and is striving to build an even more diverse, inclusive team that...


  • Vancouver, Canada Royal Bank of Canada> Full time

    p>The Lead Support SRE will be responsible for supporting and spearheading the development and implementation of Site Reliability Engineering solutions for all applications within City National Bank (CNB), an RBC company. This individual will need advanced knowledge and experience working in an application development, support and/or technology operations...


  • Vancouver, British Columbia, Canada Royal Bank of Canada> Full time

    Royal Bank of Canada is a leading diversified financial services company in North America. We offer one of the best employer-employee relationships, with competitive compensation and comprehensive benefits.We are seeking a Senior Application Reliability Engineer to join our team. This role will be responsible for designing, implementing, and maintaining Site...


  • Vancouver, British Columbia, Canada Babylist Full time

    Babylist is a leading technology solution for expecting parents and their support community, offering a full-service platform that guides them in making decisions with confidence, staying connected, and building happy families. With over 9M people using our services annually, we're looking for an experienced Software Reliability Engineer to join our Platform...


  • Vancouver, British Columbia, Canada Tbwa ChiatDay Inc Full time

    Launchpad is a people-first technology company, a leader in North America's rapidly growing tech sector. We support our clients with digital transformation through two solutions: PaasportTM, an iPaaS solution that streamlines software integration and automates workflows, and Nearshore Staff Augmentation, a managed IT staffing service that connects top IT...


  • Vancouver, British Columbia, Canada RBC Full time

    Job SummaryWe are seeking an experienced Site Reliability Engineer to join our team at RBC. As a key member of our Technology and Operations group, you will be responsible for ensuring the reliability, scalability, and performance of our applications and infrastructure.About UsRBC is one of Canada's largest banks and a leading provider of financial services....


  • Vancouver, British Columbia, Canada Amazon Full time

    Job OverviewThe AWS Relational Database Service (RDS) team is seeking a skilled Database Reliability Engineer to join their ranks. As a key member of this team, you will be responsible for designing and implementing high-performance, highly available database systems that meet the demanding needs of Amazon's customers.About the TeamThe RDS team is one of the...


  • Vancouver, British Columbia, Canada TrustFlight Full time

    Job Summary:TrustFlight is a leading innovator in digitizing the aviation industry with cutting-edge workflow applications that automate operating and maintenance processes, empowering our customers to focus on data-driven insights.We're seeking a seasoned Site Reliability Engineer (SRE) to join our Operations team and ensure the reliability, scalability,...


  • Vancouver, British Columbia, Canada Tbwa ChiatDay Inc Full time

    About the CompanyLaunchpad, a technology company prioritizing people-first approach, excels in North America's rapidly growing tech sector.PaasportTM, our integration platform as a service (iPaaS), streamlines software integration and automates workflows.Nearshore Staff Augmentation, our managed IT staffing service, connects top IT talent across various...

Site Reliability Engineer

2 months ago


Vancouver, Canada NetApp Full time

Title: Site Reliability Engineer (SRE)

Location:

Bangalore, Karnataka, IN, 560071

Requisition ID: 127074

Job Summary

As a Site Reliability Engineer (SRE) with a specialization in storage, you'll manage and optimize a portfolio of customer-facing cloud services (SaaS/IaaS) on Google Cloud Platform (GCP), ensuring their overall availability, performance, and security. You will collaborate closely with global teams from NetApp and GCP, with a primary focus on supporting Google Cloud NetApp Volumes. This position includes rotational on-call work as part of a global team due to the critical nature of the services we support.

You will be working in a dynamic and fast-paced environment as an engineer on the Site Reliability Engineering (SRE) team. This team is responsible for assisting customers of Google Cloud NetApp Volumes in resolving complex technical issues in production environments. We are seeking an SRE with a deep understanding of storage systems, complex distributed systems, and cloud technologies, and the ability to articulate these concepts clearly to customers and fellow engineers.
You will work with your teammates and our customers to support innovative, cutting-edge technologies that address real-world challenges. You will provide valuable feedback and guidance to our Product and Engineering teams while representing the voice of our customers. You have the opportunity to make a significant impact and take real ownership of your work.

Job Requirements

o Collaborate with external customers and partners to ensure their success with Google Cloud NetApp Volumes.
o Respond to, troubleshoot, and drive root cause analysis (RCA) of complex live production incidents, including cross-platform issues involving OS, networking, and databases in cloud-based SaaS/IaaS environments by following and implementing SRE best practices.
o Continuously monitor, analyze, and measure system health, availability, and latency using tools like Prometheus, Google Cloud Monitoring, ElasticSearch, Grafana, and SolarWinds. Develop and implement steps to improve system and application performance, availability, and reliability.
o Document system knowledge, create runbooks, and ensure critical system information is readily available.
o Stay up-to-date with security trends and proactively identify, diagnose, and resolve complex security issues.
o Maintain and monitor deployment, orchestration of servers, Docker containers, databases, and general backend infrastructure.
o Automate tasks and system components that would benefit from automation or are performed manually.
o Utilize Atlassian Jira to track issues to resolution based on their priority.
o Engage in incident management processes and resolve issues within agreed SLAs/SLOs.

o Extensive experience in storage technologies and incident management processes.
o Advanced knowledge of Linux operating systems (e.g., Ubuntu, CentOS).
o Proficiency in container-based architecture (e.g., Kubernetes).
o Intermediate to advanced knowledge of automation tools and scripting languages such as Ansible, Python, Bash, Go, and PowerShell.
o Solid understanding of algorithms, data structures, and databases (SQL/NoSQL).
o Intermediate knowledge of networking concepts.
o Hands-on experience with cloud environments, particularly GCP.
o Exceptional debugging skills across various platforms and technologies.
o Familiarity with site reliability engineering principles and best practices.

Education

BE in Computer Science or a related field, or 6+ years of professional experience in a relevant role. 


Job Segment: Cloud, Software Engineer, Database, Computer Science, Linux, Technology, Engineering