Senior Site Reliability Engineer

2 days ago


Vancouver, Canada Microsoft Canada Full time

Microsoft is a company where passionate innovators come to collaborate, envision what can be, and take their careers further. This is a world of more possibilities, more innovation, more openness, and sky-is-the-limit thinking in a cloud-enabled world.

Microsoft’s Azure Data engineering team is leading the transformation of analytics in the world of data with products like databases, data integration, big data analytics, messaging & real-time analytics, and business intelligence. The products in our portfolio include Microsoft Fabric, Azure SQL DB, Azure Cosmos DB, Azure PostgreSQL, Azure Data Factory, Azure Synapse Analytics, Azure Service Bus, Azure Event Grid, and Power BI. Our mission is to build the data platform for the age of AI, powering a new class of data-first applications and driving a data culture.

Within Azure Data, the databases team builds and maintains Microsoft's operational database systems. We store and manage data in a structured way to enable a multitude of applications across various industries. We are on a journey to enable developer-friendly, mission-critical, AI-enabled operational databases across relational, non-relational, and Open Source Software (OSS) offerings.

We believe in making the day in the life of the On-Call Engineer boring while living up to the expectations of a massive cloud service with stringent Service Level Objectives (SLOs). We do this by thinking differently, stretching ourselves to go all the way to the root of the problem, keeping data front and center in all our decisions, and taking a systems approach to generating outcomes that far exceed expectations. Helping attain aspirational SLOs through pragmatic innovation is what sets the SREs in Cosmos DB apart. If you share the same purpose, cause, and belief and have the passion to follow this pursuit, please read the rest of the job description to see what we do. We would love to have you join us.

Azure Cosmos DB is Microsoft’s next generation of globally distributed, massively scalable, multi-model cloud database service. It is designed to enable developers to build planet-scale applications. Azure Cosmos DB is one of the fastest growing Azure services. Joining the Azure Cosmos DB team is a fantastic opportunity to work with incredibly talented engineers operating like a startup. You will be at the forefront of building and shaping the Livesite Automation and AI Ops stack in Cosmos DB and will lead the path for broader adoption across Microsoft Azure.

Cosmos DB is a database of choice across the spectrum, from the hobbyist developer to the largest Fortune 500 companies. The database provides the data backbone of many critical systems in Health Care, Retail, Telecommunications, IoT, etc., where Service Availability and Latency are paramount. Cosmos DB provides financially backed SLAs (service level agreements) around 99.99% Availability and
We are looking for a self-driven Senior Site Reliability Engineer (SRE) who likes taking a data-driven and systems-based approach to solving Service Reliability problems. You will be responsible for building and optimizing solutions that can analyze massive amounts of telemetry and other Service Health indicators in near real time and perform automated root cause analysis and the necessary mitigations to restore SLOs.

Our team focuses on diversity of all types of candidates for our roles and we strive to hire people with different experiences and perspectives into our team. To that end, we know that no candidate has every desired skill and experience, but all of us together make our team strong.

We do not just value differences or different perspectives. We seek them out and invite them in so we can tap into the collective power of everyone in the company. As a result, our customers are better served.
Individual Contributor



  • Vancouver, British Columbia, Canada Royal Bank of Canada Full time

    Job Summary: The Royal Bank of Canada is seeking a skilled Site Reliability Engineering Specialist to join its team. This role will be responsible for the support, development, and implementation of Site Reliability Engineering solutions for all applications within the bank's technology infrastructure. Key Responsibilities: Support and Development of Site...


  • Vancouver, Canada Royal Bank of Canada Full time

    Job Summary: The Application Support SRE will be responsible for the support, development, and implementation of Site Reliability Engineering solutions for all applications within City National Bank (CNB), an RBC company. This team will work collaboratively with teams across several lines of business


  • Vancouver, Canada Microsoft Canada Full time

    Are you interested in working for one of the most exciting teams at Microsoft? Then look no further than Microsoft Teams SRE team. You will be building solutions that leverage state-of-the-art technologies to deliver the next evolution in collaboration and teamwork. What is a Site Reliability Engineer (SRE)? SRE is what you get when you treat operations as...


  • Vancouver, Canada Microsoft Canada Full time

    Microsoft is a company where passionate innovators come to collaborate, envision what can be and take their careers further. This is a world of more possibilities, more innovation, more openness, and the sky is the limit thinking in a cloud-enabled world. Microsoft’s Azure Data engineering team is leading the transformation of analytics in the world of data...


  • Vancouver, British Columbia, Canada S.i. Systems Full time

    Job Description: We are seeking a Senior Site Reliability Engineer to develop robust observability solutions using Dynatrace and automate key monitoring processes through Terraform and PowerShell. Key Responsibilities: • Develop and implement observability solutions using Dynatrace • Automate key monitoring processes through Terraform and PowerShell About the...


  • Vancouver, British Columbia, Canada Electronic Arts Full time

    Responsibilities: We are seeking a skilled Site Reliability Engineer to join our team at Electronic Arts. As a Site Reliability Engineer, you will work closely with our development teams to address build issues and improve our systems. Key Responsibilities: Collaborate with development teams to identify and resolve build issues. Create and maintain pipelines and...


  • Vancouver, British Columbia, B6B, British Columbia, Canada Microsoft Canada Full time

    Are you interested in working for one of the most exciting teams at Microsoft? Then look no further than Microsoft Teams SRE team. You will be building solutions that leverage state-of-the-art technologies to deliver the next evolution in collaboration and teamwork. What is a Site Reliability Engineer (SRE)? SRE is what you get when you treat operations as if...


  • Vancouver, British Columbia, B6B, British Columbia, Canada Microsoft Canada Full time

    Microsoft is a company where passionate innovators come to collaborate, envision what can be and take their careers further. This is a world of more possibilities, more innovation, more openness, and the sky is the limit thinking in a cloud-enabled world. Microsoft’s Azure Data engineering team is leading the transformation of analytics in the world of data...


  • Vancouver, Canada NetApp Full time

    Title: Site Reliability Engineer (SRE) Location: Bangalore, Karnataka, IN, 560071 Requisition ID: 127074 Job Summary: As a Site Reliability Engineer (SRE) with a specialization in storage, you'll manage and optimize a portfolio of customer-facing cloud services (SaaS/IaaS) on Google Cloud Platform (GCP), ensuring their overall availability, performance,...


  • Vancouver, Canada Themis Solutions Inc. Full time

    We are currently seeking a new Site Reliability Engineer, Co-op, to join our Engineering team in Burnaby, Calgary or Toronto. Applicants should be available for an 8-month co-op period from January 2025 to August 2025. What your team does: As a Site Reliability Engineer, you will help build, improve, and maintain Clio’s globally distributed network of...


  • Vancouver, Canada Royal Bank of Canada Full time

    Job Summary: The Lead Support SRE will be responsible for supporting and spearheading the development and implementation of Site Reliability Engineering solutions for all applications within City National Bank (CNB), an RBC company. This team will work collaboratively with teams across several li


  • Vancouver, Canada RBC Full time

    Job Summary The Application Support SRE will be responsible for the support, development, and implementation of Site Reliability Engineering solutions for all applications within City National Bank (CNB), an RBC company. This team will work collaboratively with teams across several lines of business and other Technology and Operations partners as a...


  • Vancouver, Canada Conexiom Full time

    About the Opportunity: Conexiom is seeking a dedicated and experienced Site Reliability Engineering (SRE) Senior Manager to lead our SRE team. The role involves leading the Cloud SRE team in day-to-day operations, which include monitoring, support activities, ensuring customer satisfaction through reliable service, and building and designing cloud...


  • Vancouver, British Columbia, Canada Royal Bank of Canada Full time

    Company Overview: The Royal Bank of Canada (RBC) is a leading financial institution that prides itself on providing exceptional banking services to its clients. With a strong presence in the Canadian market, RBC has a reputation for innovation and customer satisfaction. Salary: We are offering a highly competitive salary range of $120,000 - $180,000 per year,...


  • Vancouver, Canada S.i. Systems Full time

    Our Vancouver Client is seeking a Senior Site Reliability Engineer to develop robust observability solutions using Dynatrace and automate key monitoring processes through Terraform and PowerShell - 11396312 months contract, Vancouver - 2 days/month in office and as needed basis for meeting


  • Vancouver, Canada Microsoft Full time

    Overview Are you an individual who loves to work on large-scale projects at one of the most exciting and diverse divisions within Microsoft? Are you looking for big, creative challenges that show immediate results since your customers are the product engineers for Office and M365? Do you want to be at the core of it all, acting as a force multiplier...


  • Vancouver, Canada Royal Bank of Canada Full time

    Job Summary: The Application Support SRE will be responsible for the support, development, and implementation of Site Reliability Engineering solutions for all applications within City National Bank (CNB), an RBC company. This team will work collaboratively with teams across several lines of business and other Technology and Operations partners as a...


  • Vancouver, Canada RBC - Royal Bank Full time

    Job Summary: The Application Support SRE will be responsible for the support, development, and implementation of Site Reliability Engineering solutions for all applications within City National Bank (CNB), an RBC company. This team will work collaboratively with teams across several lines of business and other Technology and Operations partners as a...