Infrastructure Reliability Engineer II

1 month ago


Vancouver, British Columbia, Canada Microsoft Full time
Overview

Are you passionate about contributing to large-scale initiatives within one of the most dynamic and diverse sectors at Microsoft? Do you seek challenging opportunities that yield immediate impact, particularly for product engineers working on Office and M365? Would you like to be at the forefront, empowering engineering teams to excel in their work? If so, this role is tailored for you.

The ES365 (Engineering Systems 365) team is responsible for the comprehensive developer experience in Office and M365 (Substrate), encompassing everything from source control and check-in processes to build, validation, and deployment automation. We are embarking on significant enhancements to streamline app development and deployment across various platforms, transitioning from proprietary tools to unified Microsoft investments, open-source solutions, and industry-standard technologies. This is an exhilarating phase as we aim to redefine productivity by harnessing the potential of AI across the board.

We are in search of a Site Reliability Engineer II (SRE) to join the Infrastructure teams within ES365. The responsibilities of these teams include, but are not limited to:

  1. Azure management and governance
  2. Ensuring business continuity
  3. Infrastructure as Code practices
  4. Network engineering tasks
  5. Service provisioning and deployment
  6. Security and vulnerability oversight
  7. Systems state management

As a new SRE, you will have the opportunity to deliver innovative solutions using a contemporary DevOps methodology, leveraging the full spectrum of technologies available at Microsoft. Your efforts will enable our organization to adapt more efficiently to changing customer needs and market dynamics, while simultaneously reducing costs, minimizing redundant work, and enhancing efficiencies through automation.

At Microsoft, our mission is to empower every individual and organization on the planet to achieve more. We unite with a growth mindset, innovate to empower others, and collaborate to achieve our collective objectives. Each day, we build upon our core values of respect, integrity, and accountability to foster a culture of inclusion where everyone can thrive both professionally and personally.

Qualifications

Required Qualifications

  1. A minimum of 4 years of technical experience in software engineering, network engineering, or systems administration OR a Bachelor's Degree in Computer Science, Information Technology, or a related field, accompanied by at least 1 year of relevant technical experience OR a Master's Degree in Computer Science, Information Technology, or a related field.
  2. Candidates must meet Microsoft, customer, and/or government security screening requirements for this role. These requirements include, but are not limited to, specialized security screenings:
  3. Microsoft Cloud Background Check: This position requires passing the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.

Preferred Qualifications

  1. At least 5 years of technical experience in software engineering, network engineering, or systems administration OR a Bachelor's Degree in Computer Science, Information Technology, or a related field, along with 2 years of relevant technical experience.
  2. Proficient full-stack troubleshooting skills across network, application, hardware, management fabric, and distributed services layers.
  3. Experience in accurately documenting complex systems to effectively communicate technical concepts across teams.
  4. Familiarity with implementing and managing Service Level Objectives (SLOs) and Service Level Indicators (SLIs) for production services.
  5. Proficiency with one or more automation tools or frameworks (e.g., Terraform, ARM, Chef, Bicep) and scripting languages (e.g., Python, Bash, Powershell) or similar.
Responsibilities
  1. Engage in onboarding, code/design reviews, and regular meetings with engineering teams responsible for product development and management.
  2. Independently create code or scripts that automate repetitive and scalable operational processes.
  3. Design, develop, and maintain telemetry pipelines and monitoring tools that provide insights into operational metrics.
  4. Develop, test, troubleshoot, and implement modifications to optimize code and enhance products.
  5. Respond to incidents during scheduled on-call rotations.
  6. Author technical documentation for your tools and services.
  7. Participate in post-incident reviews to drive service enhancements.

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.

  • Leading healthcare benefits
  • Access to educational resources
  • Discounts on products and services
  • Savings and investment opportunities
  • Maternity and paternity leave
  • Generous time-off policies
  • Community giving programs
  • Opportunities for networking and connection


  • Vancouver, British Columbia, Canada Microsoft Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer II to join our Engineering Systems 365 (ES365) team at Microsoft. As a key member of our Infrastructure team, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure.The ES365 team is responsible for designing, building,...


  • Vancouver, British Columbia, Canada Microsoft Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer II to join our Engineering Systems 365 (ES365) team at Microsoft. As a key member of our Infrastructure team, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure.The ES365 team is responsible for designing, building,...


  • Vancouver, British Columbia, Canada Microsoft Full time

    Overview Are you passionate about tackling large-scale challenges within one of the most dynamic divisions at Microsoft? Do you thrive on innovative projects that yield immediate impact for product engineers working on Office and M365? If you seek to be at the heart of transformative initiatives, enabling engineering teams to excel, this opportunity may...


  • Vancouver, British Columbia, Canada Microsoft Full time

    Overview Are you passionate about tackling large-scale engineering challenges within one of the most dynamic divisions at Microsoft? Do you thrive on innovative projects that yield immediate impact, particularly for product engineers in Office and M365? If you are eager to play a pivotal role in enhancing the productivity of engineering teams, this...


  • Vancouver, British Columbia, Canada Microsoft Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer II to join our Engineering Systems 365 (ES365) team at Microsoft. As a key member of our Infrastructure team, you will play a critical role in ensuring the reliability and scalability of our cloud-based services.Key ResponsibilitiesParticipate in onboarding, code/design reviews, and...


  • Vancouver, British Columbia, Canada Microsoft Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer II to join our Engineering Systems 365 (ES365) team at Microsoft. As a key member of our Infrastructure team, you will play a critical role in ensuring the reliability and scalability of our cloud-based services.Key ResponsibilitiesParticipate in onboarding, code/design reviews, and...


  • Vancouver, British Columbia, Canada Microsoft Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer II to join our Engineering Systems 365 (ES365) team at Microsoft. As a key member of our Infrastructure team, you will play a critical role in ensuring the reliability and scalability of our cloud-based services.Key ResponsibilitiesParticipate in onboarding, code/design reviews, and...


  • Vancouver, British Columbia, Canada Microsoft Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer II to join our Engineering Systems 365 (ES365) team at Microsoft. As a key member of our Infrastructure team, you will play a critical role in ensuring the reliability and scalability of our cloud-based services.Key ResponsibilitiesParticipate in onboarding, code/design reviews, and...


  • Vancouver, British Columbia, Canada Microsoft Full time

    OverviewWe are seeking a highly skilled Site Reliability Engineer II to join our Engineering Systems 365 (ES365) team at Microsoft. As a key member of our Infrastructure team, you will play a critical role in designing, developing, and maintaining the tools that make up the end-to-end developer experience in Office and M365.ResponsibilitiesParticipate in...


  • Vancouver, British Columbia, Canada Microsoft Full time

    OverviewWe are seeking a highly skilled Site Reliability Engineer II to join our Engineering Systems 365 (ES365) team at Microsoft. As a key member of our Infrastructure team, you will play a critical role in designing, developing, and maintaining the tools that make up the end-to-end developer experience in Office and M365.ResponsibilitiesParticipate in...


  • Vancouver, British Columbia, Canada Microsoft Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer II to join our Engineering Systems 365 (ES365) team at Microsoft. As a key member of our Infrastructure team, you will play a critical role in ensuring the reliability and scalability of our cloud-based services.Key ResponsibilitiesParticipate in onboarding, code/design reviews, and...


  • Vancouver, British Columbia, Canada Microsoft Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer II to join our Engineering Systems 365 (ES365) team at Microsoft. As a key member of our Infrastructure team, you will play a critical role in ensuring the reliability and scalability of our cloud-based services.Key ResponsibilitiesParticipate in onboarding, code/design reviews, and...


  • Vancouver, British Columbia, Canada Electronic Arts Full time

    Join Our Team as an Infrastructure Reliability EngineerCollaborate with software development teams to resolve build challenges and implement enhancementsDesign and sustain continuous integration pipelines and workflow solutionsImprove various system tools through application developmentOversee and upgrade automation pipelines in partnership with...


  • Vancouver, British Columbia, Canada tsworks Full time

    Company Overviewtsworks Canada, Inc is a leading provider of technology products and services, headquartered in Ontario, Canada. As a subsidiary of The Software Works, Inc, USA, we are committed to establishing and promoting best practices in the field of Information Technology. Our organization values its workforce, strives to deliver exceptional value in...


  • Vancouver, British Columbia, Canada tsworks Full time

    Company Overviewtsworks Canada, Inc is a leading provider of technology products and services, headquartered in Ontario, Canada. As a subsidiary of The Software Works, Inc, USA, we are committed to establishing and promoting best practices in the field of Information Technology. Our organization values its workforce, strives to deliver exceptional value in...


  • Vancouver, British Columbia, Canada Microsoft Canada Full time

    **Software Engineer II for Accelerator Technologies** We are seeking a skilled Software Engineer II to join our team responsible for developing kernel components that power graphics and compute device support in Windows. Accelerators like Graphic's Processing Units (GPUs) and Neural Processing Units (NPUs) are crucial for various technologies, including...


  • Vancouver, British Columbia, Canada tsworks Full time

    Company Overviewtsworks Canada, Inc is a leading provider of technology products and services located in Ontario, Canada. As a subsidiary of The Software Works, Inc, USA, we are committed to adopting and establishing best practices in the field of Information Technology. At tsworks Canada Inc, we prioritize our employees, strive to deliver exceptional value...


  • Vancouver, British Columbia, Canada tsworks Full time

    Company Overviewtsworks Canada, Inc is a leading provider of technology products and services, headquartered in Ontario, Canada. As a subsidiary of The Software Works, Inc, USA, we are committed to setting and upholding the highest standards in Information Technology. At tsworks Canada Inc, we prioritize our workforce, strive to deliver exceptional value in...


  • Vancouver, British Columbia, Canada tsworks Full time

    Company Overviewtsworks Canada, Inc is a leading provider of technology products and services located in Ontario, Canada. As a subsidiary of The Software Works, Inc, USA, we are committed to adopting and establishing best practices in the field of Information Technology. At tsworks Canada Inc, we prioritize our employees, strive to deliver exceptional value...


  • Vancouver, British Columbia, Canada tsworks Full time

    Company Overviewtsworks Canada, Inc is a leading provider of technology products and services, headquartered in Ontario, Canada. As a subsidiary of The Software Works, Inc, USA, we are committed to setting and upholding the highest standards in Information Technology. At tsworks Canada Inc, we prioritize our workforce, strive to deliver exceptional value in...