Infrastructure Reliability Engineer II

1 month ago


Vancouver, British Columbia, Canada | Microsoft | Full time

Overview

Are you passionate about tackling large-scale engineering challenges within one of the most dynamic divisions at Microsoft? Do you thrive on innovative projects that yield immediate impact, particularly for product engineers in Office and M365? If you are eager to play a pivotal role in enhancing the productivity of engineering teams, this opportunity is tailored for you.

The Engineering Systems 365 (ES365) team is responsible for the comprehensive developer experience in Office and M365 (Substrate), overseeing everything from source control to deployment automation. We are embarking on transformative initiatives to streamline app development across various platforms, transitioning from proprietary tools to open-source and industry-standard solutions. This is an exciting period as we harness AI to redefine productivity.

We are in search of a Site Reliability Engineer II (SRE) to join our Infrastructure teams. The responsibilities of these teams encompass:

  • Azure management and governance
  • Ensuring business continuity
  • Infrastructure as Code practices
  • Network engineering
  • Service provisioning and deployment
  • Security and vulnerability oversight
  • Systems state management

As a new SRE, you will deliver innovative solutions using a modern DevOps approach, leveraging the extensive technology stack that Microsoft provides. Your contributions will enable our organization to adapt more effectively to changing customer requirements and market trends, while also driving cost efficiencies and reducing redundant efforts through automation.

Microsoft's mission is to empower every individual and organization globally to achieve more. Our employees unite with a growth mindset, innovate to uplift others, and collaborate to reach shared objectives. We uphold values of respect, integrity, and accountability, fostering a culture of inclusion where everyone can excel.

Qualifications

Required Qualifications

  • A minimum of 4 years of technical experience in software engineering, network engineering, or systems administration.
  • Alternatively, a Bachelor's Degree in Computer Science, Information Technology, or a related field along with at least 1 year of relevant technical experience.
  • A Master's Degree in Computer Science, Information Technology, or a related field is also acceptable.

Other Requirements

Candidates must meet Microsoft's security screening requirements, which include a Microsoft Cloud Background Check upon hiring and every two years thereafter.

Preferred Qualifications

  • At least 5 years of technical experience in software engineering, network engineering, or systems administration, or a Bachelor's Degree in a related field with 2+ years of relevant experience.
  • Proficiency in full-stack troubleshooting across network, application, hardware, and distributed services layers.
  • Experience in documenting complex systems to effectively communicate technical concepts across teams.
  • Familiarity with implementing and managing Service Level Objectives (SLOs) and Service Level Indicators (SLIs) for production services.
  • Knowledge of automation tools or frameworks (e.g., Terraform, ARM, Chef, Bicep) and scripting languages (e.g., Python, Bash, PowerShell).

Responsibilities

  • Participate in onboarding, code/design reviews, and regular interactions with engineering teams responsible for product development and management.
  • Independently create code or scripts to automate repetitive and scalable operational processes.
  • Design, develop, and maintain telemetry pipelines and monitoring tools to track operational metrics.
  • Develop, test, troubleshoot, and implement enhancements to optimize code and improve products.
  • Respond to incidents during scheduled on-call rotations.
  • Author technical documentation for tools and services.
  • Engage in post-incident reviews to drive service enhancements.

Benefits and perks may vary based on employment nature and location.


