Technical Site Reliability Engineering

4 days ago


Montreal, Canada Ubisoft Full time
Job Description

As a Technical Site Reliability Engineering (SRE) Lead within Ubisoft’s IT department, you will manage a team of SREs to ensure the reliability, scalability, and performance of our IT platform. You will play a pivotal role in shaping the architecture and operations of our cloud-native infrastructure, with a strong focus on automation and large-scale system management.

Responsibilities:

  • Leadership: manage and mentor a team of SREs, fostering a culture of continuous learning and improvement.
  • Design and Development: Oversee the design and development of tools and solutions for the smooth operation of the Kubernetes environments.
  • Maintenance and Operation: Ensure the maintenance and operation of various components of the Ubisoft IT Platform, emphasizing documented and automated installation and support procedures.
  • Continuous Improvement: Drive enhancements in continuous integration and delivery systems, ensuring they meet the highest standards of reliability and performance.
  • Collaboration: Collaborate closely with Developer teams to assess their needs and ensure the platform is designed for operability and ease of use.
  • Advocate: Advocate for the use of Kubernetes and other cloud-native technologies within Ubisoft.
  • Evaluation: steer the evaluation of new requirements, technical designs, and standards to ensure they align with best practices and organizational goals.
  • Strategic Planning: Contribute to strategic planning and decision-making processes to guide the future direction of the platform.Qualifications

This role involves on-call.* 


Qualifications

  • Expertise in cloud-native architectures, Kubernetes (e.g., CRD, CNI, admission controllers), and Linux systems.
  • Strong CI/CD capabilities with tools like GitLab CI and ArgoCD, plus experience with public cloud providers (Azure, AWS, GCP).
  • Proficient in scripting or development (preferably Go and/or Python) and infrastructure automation with Terraform.
  • Advanced understanding of Linux networking, system configuration, and network administration.
  • Effective collaboration skills, including experience working with remote teams.

Bonus:

  • Familiarity with OpenStack, Docker, Flask, OPA, and other DevOps tools.
  • Previous leadership experience managing large-scale production systems.


Additional Information

Just a heads up: If you require a work permit, your eligibility may depend on your education and years of relevant work experience, as required by the government.

Skills and competencies show up in different forms and can be based on different experiences, that is why we strongly encourage you to apply even though you may not have all the requirements listed above.

At Ubisoft, we embrace diversity in all its forms. We’re committed to fostering an inclusive and respectful work environment for all. We know the importance of providing a pleasant interview experience, therefore if you need any accommodation, please let us know if there is anything we can do to facilitate the interview process.



  • Montreal, Canada Ubisoft Entertainment Full time

    h3>Technical Site Reliability Engineering (SRE) LeadFull-timeContract: PermanentFlexible Working Organization: HybridUbisoft’s 19,000 team members, working across more than 30 countries around the world, are bound by a common mission to enrich players’ lives with original and memorable gaming experiences. If you are excited about solving game-changing...


  • Montreal, Canada SAP Full time

    p>We help the world run betterAt SAP, we enable you to bring out your best. Our company culture is focused on collaboration and a shared passion to help the world run better. We focus every day on building the foundation for tomorrow and creating a workplace that embraces differences, values flexibility, and is aligned to our purpose-driven and...


  • Montreal, Canada Axelon Services Corporation Full time

    Job Title: Site Reliability Engineer (SRE), ServiceNow, Application InfrastructureMontreal QC12 MonthsThe manager is looking for candidates with good python skills , Unix , DB skills and SRE skills (Service Now experience is a plus).They mainly received people with DevOps skills and not SRE.The Application Infrastructure (AI) department is seeking a Site...


  • Montreal, Canada Ubisoft Full time

    h3>Job DescriptionAs a Technical Site Reliability Engineering (SRE) Lead within Ubisoft’s IT department, you will manage a team of SREs to ensure the reliability, scalability, and performance of our IT platform. You will play a pivotal role in shaping the architecture and operations of our cloud-native infrastructure, with a strong focus on automation and...


  • Montreal, Quebec, Canada Genpact Full time

    Job Title: Technical Lead, Site Reliability Engineering ExpertEstimated Salary: $150,000 - $200,000 per yearAbout Us:Genpact is a global professional services and solutions firm that delivers outcomes that shape the future. Our purpose is to create a world that works better for people, and we serve leading enterprises with our deep business and industry...


  • Montreal, Canada SAP SE Full time

    We help the world run better At SAP, we enable you to bring out your best. Our company culture is focused on collaboration and a shared passion to help the world run better. How? We focus every day on building the foundation for tomorrow and creating a workplace that embraces differences, values flexibility, and is aligned to our purpose-driven and...


  • Montreal, Quebec, Canada Axelon Services Corporation Full time

    SRE Role SummaryAs part of the Application Infrastructure team at Axelon Services Corporation, we are looking for an exceptional Site Reliability Engineer to join our global community. This role will focus on driving reliability engineering, operations, and customer support services for our ServiceNow SaaS implementation.


  • Montreal, Canada Alltech Consulting Services Full time

    Job Description Level 4 The Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to help drive the reliability engineering, operations, and customer support services for Company’s ServiceNow SaaS implementation. Reporting to a Site Reliability Engineering & Operations Lead, this role requires delivering a range of SRE...


  • Montreal, Canada Lyft Full time

    At Lyft, our mission is to improve people’s lives with the world’s best transportation. Imagine cities where streets are safe, communities thrive, and personal cars are a thing of the past. We envision a future where shared and active transportation modes are the norm, fostering vibrant, connected neighborhoods.As a leader in micromobility, Lyft powers...


  • Montreal, Canada SAP SE Full time

    p>We help the world run betterAt SAP, we enable you to bring out your best. Our company culture is focused on collaboration and a shared passion to help the world run better. We focus every day on building the foundation for tomorrow and creating a workplace that embraces differences, values flexibility, and is aligned to our purpose-driven and...


  • Montreal, Canada Hamilton Barnes Associates Limited Full time

    p>Hamilton Barnes is currently representing a major vehicle manufacturer that is actively seeking a Site Reliability Engineer for an initial 6-month contract with the possibility of extension.This position has on site commitments 2/3 Days Per Week in Gaydon.Build software and systems to manage platform infrastructure and applicationsProvide primary...


  • Montreal, Quebec, Canada SAP Full time

    We empower innovation by investing in the development of our diverse employees. Our company culture is built on collaboration and a shared passion to help the world run better. We focus on building the foundation for tomorrow and creating a workplace that values flexibility, diversity, and a purpose-driven approach.The Reliability Engineering organization...


  • Montreal, Canada Axelon Services Corporation Full time

    Job Title: Site Reliability Engineer (SRE), ServiceNow, Application Infrastructure Montreal QC 12 Months The ideal candidate would have at least one of:oServiceNow administration or development experience, or Software development skills in one or more programming language, e.g. Python The Application Infrastructure (AI) department is seeking a Site...


  • Montreal, Canada National Bank Full time

    As a Specialist in site reliability engineering on the National Bank Data Protection team, you will ensure the operational reliability of data protection assets. With your experience and knowledge in the operational management of high-availability assets (HA), you will have a positive impact on the Bank's stability and reputation with its internal and...


  • Montreal, Canada National Bank Full time

    As a Specialist in site reliability engineering on the National Bank Data Protection team, you will ensure the operational reliability of data protection assets. With your experience and knowledge in the operational management of high-availability assets (HA), you will have a positive impact on the Bank's stability and reputation with its internal and...


  • Montreal, Canada National Bank Full time

    As a Specialist in site reliability engineering on the National Bank Data Protection team, you will ensure the operational reliability of data protection assets. With your experience and knowledge in the operational management of high-availability assets (HA), you will have a positive impact on the Bank's stability and reputation with its internal and...


  • Montreal, Quebec, Canada National Bank Full time

    Welcome to this exciting opportunity to join the National Bank team as a Senior Site Reliability Engineer. With a strong background in IT and experience in online services development, you will be responsible for promoting resilience and stability with teams, supporting the creation of reliable and scalable systems, and automating repetitive tasks.About the...


  • Montreal, Canada National Bank Full time

    As a Specialist in site reliability engineering on the National Bank Data Protection team, you will ensure the operational reliability of data protection assets. With your experience and knowledge in the operational management of high-availability assets (HA), you will have a positive impact on the Bank's stability and reputation with its internal and...


  • Montreal, Canada National Bank Full time

    As a Specialist in site reliability engineering on the National Bank Data Protection team, you will ensure the operational reliability of data protection assets. With your experience and knowledge in the operational management of high-availability assets (HA), you will have a positive impact on the Bank's stability and reputation with its internal and...

  • Reliability Engineer

    3 weeks ago


    Montreal, Quebec, Canada Lyft Full time

    Job OpeningLyft is seeking a Site Reliability Engineer to support our production systems and platforms.Job Summary: Assist in defining the team's roadmap and architecture based on technology and business needs.Key Responsibilities:Design and implement effective infrastructure abstractions that increase the velocity of our application teams.Be responsible for...