Site Reliability Engineer

2 days ago


Montreal, Canada Zortech Solutions Full time

Role Name: Cloud Site Reliability Specialist_Montreal

Location: Montreal / Hybrid

JOB DESCRIPTION:

Years of experience : 5+ years

Location: Montreal (Office attendance from Day 1 - Hybrid mode)

Position Description:

  • The Private Cloud SRE L3 team is part of the Enterprise Computing organization. The team has a presence in cities worldwide and focuses on supporting cloud and container-based platforms for internal and external clients.
  • You will integrate with the global follow-the-sun operations model, which means taking responsibility for the technologies the team supports in your region.
  • Team members frequently interact with engineering teams and collaborate on the testing and certification of software deployed to the platform.
Primary Responsibilities:
  • Provide L3 support for private cloud, including on-call rotation
  • Work closely with the internal engineering team and provide input on testing of new component releases and infrastructure upgrades, as well as performance, capacity, and monitoring
  • Create and improve processes for support, including training, documentation, customer engagement, incident, problem, and change management
  • Contribute to internally developed CLIs and APIs that automate SRE activities and platform operations
  • Work together with L2 teams and other L3 team members internationally.
Required Skills:
  • 5 to 10 years of relevant experience in platform maintenance/development
  • Experience in at least one programming language
  • Experience maintaining complex production systems with cloud and legacy technologies
  • Proven Kubernetes and Docker experience
  • Knowledge of the monitoring stack (Grafana, Prometheus, Splunk)
  • Strong organizational skills and ability to manage multiple tasks and high-pressure situations for outage resolution
  • Ability to communicate effectively with various user groups, e.g. developers and engineers, as well as remote team members
Nice to have:
  • Experience in developing monitoring architecture and implementing monitoring agents and alerts
  • Experience in Golang, React, Kubernetes Operators
  • Knowledge of security protocols, e.g. SSL/TLS, Kerberos



  • Montreal, Canada Soho Square Solutions Full time

    Site Reliability Engineer (SRE) - ServiceNow, Application Infrastructure. The Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to drive reliability engineering, operations, and customer support services for a ServiceNow SaaS implementation. Reporting to a Site Reliability Engineering & Operations Lead, this role involves...


  • Montreal, Quebec, Canada Genpact Full time

    Genpact is a dynamic global company that aims to make business process outcomes more impactful and sustainable for our clients. Our mission is to help organizations improve efficiency and productivity, while maintaining the highest level of quality. We are currently looking for an experienced Site Reliability Engineer to join our team in Montreal, Canada. The...


  • Montreal, Canada LanceSoft, Inc. Full time

    Site Reliability Engineer Montreal, Quebec, Canada Hybrid Duration: 12+ months Responsibilities: • Are interested in distributed systems and working with highly scalable and reliable services. • Like to work in a fast-moving environment and you aren't afraid to change things to make them better. • Enjoy new technological challenges and...


  • Montreal, Canada LanceSoft, Inc. Full time

    Site Reliability Engineer Montreal, Quebec, Canada Hybrid Duration: 12+ months Responsibilities: • Are interested in distributed systems and working with highly scalable and reliable services. • Like to work in a fast-moving environment and you aren't afraid to change things to make them better. • Enjoy new technological challenges and solving hard...


  • Montreal, Canada SAP SE Full time

    We help the world run better. At SAP, we enable you to bring out your best. Our company culture is focused on collaboration and a shared passion to help the world run better. The Reliability Engineering organization provides a multitude of products and services related to operations and continuity of business delivery. The Site Reliability Engineering teams...


  • Montreal, Quebec, Canada LanceSoft, Inc. Full time

    Unlock a career as a Site Reliability Engineer at LanceSoft, Inc., a cutting-edge technology company based in Montreal, Quebec, Canada. We are seeking an experienced and highly motivated individual to join our team. Job Type: Full-time Duration: 12+ months Company Overview LanceSoft, Inc. is a leading technology firm dedicated to delivering innovative solutions...


  • Montreal, Quebec, Canada Royal Bank of Canada Full time

    Transform Your Career with a Leadership Role in Site Reliability Engineering We are seeking an experienced Senior Site Reliability Engineer to join our team at the Royal Bank of Canada. As a key member of our Digital Branch SRE organization, you will play a critical role in developing, implementing, and supporting SRE solutions for applications supported by...


  • Montreal, Canada SAP SE Full time

    We help the world run better. At SAP, we enable you to bring out your best. Our company culture is focused on collaboration and a shared passion to help the world run better. We focus every day on building the foundation for tomorrow and creating a workplace that embraces differences, values flexibility, and is aligned to our purpose-driven and...


  • Montreal, Quebec, G4F, CA LanceSoft, Inc. Full time

    Site Reliability Engineer Montreal, Quebec, Canada Hybrid Duration: 12+ months Responsibilities: • Are interested in distributed systems and working with highly scalable and reliable services. • Like to work in a fast-moving environment and you aren't afraid to change things to make them better. • Enjoy new technological challenges and solving hard...


  • Montreal, Canada LanceSoft, Inc. Full time

    Location: Montreal (Hybrid 3 days) Duration: 12+ Months Job Profile Systems Reliability Engineering (SRE) is a discipline focused on improving system service availability, observability, scalability, performance, and resilience across *** by applying sound software engineering principles and adopting the latest technology and tooling. Responsibilities: Are...


  • Montreal, Canada LanceSoft, Inc. Full time

    Location : Montreal (Hybrid 3 days) Duration: 12+ Months Job Profile Systems Reliability Engineering (SRE) is a discipline focused on improving system service availability, observability, scalability, performance, and resilience across *** by applying sound software engineering principles and adopting the latest technology and tooling. Responsibilities: ...


  • Montreal, Canada Experience AI Solutions Full time

    Senior Systems Administrator Start Date : as soon as possible Type of employment: permanent Location: Montreal, QC (hybrid model for working in the office) Number of Positions: 1 Language skills : Excellent English language skills Perks: Work for a multinational, award winning, socially responsible company with an operational presence in many...


  • Montreal, Canada Experience AI Solutions Full time

    Senior Systems Administrator Start Date: as soon as possible Type of employment: permanent Location: Montreal, QC (hybrid model for working in the office) Number of Positions: 1 Language skills: Excellent English language skills Perks: Work for a multinational, award winning, socially responsible company with an operational presence in many countries, having been...


  • Montreal, Quebec, Québec, Canada LanceSoft, Inc. Full time

    Location: Montreal (Hybrid 3 days) Duration: 12+ Months Job Profile Systems Reliability Engineering (SRE) is a discipline focused on improving system service availability, observability, scalability, performance, and resilience across *** by applying sound software engineering principles and adopting the latest technology and tooling. Responsibilities: Are...