Site Reliability Engineer

1 month ago


Montreal, Canada LanceSoft, Inc. Full time

Location : Montreal (Hybrid 3 days)

Duration: 12+ Months


Job Profile

Systems Reliability Engineering (SRE) is a discipline focused on improving system service availability, observability, scalability, performance, and resilience across *** by applying sound software engineering principles and adopting the latest technology and tooling.


Responsibilities:

  • Are interested in distributed systems and working with highly scalable and reliable services.
  • Like to work in a fast-moving environment and you aren't afraid to change things to make them better.
  • Enjoy new technological challenges and solving hard problems.
  • Believe a team working well together is smarter than the single smartest person on that team.
  • Have grit, drive and a deep sense of ownership.
  • Working closely with engineering/development teams to design, build, and maintain systems.
  • Troubleshooting issues across the entire technology stack: hardware, software, application, and network.
  • Identifying and driving opportunities to improve automation for our platforms; scope and create automation for deployment, management, and visibility of our services.
  • Proactively identifying and addressing systems reliability risks.
  • Working alongside existing global and regional team members on a follow-the-sun basis.
  • Represent the RPE organization in design reviews and operational readiness exercises for new and existing services.

Qualifications - Skill Set

  • Demonstrated ability to troubleshoot problems and debug to identify root cause.
  • Hands on experience on enterprise tools such as AppDynamics, Grafana, Splunk, Dynatrace.
  • Experience with Ansible, GitHub or any automation/configuration/release management tools.
  • Automation-related experience is particularly valued using scripting languages such as python, bash, perl. One higher level language is desired.
  • Awareness of, and ability to reason about modern software and systems architectures, including load-balancing, databases, queueing, caching, distributed systems failure modes, micro services, Cloud, etc.
  • Practical experience running large scale systems is an advantage.
  • Should be able to contribute to system design and architecture with strong database knowledge.


Experience: Intermediate with 2 to 5 years

Top 3 Must have :

1. Strong experience with Python and / or Shell scripting

2. Strong experience with data base (DB2 knowledges is a plus)

3. Strong communication skills. The consultant will work with business users in day to day basis.


Top 2 Nice to have :

1. Good knowledges of Grafana, Prometheus

2. Good experience with debugging



  • Montreal, Canada SAP SE Full time

    We help the world run betterAt SAP, we enable you to bring out your best. Our company culture is focused on collaboration and a shared passion to help the world run better. We offer a highly collaborative, caring team environment with a strong focus on learning and development, recognition for your individual contributions, and a variety of benefit options...


  • Montreal, Quebec, Québec, Canada Soho Square Solutions Full time

    Site Reliability Engineer (SRE) - ServiceNow, Application InfrastructureThe Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to drive reliability engineering, operations, and customer support services for a ServiceNow SaaS implementation. Reporting to a Site Reliability Engineering & Operations Lead, this role involves...


  • Montreal, Canada LanceSoft, Inc. Full time

    Site Reliability Engineer Montreal, Quebec, Canada Hybrid Duration: 12+ months Responsibilities: • Are interested in distributed systems and working with highly scalable and reliable services. • Like to work in a fast-moving environment and you aren't afraid to change things to make them better. • Enjoy new technological challenges and...


  • Montreal, Canada LanceSoft, Inc. Full time

    Site Reliability EngineerMontreal, Quebec, Canada HybridDuration: 12+ monthsResponsibilities: • Are interested in distributed systems and working with highly scalable and reliable services. • Like to work in a fast-moving environment and you aren't afraid to change things to make them better. • Enjoy new technological challenges and solving hard...


  • Montreal, Quebec, Canada Genpact Full time

    Genpact is a dynamic global company that aims to make business process outcomes more impactful and sustainable for our clients. Our mission is to help organizations improve efficiency and productivity, while maintaining the highest level of quality.We are currently looking for an experienced Site Reliability Engineer to join our team in Montreal, Canada. The...


  • Montreal, Canada SAP SE Full time

    p>We help the world run betterAt SAP, we enable you to bring out your best. Our company culture is focused on collaboration and a shared passion to help the world run better. p>The Reliability Engineering organization provides a multitude of products and services related to operations and continuity of business delivery.The Site Reliability Engineering teams...


  • Montreal, Canada Domtar Full time

    Software-Development Operations Site Reliability Engineer We help the world run better At SAP, we enable you to bring out your best. Our company culture is focused on collaboration and a shared passion to help the world run better. How? We focus every day on building the foundation for tomorrow and creating a...


  • Montreal, Canada Zortech Solutions Full time

    Role Name: Cloud Site Reliability Specialist_Montreal Location: Montreal / Hybrid JOB DESCRIPTION: Years of experience : 5+ years Location: Montreal (Office attendance from Day 1 - Hybrid mode) Position Description: The Private Cloud SRE L3 team is part of the Enterprise Computing organization. The team has presence in cities globally and is focused on...


  • Montreal, Canada Zortech Solutions Full time

    Role Name: Cloud Site Reliability Specialist_Montreal Location: Montreal / Hybrid JOB DESCRIPTION: Years of experience : 5+ years Location: Montreal (Office attendance from Day 1 - Hybrid mode) Position Description: The Private Cloud SRE L3 team is part of the Enterprise Computing organization. The team has presence in cities globally and is focused on...


  • Montreal, Canada Zortech Solutions Full time

    Role Name: Cloud Site Reliability Specialist_Montreal Location: Montreal / Hybrid JOB DESCRIPTION: Years of experience : 5+ years Location: Montreal (Office attendance from Day 1 - Hybrid mode) Position Description: The Private Cloud SRE L3 team is part of the Enterprise Computing organization. The team has presence in cities globally and is...


  • Montreal, Canada SAP SE Full time

    p>We help the world run betterAt SAP, we enable you to bring out your best. Our company culture is focused on collaboration and a shared passion to help the world run better. We focus every day on building the foundation for tomorrow and creating a workplace that embraces differences, values flexibility, and is aligned to our purpose-driven and...


  • Montreal, Canada Domtar Full time

    Software-Development OperationsSite Reliability EngineerWe help the world run betterAt SAP, we enable you to bring out your best. Our company culture is focused on collaboration and a shared passion to help the world run better. How? We focus every day on building the foundation for tomorrow and creating a workplace that embraces differences, values...


  • Montreal, Quebec, Canada LanceSoft, Inc. Full time

    Unlock a career as a Site Reliability Engineer at LanceSoft, Inc., a cutting-edge technology company based in Montreal, Quebec, Canada. We are seeking an experienced and highly motivated individual to join our team.Job Type: Full-timeDuration: 12+ monthsCompany OverviewLanceSoft, Inc. is a leading technology firm dedicated to delivering innovative solutions...


  • Montreal, Canada Domtar Full time

    Software-Development Operations Site Reliability Engineer We help the world run better At SAP, we enable you to bring out your best. Our company culture is focused on collaboration and a shared passion to help the world run better. How? We focus every day on building the foundation for tomorrow and creating a workplace that embraces...


  • Montreal, Canada Domtar Full time

    Software-Development Operations Site Reliability Engineer We help the world run better At SAP, we enable you to bring out your best. Our company culture is focused on collaboration and a shared passion to help the world run better. How? We focus every day on building the foundation for tomorrow and creating a workplace that embraces...


  • Montreal, Canada LanceSoft, Inc. Full time

    Location : Montreal (Hybrid 3 days)Duration: 12+ MonthsJob ProfileSystems Reliability Engineering (SRE) is a discipline focused on improving system service availability, observability, scalability, performance, and resilience across *** by applying sound software engineering principles and adopting the latest technology and tooling.Responsibilities:Are...


  • Montreal, Canada LanceSoft, Inc. Full time

    Location : Montreal (Hybrid 3 days)Duration: 12+ MonthsJob ProfileSystems Reliability Engineering (SRE) is a discipline focused on improving system service availability, observability, scalability, performance, and resilience across *** by applying sound software engineering principles and adopting the latest technology and tooling.Responsibilities:Are...


  • Montreal, Canada SAP SE Full time

    p>We help the world run betterAt SAP, we enable you to bring out your best. Our company culture is focused on collaboration and a shared passion to help the world run better. We focus every day on building the foundation for tomorrow and creating a workplace that embraces differences, values flexibility, and is aligned to our purpose-driven and...


  • Montreal, Canada LanceSoft, Inc. Full time

    Location : Montreal (Hybrid 3 days) Duration: 12+ Months Job Profile Systems Reliability Engineering (SRE) is a discipline focused on improving system service availability, observability, scalability, performance, and resilience across *** by applying sound software engineering principles and adopting the latest technology and tooling. Responsibilities: ...


  • Montreal, Quebec, G4F, CA LanceSoft, Inc. Full time

    Site Reliability EngineerMontreal, Quebec, Canada HybridDuration: 12+ monthsResponsibilities: • Are interested in distributed systems and working with highly scalable and reliable services. • Like to work in a fast-moving environment and you aren't afraid to change things to make them better. • Enjoy new technological challenges and solving hard...