Reliability Engineering Expert

4 weeks ago


Montreal, Canada | Soho Square Solutions | Full time
About Us

Soho Square Solutions is a leading provider of innovative solutions for businesses, aiming to deliver high-quality services that meet our clients' needs.

Job Title: Site Reliability Engineer

We are seeking a skilled and experienced Site Reliability Engineer to join our team. As a key member of our Application Infrastructure department, you will be responsible for driving reliability engineering, operations, and customer support services for our ServiceNow SaaS implementation.

Estimated Salary: $120,000 per year

This role offers a competitive salary and a range of benefits, including a comprehensive health insurance package, retirement plan, and generous paid time off.

Key Responsibilities:
  • Optimize System Reliability: Drive improvements to maximize system availability and performance by automating operational tasks, developing tools, managing technical debt, and participating in architecture reviews.
  • ServiceNow and Infrastructure Support: Troubleshoot ServiceNow issues and related on-premises capabilities in a Linux environment, collaborating to identify root causes and implement lasting improvements.
  • Observability and Monitoring: Design and deliver solutions for metrics, logging, tracing, and alerting to measure and improve system reliability.
  • On-Call Support: Participate in a global on-call rotation, ensuring dependability and responsiveness during agreed hours, with time off in lieu for on-call duties.
  • Documentation and Knowledge Sharing: Contribute to and maintain thorough documentation of the ServiceNow environment and its dependencies.
  • Technical Debt Management: Identify and prioritize technical debt impacting client satisfaction and operational efficiency.
  • Process Feedback: Provide input on policies and procedures to enhance SRE practices, operational efficiency, and system safety.
Required Skills:
  • ServiceNow Expertise: Experience in ServiceNow administration or development (preferred but not mandatory; on-the-job training available).
  • Programming Skills: Proficiency in at least one programming language (e.g., Python).
  • Communication and Collaboration: Strong verbal and written communication skills, with the ability to build effective relationships with global teams.
  • Problem Solving: Ability to respond to technical emergencies, troubleshoot effectively, and implement sustainable solutions.
  • Teamwork and Dependability: A committed team player with a client-focused approach.

In return for your expertise, we offer a dynamic and supportive work environment, opportunities for professional growth and development, and a chance to be part of a talented team that is shaping the future of technology.


