Reliability Engineering Specialist
4 weeks ago
Soho Square Solutions is seeking a highly skilled Reliability Engineering Specialist to join our team.
As a key member of our Application Infrastructure department, you will be responsible for driving reliability engineering, operations, and customer support services for our ServiceNow SaaS implementation.
This role involves delivering SRE practices within a global community of engineers, with a focus on implementing ServiceNow Software as a Service that supports IT service management and integrates with technologies like chatbots, on-call escalation, incident management, SQL databases, APIs, and web infrastructure.
You will combine development, process improvement, and production-side operational responsibilities, including occasional participation in on-call rotations.
We welcome candidates from diverse backgrounds who are passionate about reliability and resilience principles.
Key Responsibilities:- Optimize System Reliability:
- Drive improvements to maximize system availability and performance by automating operational tasks, developing tools, managing technical debt, and participating in architecture reviews.
- ServiceNow and Infrastructure Support:
- Troubleshoot ServiceNow issues and related on-premise capabilities in a Linux environment, collaborating to identify root causes and implement lasting improvements.
- Observability and Monitoring:
- Design and deliver solutions for metrics, logging, tracing, and alerting to measure and improve system reliability.
- On-Call Support:
- Participate in a global on-call rotation, ensuring dependability and responsiveness during agreed hours, with time-off in lieu for on-call duties.
- Documentation and Knowledge Sharing:
- Contribute to and maintain thorough documentation of the ServiceNow environment and its dependencies.
- Technical Debt Management:
- Identify and prioritize technical debt impacting client satisfaction and operational efficiency.
- Process Feedback:
- Provide input on policies and procedures to enhance SRE practices, operational efficiency, and system safety.
We estimate the salary range for this role to be around $120,000 - $180,000 per year, depending on location and experience.
-
Reliability Engineer Specialist
3 weeks ago
Montreal, Canada Soho Square Solutions Full timeSoho Square Solutions is seeking a skilled Reliability Engineer Specialist to drive reliability engineering, operations, and customer support services for our ServiceNow SaaS implementation. As a key member of the Application Infrastructure team, you will report to the Site Reliability Engineering & Operations Lead and work closely with a global community of...
-
Reliability Engineering Specialist
4 weeks ago
Montreal, Quebec, Canada LanceSoft, Inc. Full timeAt LanceSoft, Inc., we are seeking a skilled Reliability Engineering Specialist to join our team in Montreal. This is a hybrid role that requires working 3 days on-site and the rest of the time remotely.The successful candidate will have at least 2 years of experience in Systems Reliability Engineering (SRE) and will be responsible for improving system...
-
Cloud Reliability Engineering Specialist
4 weeks ago
Montreal, Quebec, Canada SAP SE Full timeWe enable innovation at SAP, and we need your expertise to make our platform more reliable. Our focus is on delivering a seamless experience for our customers, and we're looking for someone to join our team in ensuring the high availability of our cloud services.The Site Reliability Engineering organization provides critical support for operations and...
-
Reliability Engineering Specialist
3 weeks ago
Montreal, Canada Soho Square Solutions Full timeWe are seeking a highly skilled Reliability Engineering Specialist to join our team at Soho Square Solutions. This role is focused on ensuring the reliability and performance of our ServiceNow SaaS implementation.The ideal candidate will have a strong background in software development, infrastructure, or system administration and be passionate about...
-
Reliability Engineer
4 weeks ago
Montreal, Quebec, Canada Soho Square Solutions Full timeSoho Square Solutions is a leading provider of ServiceNow solutions, and we are seeking a highly skilled Reliability Engineer - Software Infrastructure Specialist to join our team.The estimated salary for this position is $120,000 - $180,000 per year, depending on experience.About the JobWe are looking for an experienced engineer who can drive reliability...
-
Senior Software Engineer
4 weeks ago
Montreal, Quebec, Canada Capgemini Engineering Full timeAbout Capgemini EngineeringCapgemini is a global leader in digital transformation and technology services, helping organizations unlock the value of technology to address their entire breadth of business needs.Job OverviewWe are seeking an experienced Senior Software Engineer to join our team as a Data Coding Specialist. As a key member of our engineering...
-
Reliability Management Specialist
4 weeks ago
Montreal, Canada Phase Consulting Full timeJob Title: Reliability Management SpecialistAbout the Role:This is an exciting opportunity for a seasoned Reliability Advisor to join Phase Consulting's asset management team in Quebec Operations. As a Reliability Management Specialist, you will play a key role in strengthening reliability practices and enhancing asset integrity across operational units.Key...
-
Reliability Engineer
1 month ago
Montreal, Quebec, Canada National Bank Full timeWe are seeking a skilled Reliability Engineer to join our team at National Bank. As a specialist in reliability, efficiency, and performance of systems, you will play a critical role in ensuring the stability and scalability of our applications.Key ResponsibilitiesPromote and implement best practices for resilience and stability within teamsSupport and...
-
AWS Site Reliability Engineer
4 months ago
Montreal, Canada Alltech Consulting Services Full timeJob Description Level 4 The Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to help drive the reliability engineering, operations, and customer support services for Company’s ServiceNow SaaS implementation. Reporting to a Site Reliability Engineering & Operations Lead, this role requires delivering a range of SRE...
-
Site Reliability Engineer
4 weeks ago
Montreal, Canada Soho Square Solutions Full timeSite Reliability Engineer (SRE) - ServiceNow, Application InfrastructureThe Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to drive reliability engineering, operations, and customer support services for a ServiceNow SaaS implementation. Reporting to a Site Reliability Engineering & Operations Lead, this role involves...
-
Site Reliability Engineer
4 weeks ago
Montreal, Canada Soho Square Solutions Full timeSite Reliability Engineer (SRE) - ServiceNow, Application InfrastructureThe Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to drive reliability engineering, operations, and customer support services for a ServiceNow SaaS implementation. Reporting to a Site Reliability Engineering & Operations Lead, this role involves...
-
Site Reliability Engineer
2 weeks ago
Montreal, Canada Soho Square Solutions Full timeSite Reliability Engineer (SRE) - ServiceNow, Application InfrastructureThe Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to drive reliability engineering, operations, and customer support services for a ServiceNow SaaS implementation. Reporting to a Site Reliability Engineering & Operations Lead, this role involves...
-
Systems Reliability Engineer
6 hours ago
Montreal, Canada Axelon Services Corporation Full timeSystems Reliability Engineer12 Months ContractLocation : Montreal Looking for role in production support team. Scripting knowledge – UNIX or shell or python Relational databases Grafana or Prometheus is an added advantage. 2 rounds – Zoom and Onsite. Application support role. 2-5 years of experience. Sometimes need to work on rotational basis only on...
-
Reliability Engineering Expert
4 weeks ago
Montreal, Canada Soho Square Solutions Full timeAbout UsSoho Square Solutions is a leading provider of innovative solutions for businesses, aiming to deliver high-quality services that meet our clients' needs.Job Title: Site Reliability EngineerWe are seeking a skilled and experienced Site Reliability Engineer to join our team. As a key member of our Application Infrastructure department, you will be...
-
Cloud Platform Reliability Specialist
4 weeks ago
Montreal, Quebec, Canada SAP SE Full timeWe are a leading provider of cloud-based solutions, and we're looking for a skilled specialist to join our team.As a Cloud Platform Reliability Specialist, you will be responsible for ensuring the smooth operation of our cloud services. This includes analyzing system metrics, identifying areas for improvement, and implementing changes to increase reliability...
-
Site Reliability Engineer
7 months ago
Montreal, Canada Lyft Full timeAt Lyft, our mission is to improve people’s lives with the world’s best transportation. Imagine cities where streets are safe, communities thrive, and personal cars are a thing of the past. We envision a future where shared and active transportation modes are the norm, fostering vibrant, connected neighborhoods.As a leader in micromobility, Lyft powers...
-
Site Reliability Engineer
1 month ago
Montreal, Quebec, Québec, Canada Soho Square Solutions Full timeSite Reliability Engineer (SRE) - ServiceNow, Application InfrastructureThe Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to drive reliability engineering, operations, and customer support services for a ServiceNow SaaS implementation. Reporting to a Site Reliability Engineering & Operations Lead, this role involves...
-
Senior Software Architect in Cloud Engineering
4 weeks ago
Montreal, Quebec, Canada Capgemini Engineering Full timeAbout the RoleCapgemini Engineering is seeking a skilled Senior Software Architect to join our team in Canada. As a key member of our cloud engineering team, you will be responsible for designing and implementing scalable and reliable cloud-based solutions.This role offers a unique opportunity to work with cutting-edge technologies and collaborate with a...
-
Site Reliability Engineer
4 weeks ago
Montreal, Canada LanceSoft, Inc. Full timeLocation : Montreal (Hybrid 3 days)Duration: 12+ MonthsJob ProfileSystems Reliability Engineering (SRE) is a discipline focused on improving system service availability, observability, scalability, performance, and resilience across *** by applying sound software engineering principles and adopting the latest technology and tooling.Responsibilities:Are...
-
Site Reliability Engineer
4 weeks ago
Montreal, Canada LanceSoft, Inc. Full timeLocation : Montreal (Hybrid 3 days)Duration: 12+ MonthsJob ProfileSystems Reliability Engineering (SRE) is a discipline focused on improving system service availability, observability, scalability, performance, and resilience across *** by applying sound software engineering principles and adopting the latest technology and tooling.Responsibilities:Are...