Site Reliability Engineer
1 week ago
Site Reliability Engineer (SRE) - ServiceNow, Application Infrastructure
The Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to drive reliability engineering, operations, and customer support services for a ServiceNow SaaS implementation. Reporting to a Site Reliability Engineering & Operations Lead, this role involves delivering SRE practices within a global community of engineers.
The position focuses on implementing ServiceNow Software as a Service, which supports IT service management and integrates with technologies like chatbots, on-call escalation, incident management, SQL databases, APIs, and web infrastructure. This role combines development, process improvement, and production-side operational responsibilities, including occasional participation in on-call rotations.
We welcome candidates from diverse backgrounds, whether transitioning from development, infrastructure, or system administration, who are passionate about reliability and resilience principles.
Key Responsibilities:
- Optimize System Reliability:
- Drive improvements to maximize system availability and performance by automating operational tasks, developing tools, managing technical debt, and participating in architecture reviews.
- ServiceNow and Infrastructure Support:
- Troubleshoot ServiceNow issues and related on-premises capabilities in a Linux environment, collaborating to identify root causes and implement lasting improvements.
- Observability and Monitoring:
- Design and deliver solutions for metrics, logging, tracing, and alerting to measure and improve system reliability.
- On-Call Support:
- Participate in a global on-call rotation, ensuring dependability and responsiveness during agreed hours, with time off in lieu for on-call duties.
- Documentation and Knowledge Sharing:
- Contribute to and maintain thorough documentation of the ServiceNow environment and its dependencies.
- Technical Debt Management:
- Identify and prioritize technical debt impacting client satisfaction and operational efficiency.
- Process Feedback:
- Provide input on policies and procedures to enhance SRE practices, operational efficiency, and system safety.
Required Skills:
- ServiceNow Expertise:
- Experience in ServiceNow administration or development (preferred but not mandatory; on-the-job training available).
- Programming Skills:
- Proficiency in at least one programming language (e.g., Python).
- Communication and Collaboration:
- Strong verbal and written communication skills, with the ability to build effective relationships with global teams.
- Problem Solving:
- Ability to respond to technical emergencies, troubleshoot effectively, and implement sustainable solutions.
- Teamwork and Dependability:
- A committed team player with a client-focused approach.
Preferred Skills:
- ServiceNow administration or development experience.
- Familiarity with Linux environments and operational troubleshooting.
- Knowledge of observability tools and techniques (metrics, logging, tracing).