Site Reliability Engineer

3 weeks ago


Montreal, Canada Compunnel, Inc. Full time

Client is seeking an experienced Site Reliability Engineer (SRE) to support and enhance the reliability, performance, and operational efficiency of our global ServiceNow SaaS platform. As part of the Application Infrastructure (AI) team, you will be instrumental in advancing SRE practices, ensuring seamless integration and stability across on-premise infrastructure and cloud systems. This role combines software development, automation, systems engineering, and operations in a highly collaborative environment. This is a hybrid role with both development-focused and production operational responsibilities, including periodic on-call participation. Key Responsibilities Drive automation and reliability improvements to reduce operational overhead and increase system availability Troubleshoot ServiceNow issues and occasionally resolve Linux-based infrastructure problems Develop and maintain observability tools including metrics, logging, tracing, and alerting to track and enhance system health and performance Collaborate with global SRE peers to deliver reliable and resilient ServiceNow capabilities Identify, document, and prioritize technical debt and propose long-term solutions to reduce recurring issues Contribute to the design and documentation of the ServiceNow ecosystem, including integrations with SQL databases, APIs, and web platforms Participate in on-call rotation and respond effectively to technical incidents or outages Provide input to policies and procedures with the goal of improving security, efficiency, and operational consistency Champion a culture of continuous improvement, resilience, and operational excellence Required Qualifications Minimum 7+ years of professional experience in software development, system administration, or site reliability engineering Experience in at least one of the following areas: ServiceNow administration or development Strong troubleshooting skills and a proactive approach to problem-solving Familiarity with Linux systems, shell scripting, and general infrastructure support Effective verbal and written communication skills Demonstrated ability to collaborate and build strong working relationships in a team environment Willingness to work in an on-call rotation and respond to critical incidents when needed Preferred Qualificatio nsDirect experience with ServiceNow (administration or development) Exposure to observability tools (e.g., Prometheus, Grafana, ELK, Splunk) Familiarity with DevOps/SRE best practices and tools Experience with infrastructure automation (e.g., Ansible, Terraform) Knowledge of incident management, capacity planning, and monitoring frameworks Certifications (if any) ServiceNow certifications (Administrator, Developer) are a plus but not required Relevant certifications in Linux, DevOps, or SRE disciplines are desirable #J-18808-Ljbffr



  • Montreal, Canada ApTask Full time

    Direct message the job poster from ApTask Looking for an intermediate between 2 to 5 years' experience. The Application Infrastructure (Al) department is seeking a Site Reliability Engineer (SRE) to help drive the reliability engineering, operations and customer support services clients ServiceNow SaaS implementation. Reporting to a Site Reliability...


  • Montreal (administrative region), Canada Noramtec Consultants Inc. Full time

    A major global financial services institution is partnering with us to hire a Site Reliability Engineer (SRE) for their growing Montreal-based Application Infrastructure team. This pivotal role will focus on ensuring the reliability, performance, and operational stability of enterprise applications, with a primary emphasis on ServiceNow SaaS implementations...


  • Montreal, Canada Tecsys Inc. Full time

    Having recognized the advantages of remote work, including employee morale, productivity, reduced commuting on employee wellbeing and the environment, we are proud to be a digital-first company. The technologies and programs in which we invested have provided a fantastic foundation to this end. Our digital-first work environment, together with our...


  • Montreal, Canada Compunnel, Inc. Full time

    We are seeking a Site Reliability Engineer (SRE) to support and enhance the reliability engineering, operations, and customer support for our ServiceNow SaaS platform. This is a hybrid role combining automation, process improvement, and production support with a strong emphasis on building and maintaining reliable and scalable systems. As part of a global...


  • Montreal, Canada Compunnel, Inc. Full time

    We are seeking a Site Reliability Engineer (SRE) to support and enhance the reliability engineering, operations, and customer support for our ServiceNow SaaS platform. This is a hybrid role combining automation, process improvement, and production support with a strong emphasis on building and maintaining reliable and scalable systems. As part of a global...


  • Montreal, Canada Compunnel, Inc. Full time

    Client’s Application Infrastructure (AI) division is seeking a Site Reliability Engineer (SRE) to join the Client Development Environment team. This role is focused on driving reliability, operational efficiency, and support for core development lifecycle tools used by over 17,000 developers across the firm. The ideal candidate will play a critical role in...


  • Montreal, Canada Compunnel, Inc. Full time

    Client’s Application Infrastructure (AI) division is seeking a Site Reliability Engineer (SRE) to join the Client Development Environment team. This role is focused on driving reliability, operational efficiency, and support for core development lifecycle tools used by over 17,000 developers across the firm. The ideal candidate will play a critical role in...


  • Montreal, Canada AKUR8 Full time

    Site Reliability Engineer – AKUR8 – Paris, Île-de-France, France Overview Akur8 is a fast-growing Insurtech scale‑up that transforms insurance pricing and reserving with transparent machine learning. Our SaaS platform injects speed, performance and reliability into insurers’ pricing processes. With teams in 8 global cities and over 320 clients...


  • Montreal, Canada AKUR8 Full time

    Site Reliability Engineer – AKUR8 – Paris, Île-de-France, France Overview Akur8 is a fast-growing Insurtech scale‑up that transforms insurance pricing and reserving with transparent machine learning. Our SaaS platform injects speed, performance and reliability into insurers’ pricing processes. With teams in 8 global cities and over 320 clients...


  • Montreal (administrative region), Canada Canonical Full time

    Site Reliability Engineer Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world's leading public cloud and...