Reliability Production Engineer

2 weeks ago


Montreal, Canada Compunnel, Inc. Full time

The Reliability Production Engineer (RPE) plays a critical role in providing production support services within the RPE organization. This role involves developing automation and tooling to support Site Reliability Engineering (SRE) activities, with a focus on improving system reliability and supportability—such as reducing manual toil, optimizing monitoring, and enhancing alerting efficiency. The RPE collaborates with global teams to maintain and improve production systems. Key Responsibilities Provide production support for in-scope systems under the RPE organization Develop automation and tooling to improve reliability and reduce manual tasks Monitor databases and perform performance tuning across platforms including DB2, Greenplum, MongoDB, and Snowflake Create and maintain database scripts (stored procedures, complex SQL) for data analysis and operations Develop and maintain Python and Linux Shell scripts for operational support Troubleshoot containerized environments using Docker and Kubernetes Analyze system metrics and trends using observability tools Collaborate effectively with global teams and communicate clearly in both verbal and written forms Support shift work and participate in on-call rotations to ensure continuous system availability Comply with current policies requiring a minimum of three days working from the office weekly Required Qualifications Bachelor’s degree in Computer Science or related field 4–5 years of experience with database scripting, monitoring, and performance tuning (DB2, Greenplum, MongoDB, Snowflake) Proficiency in Linux operating systems Experience with Python and Linux Shell scripting Hands-on experience with Docker and Kubernetes, including troubleshooting and observability stack tools Strong verbal and written communication skills for global collaboration Flexibility to work shifts and fulfill on-call responsibilities Preferred Qualifications Experience in financial services or investment banking environments Familiarity with advanced monitoring and alerting tools such as Splunk, AppDynamics, or Elastic Search Knowledge of development tools including GIT and Jenkins Agile, DevOps, or SRE mindset and related tooling experience Understanding of cloud technologies and their applications in reliability engineering Certifications (if any) No specific certifications required, though relevant certifications in DevOps, Cloud, or SRE are a plus Email ID * This field is required Please enter valid emailId.Cell phone * This field is required Please enter valid cell phone. First Name * This field is required Please enter valid first name. Last Name * This field is required Please enter valid last name. #J-18808-Ljbffr



  • Montreal, Canada Compunnel, Inc. Full time

    The Reliability Production Engineer (RPE) plays a critical role in providing production support services within the RPE organization. This role involves developing automation and tooling to support Site Reliability Engineering (SRE) activities, with a focus on improving system reliability and supportability—such as reducing manual toil, optimizing...


  • Montreal, Canada Compunnel, Inc. Full time

    The Reliability Production Engineer (RPE) plays a critical role in providing production support services within the RPE organization. This role involves developing automation and tooling to support Site Reliability Engineering (SRE) activities, with a focus on improving system reliability and supportability—such as reducing manual toil, optimizing...


  • Montreal, Canada ApTask Full time

    Direct message the job poster from ApTask Looking for an intermediate between 2 to 5 years' experience. The Application Infrastructure (Al) department is seeking a Site Reliability Engineer (SRE) to help drive the reliability engineering, operations and customer support services clients ServiceNow SaaS implementation. Reporting to a Site Reliability...


  • Montreal, Canada ApTask Full time

    Direct message the job poster from ApTaskLooking for an intermediate between 2 to 5 years' experience.The Application Infrastructure (Al) department is seeking a Site Reliability Engineer (SRE) to help drive the reliabilityengineering, operations and customer support services clients ServiceNow SaaS implementation.Reporting to a Site Reliability Engineering...


  • Montreal, Quebec, Canada Noramtec Consultants Inc. Full time

    A major global financial services institution is partnering with us to hire aSite Reliability Engineer (SRE)for their growing Montreal-based Application Infrastructure team. This pivotal role will focus onensuring the reliability, performance, and operational stability of enterprise applications, with a primary emphasis onServiceNow SaaS implementations and...


  • Montreal, Canada Open Systems Technologies Full time

    Site Reliability Engineer (SRE), ServiceNow, Application Infrastructure Location: Montreal – Hybrid – 3 days/week The Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to help drive reliability engineering, operations and customer support services for client’s ServiceNow SaaS implementation. Reporting to a Site...


  • Montreal, Canada Open Systems Technologies Full time

    Site Reliability Engineer (SRE), ServiceNow, Application Infrastructure Location: Montreal – Hybrid – 3 days/week The Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to help drive reliability engineering, operations and customer support services for client’s ServiceNow SaaS implementation. Reporting to a Site...


  • Montreal, Canada Compunnel, Inc. Full time

    We are seeking a Site Reliability Engineer (SRE) to support and enhance the reliability engineering, operations, and customer support for our ServiceNow SaaS platform. This is a hybrid role combining automation, process improvement, and production support with a strong emphasis on building and maintaining reliable and scalable systems. As part of a global...


  • Montreal, Canada Compunnel, Inc. Full time

    We are seeking a Site Reliability Engineer (SRE) to support and enhance the reliability engineering, operations, and customer support for our ServiceNow SaaS platform. This is a hybrid role combining automation, process improvement, and production support with a strong emphasis on building and maintaining reliable and scalable systems. As part of a global...


  • Montreal, Canada Compunnel, Inc. Full time

    We are seeking a Site Reliability Engineer (SRE) to support and enhance the reliability engineering, operations, and customer support for our ServiceNow SaaS platform. This is a hybrid role combining automation, process improvement, and production support with a strong emphasis on building and maintaining reliable and scalable systems. As part of a global...