Reliability Production Engineer
5 days ago
The Reliability Production Engineer (RPE) plays a critical role in providing production support services within the RPE organization. This role involves developing automation and tooling to support Site Reliability Engineering (SRE) activities, with a focus on improving system reliability and supportability—such as reducing manual toil, optimizing monitoring, and enhancing alerting efficiency. The RPE collaborates with global teams to maintain and improve production systems.Key ResponsibilitiesProvide production support for in-scope systems under the RPE organizationDevelop automation and tooling to improve reliability and reduce manual tasksMonitor databases and perform performance tuning across platforms including DB2, Greenplum, MongoDB, and SnowflakeCreate and maintain database scripts (stored procedures, complex SQL) for data analysis and operationsDevelop and maintain Python and Linux Shell scripts for operational supportTroubleshoot containerized environments using Docker and KubernetesAnalyze system metrics and trends using observability toolsCollaborate effectively with global teams and communicate clearly in both verbal and written formsSupport shift work and participate in on-call rotations to ensure continuous system availabilityComply with current policies requiring a minimum of three days working from the office weeklyRequired QualificationsBachelor’s degree in Computer Science or related field4–5 years of experience with database scripting, monitoring, and performance tuning (DB2, Greenplum, MongoDB, Snowflake)Proficiency in Linux operating systemsExperience with Python and Linux Shell scriptingHands-on experience with Docker and Kubernetes, including troubleshooting and observability stack toolsStrong verbal and written communication skills for global collaborationFlexibility to work shifts and fulfill on-call responsibilitiesPreferred QualificationsExperience in financial services or investment banking environmentsFamiliarity with advanced monitoring and alerting tools such as Splunk, AppDynamics, or Elastic SearchKnowledge of development tools including GIT and JenkinsAgile, DevOps, or SRE mindset and related tooling experienceUnderstanding of cloud technologies and their applications in reliability engineeringCertifications (if any)No specific certifications required, though relevant certifications in DevOps, Cloud, or SRE are a plusEmail ID * This field is required Please enter valid emailId.Cell phone * This field is required Please enter valid cell phone.First Name * This field is required Please enter valid first name.Last Name * This field is required Please enter valid last name. #J-18808-Ljbffr
-
Reliability Production Engineer
3 days ago
Montreal, Canada Compunnel, Inc. Full timeThe Reliability Production Engineer (RPE) plays a critical role in providing production support services within the RPE organization. This role involves developing automation and tooling to support Site Reliability Engineering (SRE) activities, with a focus on improving system reliability and supportability—such as reducing manual toil, optimizing...
-
Reliability Production Engineer
1 day ago
Montreal, Canada Compunnel, Inc. Full timeThe Reliability Production Engineer (RPE) plays a critical role in providing production support services within the RPE organization. This role involves developing automation and tooling to support Site Reliability Engineering (SRE) activities, with a focus on improving system reliability and supportability—such as reducing manual toil, optimizing...
-
Site Reliability Engineer
3 days ago
Montreal, Canada ApTask Full timeDirect message the job poster from ApTask Looking for an intermediate between 2 to 5 years' experience. The Application Infrastructure (Al) department is seeking a Site Reliability Engineer (SRE) to help drive the reliability engineering, operations and customer support services clients ServiceNow SaaS implementation. Reporting to a Site Reliability...
-
Site Reliability Engineer
5 days ago
Montreal, Canada ApTask Full timeDirect message the job poster from ApTaskLooking for an intermediate between 2 to 5 years' experience.The Application Infrastructure (Al) department is seeking a Site Reliability Engineer (SRE) to help drive the reliabilityengineering, operations and customer support services clients ServiceNow SaaS implementation.Reporting to a Site Reliability Engineering...
-
Site Reliability Engineer
2 weeks ago
Montreal, Canada Open Systems Technologies Full timeSite Reliability Engineer (SRE), ServiceNow, Application Infrastructure Location: Montreal – Hybrid – 3 days/week The Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to help drive reliability engineering, operations and customer support services for client’s ServiceNow SaaS implementation. Reporting to a Site...
-
Site Reliability Engineer
2 weeks ago
Montreal, Canada Open Systems Technologies Full timeSite Reliability Engineer (SRE), ServiceNow, Application Infrastructure Location: Montreal – Hybrid – 3 days/week The Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to help drive reliability engineering, operations and customer support services for client’s ServiceNow SaaS implementation. Reporting to a Site...
-
Site Reliability Engineer
6 days ago
Montreal, Quebec, Canada Open Systems Technologies Full timeJob Title: Site Reliability EngineerLocation: Montreal – Hybrid – 3 days/weekTerm: 12 months contract plus extensionThe Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to help drive reliability engineering, operations and customer support services for client's ServiceNow SaaS implementation. Reporting to a Site...
-
Site Reliability Engineer
5 days ago
Montreal, Canada Compunnel, Inc. Full timeWe are seeking a Site Reliability Engineer (SRE) to support and enhance the reliability engineering, operations, and customer support for our ServiceNow SaaS platform. This is a hybrid role combining automation, process improvement, and production support with a strong emphasis on building and maintaining reliable and scalable systems. As part of a global...
-
Site Reliability Engineer
3 days ago
Montreal, Canada Compunnel, Inc. Full timeWe are seeking a Site Reliability Engineer (SRE) to support and enhance the reliability engineering, operations, and customer support for our ServiceNow SaaS platform. This is a hybrid role combining automation, process improvement, and production support with a strong emphasis on building and maintaining reliable and scalable systems. As part of a global...
-
Site Reliability Engineer
3 days ago
Montreal, Canada Compunnel, Inc. Full timeWe are seeking a Site Reliability Engineer (SRE) to support and enhance the reliability engineering, operations, and customer support for our ServiceNow SaaS platform. This is a hybrid role combining automation, process improvement, and production support with a strong emphasis on building and maintaining reliable and scalable systems. As part of a global...