Specialist Site Reliability Engineer

25 minutes ago


Montreal administrative region, Canada Global Talent Alliance, Canada Full time

About the job Specialist Site Reliability Engineer(#11072)The role of the Specialist Site Reliability Engineer (SRE) is to execute RAM analysis and engineering in support of the I&T solutions. The overall mandate is to ensure that these solutions have attributes of high robustness, reliability, and availability. This involves system and product analysis, modeling and requirements assessment during the development phase and the analysis of field RAM data to determine solution RAM KPIs and to drive corrective action programs. With the advent of Cloud Computing there is also a need for a RAM specialist that is well versed in Cloud based technologies as well as solution architectures for the cloud.Separate specializations may exist for hardware and software RAM. The technologies used are primarily distributed digital control systems, communication networks, Global Navigation Satellite Systems (GNSS), embedded and virtualized computing as well as Cloud based solutions.Main ResponsibilitiesSolution RAM AssessmentsReview and approve solution requirements for RAMDetermine non-functional requirements and targets for RAM performancePerform analysis and modeling to predict RAM behaviourAdhere to the I&T Development ProcessSolution RAM Field PerformanceAssign requirements to solutions and products to ensure they support the ability to measure RAM Key Performance Indicators (KPIs)Use the field performance measurement to identify key contributors and drive corrective action plans when necessaryReview vendor specifications, test results, analysis artifactsParticipate in failure review board for selected vendorsReview corrective action plans from the vendorsDrive to completion the vendor corrective action plansUse the field performance measurement to identify key contributors and drive corrective action plans when necessaryRequirementsExperienceMinimum 5-10 years overall work experienceMinimum 5 years experience in RAM engineering for complex systems, or 7 years experience in product development for high reliability/availability, or safety critical systems with accountability for product field performanceSkills/KnowledgeKnowledge of hardware and/or software design and development practices and processes with focus on high reliability and high availability applicationsKnowledge of RAM analysis techniques such as failure rate prediction, Reliability Block Diagrams (RBD), Markov models, Monte Carlo methods, Failure Modes Effects Analysis (FMEA), Fault Tree Analysis (FTA)Analysis of reliability and failure field data, statistical estimation, Root Cause Analysis (RCA)Critical thinking and judgementAbility to assimilate new information quickly and apply to the assignmentAbility to deliver with autonomyOrganizing work to support multiple projects in parallelKnowledge and/or experience in the following areasMulti-Cloud/Multi-Zone-Based designs with High Availability (HA)Compute Infrastructure: Google Compute Engine (GCE) (servers, databases, firewalls, load balancers, networking and storage)Services for Google Cloud Platform (GCP)Databases including NoSQL Databases, Big Data technologies (Oracle, SQL Server, Postgres, Spark, Hadoop, Cloud databases)Application development concepts and technologies (CI/CD, Java, Python)Education/Certification/DesignationBachelors degree in Electrical Engineering, Mechanical Engineering, Computer Science, Computer Engineering or equivalent degree & experienceAssetsKnowledge of product design and standards for the rail industryKnowledge of rail industry or other transportation industry operationsWorking ConditionsThis role may require occasional business travel within North America in accordance with company policy #J-18808-Ljbffr


  • Site Reliability Engineer

    23 minutes ago


    Montreal (administrative region), Canada Noramtec Consultants Inc. Full time

    A major global financial services institution is partnering with us to hire a Site Reliability Engineer (SRE) for their growing Montreal-based Application Infrastructure team. This pivotal role will focus on ensuring the reliability, performance, and operational stability of enterprise applications, with a primary emphasis on ServiceNow SaaS implementations...


  • Montreal (administrative region), Canada Compunnel Inc. Full time

    Site Reliability Engineer (SRE) – AWADC5704026 Work Location: Montreal, QC (3 days onsite/week). Job Title: Site Reliability Engineer (SRE), ServiceNow, Application Infrastructure. Must Have: Hands on Python Scripting. Job Description: Successful candidates for SRE roles in Application Infrastructure come from a variety of backgrounds; a developer looking...


  • Montreal (administrative region), Canada Compunnel Inc. Full time

    Site Reliability Engineer (SRE) – AWADC Work Location: Montreal, QC (3 days onsite/week). Job Title: Site Reliability Engineer (SRE), ServiceNow, Application Infrastructure. Must Have: Hands on Python Scripting. Job Description: Successful candidates for SRE roles in Application Infrastructure come from a variety of backgrounds; a developer looking to...


  • Montreal (administrative region), Canada Compunnel Inc. Full time

    Site Reliability Engineer (SRE) – AWADC5704026 Work Location: Montreal, QC (3 days onsite/week). Job Title: Site Reliability Engineer (SRE), ServiceNow, Application Infrastructure. Must Have: Hands on Python Scripting. Job Description: Successful candidates for SRE roles in Application Infrastructure come from a variety of backgrounds; a developer looking...

  • Site Reliability Engineer

    26 minutes ago


    Montreal (administrative region), Canada Canonical Full time

    Site Reliability Engineer Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world's leading public cloud and...


  • Montreal, Canada ApTask Full time

    Direct message the job poster from ApTask Looking for an intermediate between 2 to 5 years' experience. The Application Infrastructure (Al) department is seeking a Site Reliability Engineer (SRE) to help drive the reliability engineering, operations and customer support services clients ServiceNow SaaS implementation. Reporting to a Site Reliability...

  • Site Reliability Engineer

    24 minutes ago


    Montreal, Canada ApTask Full time

    Direct message the job poster from ApTaskLooking for an intermediate between 2 to 5 years' experience.The Application Infrastructure (Al) department is seeking a Site Reliability Engineer (SRE) to help drive the reliabilityengineering, operations and customer support services clients ServiceNow SaaS implementation.Reporting to a Site Reliability Engineering...


  • Montreal (administrative region), Canada TMC Canada Full time

    Summary The Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to help drive the reliability engineering, operations and customer support services for Morgan Stanley's ServiceNow SaaS implementation. Reporting to a Site Reliability Engineering & Operations Lead. This role requires delivering a range of SRE practices...

  • Site Reliability Engineer

    23 minutes ago


    Montreal (administrative region), Canada TMC Canada Full time

    Summary The Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to help drive the reliability engineering, operations and customer support services for Morgan Stanley's ServiceNow SaaS implementation. Reporting to a Site Reliability Engineering & Operations Lead. This role requires delivering a range of SRE practices...

  • Site Reliability Engineer

    23 minutes ago


    Montreal (administrative region), Canada High Tech Genesis Full time

    Join to apply for the Site Reliability Engineer role at High Tech Genesis WE'RE HIRING! At HTG, you’ll push boundaries with the latest tech and collaborate with a team that loves what they do. Be part of a design services company that is among the companies that lead the world in technology and innovation. Your next chapter starts here. Responsibilities...