Systems Reliability Engineer

2 days ago


Montreal, Canada Axelon Services Corporation Full time

Systems Reliability Engineer

12 Months Contract

Location : Montreal


  • Looking for role in production support team.
  • Scripting knowledge – UNIX or shell or python
  • Relational databases
  • Grafana or Prometheus is an added advantage.
  • 2 rounds – Zoom and Onsite.
  • Application support role.
  • 2-5 years of experience.
  • Sometimes need to work on rotational basis only on Sundays, once in 6 weeks.


Experience : Intermediate with 2 to 5 years

Top 3 Must have :

1. Strong experience with Python and / or Shell scripting

2. Strong experience with data base (DB2 knowledges is a plus)

3. Strong communication skills. The consultant will work with business users in day to day basis.


Top 2 Nice to have :

1. Good knowledges of Grafana, Prometheus

2. Good experience with debugging



Reliability & Production Engineering

Resiliency Engineering is a production-oriented discipline focused on improving service availability, latency, scalability, performance, and efficiency for technology products in ***. Our core infrastructure processes hundreds of millions of transactions, and we serve assets of more than a trillion dollars daily. This role will be responsible for the design & implementation of the platform, and corresponding frameworks, application and gameday exercises, for testing critical applications at scale. If this scale resonates with you, come join us.


Job Profile

Systems Reliability Engineering (SRE) is a discipline focused on improving system service availability, observability, scalability, performance, and resilience across *** by applying sound software engineering principles and adopting the latest technology and tooling.


We are growing SRE capabilities within our Reliability & Production Engineering (RPE) organization as part of the transformation of ***’s Technology.


Responsibilities:


• Are interested in distributed systems and working with highly scalable and reliable services.

• Like to work in a fast-moving environment and you aren't afraid to change things to make them better.

• Enjoy new technological challenges and solving hard problems.

• Believe a team working well together is smarter than the single smartest person on that team.

• Have grit, drive and a deep sense of ownership.

• Working closely with engineering/development teams to design, build, and maintain systems.

• Troubleshooting issues across the entire technology stack: hardware, software, application, and network.

• Identifying and driving opportunities to improve automation for our platforms; scope and create automation for deployment, management, and visibility of our services.

• Proactively identifying and addressing systems reliability risks.

• Working alongside existing global and regional team members on a follow-the-sun basis.

• Represent the RPE organization in design reviews and operational readiness exercises for new and existing services.



Qualifications - Skill Set

• Demonstrated ability to troubleshoot problems and debug to identify root cause.

• Hands on experience on enterprise tools such as AppDynamics, Grafana, Splunk, Dynatrace.

• Experience with Ansible, GitHub or any automation/configuration/release management tools.

• Automation-related experience is particularly valued using scripting languages such as python, bash, perl. One higher level language is desired.

• Awareness of, and ability to reason about modern software and systems architectures, including load-balancing, databases, queueing, caching, distributed systems failure modes, micro services, Cloud, etc.

• Practical experience running large scale systems is an advantage.

• Should be able to contribute to system design and architecture with strong database knowledge.



Qualifications/Criterion

• Background in Computer Science/Engineering or similar field.


*** is an equal opportunities employer. We work to provide a supportive and inclusive environment where all individuals can maximize their full potential. Our skilled and creative workforce is comprised of individuals drawn from a broad cross section of the global communities in which we operate and who reflect a variety of backgrounds, talents, perspectives and experiences. Our strong commitment to a culture of inclusion is evident through our constant focus on recruiting, developing and advancing individuals based on their skills and talents.


Company Profile

*** is a leading global financial services firm providing a wide range of investment banking, securities, wealth management and investment management services. With offices in more than 41 countries, the Firm's employees serve clients worldwide including corporations, governments, institutions and individuals. For further information about ***, please visit www.Brokerage.com.



  • Montreal, Canada Axelon Services Corporation Full time

    Systems Reliability Engineer 12 Months Contract Location : Montreal Looking for role in production support team. Scripting knowledge – UNIX or shell or python Relational databases Grafana or Prometheus is an added advantage. 2 rounds – Zoom and Onsite. Application support role. 2-5 years of experience. Sometimes need to work on rotational...


  • Montreal, Canada LanceSoft, Inc. Full time

    Job DescriptionLanceSoft, Inc. is seeking a skilled System Reliability Engineer to join our team in Montreal.Job Summary:We are looking for an experienced System Reliability Engineer to design, build, and maintain scalable and reliable systems. As a key member of our engineering team, you will be responsible for troubleshooting issues across the technology...


  • Montreal, Quebec, Canada LanceSoft, Inc. Full time

    **Job Overview**LanceSoft, Inc. is seeking a skilled Site Reliability Engineer to join our team in Montreal, Quebec, Canada.**Estimated Salary:** $120,000 - $180,000 per annum, depending on experience.Company OverviewWe are a leading technology company that values innovation and collaboration.About the JobThe successful candidate will be responsible for...


  • Montreal, Quebec, Canada LanceSoft Full time

    LanceSoft is seeking a highly skilled Reliable System Architect to join our team in Montreal. This is a 12+ month hybrid opportunity, working 3 days a week from home and 2 days in the office.In this role, you will be responsible for designing, building, and maintaining scalable and reliable systems that meet our business needs. You will work closely with our...


  • Montreal, Canada Soho Square Solutions Full time

    Soho Square Solutions is seeking a skilled Reliability Engineer Specialist to drive reliability engineering, operations, and customer support services for our ServiceNow SaaS implementation. As a key member of the Application Infrastructure team, you will report to the Site Reliability Engineering & Operations Lead and work closely with a global community of...


  • Montreal, Canada LanceSoft, Inc. Full time

    LanceSoft, Inc. is seeking a highly skilled Site Reliability Engineer to join our team.The ideal candidate will have experience in designing and maintaining scalable and reliable systems, as well as troubleshooting complex technical issues.This role is an excellent opportunity for someone looking to work in a fast-paced environment and contribute to the...


  • Montreal, Canada Soho Square Solutions Full time

    Site Reliability Engineer (SRE) - ServiceNow, Application InfrastructureThe Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to drive reliability engineering, operations, and customer support services for a ServiceNow SaaS implementation. Reporting to a Site Reliability Engineering & Operations Lead, this role involves...


  • Montreal, Quebec, Canada Axelon Services Corporation Full time

    Job SummaryWe are seeking a skilled Systems Reliability Engineer to join our team. This is a 12-month contract role with a competitive salary of $90,000 - $110,000 per year, depending on experience.About the RoleThe successful candidate will work as part of our production support team, providing expertise in systems reliability and availability. Key...


  • Montreal, Quebec, Canada LanceSoft, Inc. Full time

    At LanceSoft, Inc., we are seeking a skilled Reliability Engineering Specialist to join our team in Montreal. This is a hybrid role that requires working 3 days on-site and the rest of the time remotely.The successful candidate will have at least 2 years of experience in Systems Reliability Engineering (SRE) and will be responsible for improving system...


  • Montreal, Canada Soho Square Solutions Full time

    Soho Square Solutions is seeking a highly skilled Reliability Engineering Specialist to join our team.As a key member of our Application Infrastructure department, you will be responsible for driving reliability engineering, operations, and customer support services for our ServiceNow SaaS implementation.This role involves delivering SRE practices within a...

  • Reliability Engineer

    2 months ago


    Montreal, Quebec, Canada National Bank Full time

    We are seeking a skilled Reliability Engineer to join our team at National Bank. As a specialist in reliability, efficiency, and performance of systems, you will play a critical role in ensuring the stability and scalability of our applications.Key ResponsibilitiesPromote and implement best practices for resilience and stability within teamsSupport and...


  • Montreal, Quebec, Canada Axelon Services Corporation Full time

    We are seeking a highly skilled Senior Systems Reliability Engineer to join our team at Axelon Services Corporation. As a key member of our Reliability & Production Engineering organization, you will play a critical role in improving the availability, latency, scalability, performance, and efficiency of our technology products.Our core infrastructure...


  • Montreal, Canada LanceSoft, Inc. Full time

    Location : Montreal (Hybrid 3 days)Duration: 12+ MonthsJob ProfileSystems Reliability Engineering (SRE) is a discipline focused on improving system service availability, observability, scalability, performance, and resilience across *** by applying sound software engineering principles and adopting the latest technology and tooling.Responsibilities:Are...


  • Montreal, Canada LanceSoft, Inc. Full time

    Location : Montreal (Hybrid 3 days)Duration: 12+ MonthsJob ProfileSystems Reliability Engineering (SRE) is a discipline focused on improving system service availability, observability, scalability, performance, and resilience across *** by applying sound software engineering principles and adopting the latest technology and tooling.Responsibilities:Are...


  • Montreal, Canada LanceSoft, Inc. Full time

    Location : Montreal (Hybrid 3 days) Duration: 12+ Months Job Profile Systems Reliability Engineering (SRE) is a discipline focused on improving system service availability, observability, scalability, performance, and resilience across *** by applying sound software engineering principles and adopting the latest technology and tooling. Responsibilities: ...


  • Montreal, Canada LanceSoft, Inc. Full time

    Site Reliability Engineer Montreal, Quebec, Canada Hybrid Duration: 12+ months Responsibilities: • Are interested in distributed systems and working with highly scalable and reliable services. • Like to work in a fast-moving environment and you aren't afraid to change things to make them better. • Enjoy new technological challenges and...


  • Montreal, Canada LanceSoft, Inc. Full time

    Site Reliability EngineerMontreal, Quebec, Canada HybridDuration: 12+ monthsResponsibilities: • Are interested in distributed systems and working with highly scalable and reliable services. • Like to work in a fast-moving environment and you aren't afraid to change things to make them better. • Enjoy new technological challenges and solving hard...


  • Montreal, Quebec, Québec, Canada Soho Square Solutions Full time

    Site Reliability Engineer (SRE) - ServiceNow, Application InfrastructureThe Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to drive reliability engineering, operations, and customer support services for a ServiceNow SaaS implementation. Reporting to a Site Reliability Engineering & Operations Lead, this role involves...


  • Montreal, Quebec, Canada Axelon Services Corporation Full time

    About the RoleWe are seeking a highly skilled Reliability Engineering Specialist to join our team. In this role, you will be responsible for designing and implementing scalable systems that ensure high availability and performance.Your primary focus will be on building and maintaining critical applications, ensuring they meet the required standards of...


  • Montreal, Quebec, Québec, Canada LanceSoft, Inc. Full time

    Location : Montreal (Hybrid 3 days)Duration: 12+ MonthsJob ProfileSystems Reliability Engineering (SRE) is a discipline focused on improving system service availability, observability, scalability, performance, and resilience across *** by applying sound software engineering principles and adopting the latest technology and tooling.Responsibilities:Are...