Current jobs related to L3 Site Reliability Engineer - Montreal - Astra-North Infoteck Inc. ~ Conquering today’s challenges, achieving tomorrow’s vision!


  • Montreal, Canada Lancesoft Full time

    Job Title: Site Reliability Engineer. Experience Level: Level 4 (advanced), 7-15 years. Location: Montreal (Day 1 onboarding onsite; in-office presence 3x per week). Job Description: The Private Cloud SRE L3 team is part of the Enterprise Computing organization within the Company. The team has a presence in cities globally and is focused on supporting cloud and...


  • Montreal, Canada TMC Canada Full time

    Head of Talent Acquisition Department at TMC North America. The Private Cloud SRE L3 team is part of the Enterprise Computing organization. The team has a presence in cities globally and is focused on supporting cloud and container-based platforms for internal and external clients. You will integrate with the global follow-the-sun operations model, which...


  • Montreal, Canada Compunnel Inc. Full time

    Site Reliability Engineer – KUMDC Long Term Contract The Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to drive reliability engineering, operations, and customer support services for ServiceNow SaaS implementation. Reporting to the Site Reliability Engineering & Operations Lead, this role involves delivering SRE...


  • Montreal, Canada Compunnel Inc. Full time

    Site Reliability Engineer – KUMDC5681698 Long Term Contract The Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to drive reliability engineering, operations, and customer support services for ServiceNow SaaS implementation. Reporting to the Site Reliability Engineering & Operations Lead, this role involves delivering...

  • Core L3 SRE

    3 days ago


    Montreal, Canada Artech LLC Full time

    Job Title: Core L3 SRE. Location: Montreal, Quebec. Duration: 6 Months. Introduction: We are seeking a highly skilled and experienced Site Reliability Engineer to join our dynamic team. The ideal candidate will have a strong background in IT with hands-on experience in supporting BI platforms and a deep understanding of Linux/Unix systems, scripting, and...


  • Montreal, Canada Open Systems Technologies Full time

    Site Reliability Engineer (SRE), ServiceNow, Application Infrastructure Location: Montreal – Hybrid – 3 days/week The Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to help drive reliability engineering, operations and customer support services for client’s ServiceNow SaaS implementation. Reporting to a Site...


  • Montreal, Canada Astra North Infoteck Inc. Full time

    Job Description: System Administrator. Position Description: The Core Services L3 support team is part of the Enterprise Computing Data Services Organization. The team manages and supports a variety of applications developed in-house for purposes like application management and application coordination using Apache Zookeeper, API Proxy, Automation Platform using...


  • Montreal, Canada Compunnel, Inc. Full time

    Client is seeking an experienced Site Reliability Engineer (SRE) to support and enhance the reliability, performance, and operational efficiency of our global ServiceNow SaaS platform. As part of the Application Infrastructure (AI) team, you will be instrumental in advancing SRE practices, ensuring seamless integration and stability across on-premise...

L3 Site Reliability Engineer

4 weeks ago


Montreal, Canada Astra-North Infoteck Inc. ~ Conquering today’s challenges, achieving tomorrow’s vision! Full time

L3 Site Reliability Engineer - Linux, Automation (Ansible), IaC (Terraform), Zookeeper

Position Description

The Core Services L3 support team is part of the Enterprise Computing Data Services Organization. The team manages and supports a variety of in-house applications used for application management and coordination (Apache Zookeeper), API Proxy, automation (Ansible Automation Platform), and Infrastructure as Code (Terraform). It serves as the highest level of escalation and actively engages the engineering teams that develop the products and tooling to maintain service stability.

This position is a Level 3 support and SRE role with global responsibility for managing and supporting these middleware products, including on-call coverage to handle production escalations. The successful candidate will be involved in day-to-day management of the infrastructure environment, troubleshooting with users, and handling changes, incidents, escalations, and problem management. The person will also routinely work with the engineering teams that developed these products to resolve problems and proactively automate operational and user processes to reduce toil and time to market.

Required Skills

  • 8+ years of overall IT experience.
  • Advanced Linux / Unix support experience.
  • Strong shell scripting and Python programming skills for SRE-related activities.
  • Experience using Splunk or the Grafana/Prometheus/Loki stack, preferably both.
  • General understanding of Veritas Cluster Server, load balancers, and VMware.
  • Knowledge of ITIL principles.
  • Effective oral and written communication skills, and interpersonal skills to work well in a team environment.
  • Strong organizational and coordination skills, with the ability to manage multiple tasks and high-pressure situations during outage handling, management, or resolution.
  • Availability for weekend work.

Desired Skills

  • Experience in application support, code release, and liaison with development teams.
  • Experience with automation using Ansible playbooks.
  • Experience with Ansible Automation Platform administration.
  • Experience with Terraform, especially Terraform Enterprise.
  • Knowledge of Docker and Kubernetes/OpenShift.
  • Experience with development toolchains such as Git, Bitbucket, and CI/CD tools.
  • Experience in Agile methodologies.
  • Good knowledge of JVMs and garbage collection mechanisms.
  • Experience with relational databases.

Seniority Level: Associate
Employment Type: Full-time
Job Function: Information Technology
Industries: IT Services and IT Consulting