Site Reliability Engineer
4 weeks ago
Job Description:We are growing our team globally. It’s a unique opportunity to work on leading edge projects leveraging the latest technologies such as Cloud solutions and Analytics. The primary objective of the team is to ensure reliability across the production plant by developing a deep understanding of how our application code is running, configured, and scaled. This allows us to effectively resolve open incidents in the shortest amount of time, develop monitors to detect future occurrences and implement automation technologies to enable the environment to self-heal. Our team manages all entitlements/accesses in Production in a scope of more than 35 systems and user distributed globally around the world with accesses span from Trading to payment to vendor apps. Role andResponsibilities:Ensure Production Management is closely aligned/embedded in the Agile software development process and our code meets production standardsIncorporate System Reliability Engineering and DevOps implementations into the day-to-day role by developing automated solutions to long standing problems to ensuring minimal downtime and manual effortConfiguring application monitors using industry standard monitoring tools, as well as developing customized monitoring solutionsBuild extensive business and application knowledge required for supporting client facing applicationsRevisit SRE Metrics and confirm against the firm and department goalsImplement tooling / create automations to help with Toil Elimination (manual or repetitive work)Engage early in SDLC with our Development teams to have an active role in creating a resilient and reliable solutionPrioritize project work based on critical incidents and key business stakeholdersInterface with clients and other technology teams to provide governance and control around the production environment.Qualifications:You should apply on this requisition if you have, at minimum, the following profile: Bachelor’s degree in Computer Science or related fieldExperience with Service Oriented Architecture, Distributed Systems, Business Intelligence Reporting such as Power BI, Scripting such as Python or shell, Front end development (HTML, Java Script, AngularJS), Cloud Computing such as MS AZURE and SaaS integrationsClear understanding of Logging, Monitoring, and Knowledge Management practices such as Docs as CodeAbility to manage an incident call and coordinate multiple teams towards a common goal of resolving a business impactful outage, once trainedStrong knowledge of DevOps and SRE Principles with grasp over tools / approach to apply themStrong infrastructure knowledge in Linux / Unix admin, Storage, Networking and Web TechnologiesAdvanced Unix Shell / Python scripting experienceAdvanced SQL query language knowledge such as Sybase, DB2, MongoDB and Snowflake preferred.
-
Principal Site Reliability Engineer
1 month ago
Montréal, QC, Canada Lightspeed Commerce Full timeHi there! Thanks for stopping by Are you actively looking for a new opportunity? Or just checking the market? Well… you might just be in the right place! We’re looking for a Principal Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and...
-
Oracle: Principal Site Reliability Engineer
1 month ago
Montréal, QC, Canada Lightspeed Full timeWe’re looking for a Principal Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and the profitability of their business. You'll join a team responsible for supporting the group in cross-cutting concerns, such as cloud infrastructure,...
-
Principal Site Reliability Engineer
3 weeks ago
Montréal, QC, Canada Lightspeed Full timeHi there! Thanks for stopping by Are you actively looking for a new opportunity? Or just checking the market? Well… you might just be in the right place! We’re looking for a Principal Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and...
-
Site Reliability Engineer
4 weeks ago
Montréal, QC, Canada LanceSoft, Inc. Full timeJob Title: Production Reliability & Support Expert (SRE)Location : Montreal ( Office attendance from Day 1 – Hybrid mode 3x per week)Years of experience : 3 to 5 years • Ensure Production Management is closely aligned/embedded in the Agile software development process and our code meets production standards • Incorporate System Reliability Engineering...
-
Site Reliability Engineer
4 weeks ago
Montréal, QC, Canada LanceSoft, Inc. Full timeJob Title: Production Reliability & Support Expert (SRE)Location : Montreal ( Office attendance from Day 1 – Hybrid mode 3x per week)Years of experience : 3 to 5 years • Ensure Production Management is closely aligned/embedded in the Agile software development process and our code meets production standards • Incorporate System Reliability Engineering...
-
Site Reliability Engineer
1 week ago
Montréal, QC, Canada LanceSoft, Inc. Full timeJob Description:We are growing our team globally. It’s a unique opportunity to work on leading edge projects leveraging the latest technologies such as Cloud solutions and Analytics. The primary objective of the team is to ensure reliability across the production plant by developing a deep understanding of how our application code is running, configured,...
-
Site Reliability Engineer
1 week ago
Montréal, QC, Canada LanceSoft, Inc. Full timeJob Description:We are growing our team globally. It’s a unique opportunity to work on leading edge projects leveraging the latest technologies such as Cloud solutions and Analytics. The primary objective of the team is to ensure reliability across the production plant by developing a deep understanding of how our application code is running, configured,...
-
Site Reliability Engineer, Data Pipelines
3 weeks ago
Montréal, QC, Canada Cisco Full timeAs a part of Cisco, Accedian is a leader in performance analytics and end user experience solutions for service providers and mid-to-large size enterprises. The Accedian Skylight service assurance platform offers granular end-to-end visibility within "the massive multi" - multi-layer, multi-domain, and multi-vendor networks. You are an expert in deployment...
-
Site Reliability Engineer
3 weeks ago
Montréal, QC, Canada LanceSoft, Inc. Full timeResponsibilities include:• SRE duties for the relevant squad, Snowflake or Flexera, providing engineering support for observability and enhancements to the overall functionality of the ITSM platforms.• A commitment to understanding ITSM’s range of products with a view to specializing in one or two of them and contributing to their documentation.•...
-
Site Reliability Engineer
3 weeks ago
Montréal, QC, Canada LanceSoft, Inc. Full timeResponsibilities include:• SRE duties for the relevant squad, Snowflake or Flexera, providing engineering support for observability and enhancements to the overall functionality of the ITSM platforms.• A commitment to understanding ITSM’s range of products with a view to specializing in one or two of them and contributing to their documentation.•...
-
Site Reliability Engineer
1 week ago
Montréal, QC, Canada LanceSoft, Inc. Full timeResponsibilities include:• SRE duties for the relevant squad, Snowflake or Flexera, providing engineering support for observability and enhancements to the overall functionality of the ITSM platforms.• A commitment to understanding ITSM’s range of products with a view to specializing in one or two of them and contributing to their documentation.•...
-
Site Reliability Engineer
1 week ago
Montréal, QC, Canada LanceSoft, Inc. Full timeResponsibilities include:• SRE duties for the relevant squad, Snowflake or Flexera, providing engineering support for observability and enhancements to the overall functionality of the ITSM platforms.• A commitment to understanding ITSM’s range of products with a view to specializing in one or two of them and contributing to their documentation.•...
-
Site Reliability Engineering
2 months ago
Montréal, QC, Canada Noverka Conseil Full timeAt Noverka, our values illustrate who we are and define our beliefs: Human, Transparent, Passionate. We are driven by innovation and success, both in our relationships and in our practices. Finding the right job for the right person is what we do best! Our client, an organization in the banking industry is looking for a Site Reliability Engineering (SRE)...
-
Site Reliability Engineer
3 weeks ago
Montréal, QC, Canada LanceSoft, Inc. Full timeResponsibilities include: • SRE duties for the relevant squad, Snowflake or Flexera, providing engineering support for observability and enhancements to the overall functionality of the ITSM platforms. • A commitment to understanding ITSM’s range of products with a view to specializing in one or two of them and contributing to their documentation. •...
-
FinOps and Site Reliability Engineering specialist
2 months ago
Montréal, QC, Canada CGI Full timePosition Description: CGI is a dynamic and innovative technology firm committed to delivering cutting-edge solutions. We are currently seeking a highly skilled and motivated individual to join our team as a FinOps and Site Reliability Engineer (SRE). This role is pivotal in bridging our finance and technology teams to ensure the successful implementation...
-
Site Reliability Engineer 3
1 month ago
Montréal, QC, Canada Behavox Full timeAbout Behavox Behavox is shaping the future for how businesses harness their most important raw material - data. Our mission is bold: Organize enterprise data into actionable information that protects and promotes the business growth of multinational companies around the world. From managing enterprise risk and compliance to maximizing revenue and value,...
-
Site Reliability Engineer
3 weeks ago
Montréal, Canada LanceSoft, Inc. Full timeResponsibilities include:• SRE duties for the relevant squad, Snowflake or Flexera, providing engineering support for observability and enhancements to the overall functionality of the ITSM platforms.• A commitment to understanding ITSM’s range of products with a view to specializing in one or two of them and contributing to their documentation.•...
-
Site Reliability Engineer
3 weeks ago
montréal, Canada LanceSoft, Inc. Full timeResponsibilities include:• SRE duties for the relevant squad, Snowflake or Flexera, providing engineering support for observability and enhancements to the overall functionality of the ITSM platforms.• A commitment to understanding ITSM’s range of products with a view to specializing in one or two of them and contributing to their documentation.•...
-
Site Reliability Engineer
3 weeks ago
montréal, Canada LanceSoft, Inc. Full timeResponsibilities include:• SRE duties for the relevant squad, Snowflake or Flexera, providing engineering support for observability and enhancements to the overall functionality of the ITSM platforms.• A commitment to understanding ITSM’s range of products with a view to specializing in one or two of them and contributing to their documentation.•...
-
Senior Site Reliability Engineer/DevOps
2 months ago
Montréal, QC, Canada Synechron Full timeNous sommes Synechron est un cabinet de conseil leader mondial en transformation numérique, axé sur les services financiers et les organisations technologiques. Nos spécialités incluent l'intelligence artificielle de bout en bout, le conseil, le numérique, le cloud & DevOps, les données et l'ingénierie logicielle. Notre client dans le domaine de la...