Site Reliability Engineer

7 months ago

Montreal, Canada Lyft Full time

At Lyft, our mission is to improve people’s lives with the world’s best transportation. Imagine cities where streets are safe, communities thrive, and personal cars are a thing of the past. We envision a future where shared and active transportation modes are the norm, fostering vibrant, connected neighborhoods.

As a leader in micromobility, Lyft powers millions of rides daily across over 200 cities with our cutting-edge ride-sharing, bike-sharing, and scooter-sharing technologies. Our Montreal office is the birthplace of North America's first automated bike-share system, Bixi, which has since revolutionized urban mobility. Today, our pioneering system is operational in more than 50 cities worldwide, including Barcelona, Bogota, Boston, Buenos Aires, Chicago, Dubai, London, Madrid, Mexico City, Montreal, New York, Rio de Janeiro, San Francisco, and Washington DC, to name just a few. Join us and be part of the team behind some of the world's largest and most successful bike-share systems

The Transit, Bikes, and Scooters (TBS) infrastructure team at Lyft in Montreal is growing, and we are looking for a Site Reliability Engineer to support our production systems, platforms, and the tools our developers use, while ensuring the reliability of our systems.

Every engineering team at Lyft is responsible for running and operating the software that they build. The Infrastructure team works towards standardizing and supporting all the rapidly evolving teams throughout our organization, assessing their architecture, helping them design scalable services, and fostering excellent operational practices. It's a mission-critical role of ensuring that our systems are always healthy, monitored, automated, and designed to scale.

The nature of work is interdisciplinary, and our teammates come from varying backgrounds e.g. (Site Reliability Engineer (SRE), Systems Engineer, Software Engineer, DevOps Engineer, Infrastructure Engineer, Production Engineer). We urge you to apply even if you feel uncertain that you have the exact background.

Technical interviews and interactions with the other offices in the company will be mainly in English; however, the working environment in Montreal is bilingual.

Responsibilities:

Help define the team’s roadmap and architecture based on technology and business needsDesign and implement effective infrastructure abstractions that increase velocity of our application teamsBe responsible for, design, develop, deploy, monitor, operate and maintain existing or new elements of our systems infrastructure.Build holistic visibility into SLIs, SLOs, SLAs, dependency graphs, past performance of software, network, and system to ensure that we can continue to scale without increasing operational burden or toilUse the core Site Reliability Engineering principles of change management, monitoring, emergency response, capacity planning, and production readiness reviews to run the platformStep back to observe patterns and develop innovative tools and automation to minimize toil. Use those learnings to drive the best operational practices.Partner with the broader Lyft organization to build a culture of rigorously learning from incidentsUnblock, support, and effectively communicate across teams to achieve resultsHave a good grasp and ability to explain the various tradeoffs made in decisionsShare your knowledge by giving brown bags, tech talks, and evangelizing appropriate tech and engineering best practices.

Experience:

5+ years of software engineering/production infrastructure industry experienceExperience designing, debugging and running fault-tolerant large-scale distributed systemsExperience with high level programming languages (Python, Go, Java, etc.)Experience working with public cloud platforms (e.g., AWS, Google Cloud Platform, Microsoft Azure, etc.)Experience bringing software to production at high scaleExperience with common CI tools (Jenkins, Buildkite, CircleCI, TeamCity), and proficiency in at least one of those tools an assetExperience working with databases, relational or NoSQL an assetExperience in Linux system administration, or familiarity with managing a fleet of Linux servers an assetMust be fluent in spoken and written English and minimally be willing to learn French if required

Benefits:

Comprehensive health, dental, and vision insurance plans, including family coverageLife insurance and disability benefitsMental health support programsHealthcare Spending Account (HSA)Fertility and family-building supportComplimentary lunch, snacks, beverages, coffee, and tea in our officesAdditional holidays (13 in 2024, 5 more than the legal requirement)15 days of paid time off, with an extra day for each year of service, up to a maximum of 25 days4 floating holidays per year10 paid sick days annuallyOccasional company-wide recharge days (5 in 2024)Up to 18 weeks of fully paid parental leave, subject to certain conditions, for biological, adoptive, and foster parentsAnd other special benefits related to our services

Lyft proudly pursues and hires a diverse workforce. Lyft believes that every person has a right to equal employment opportunities without discrimination because of race, ancestry, place of origin, colour, ethnic origin, citizenship, creed, sex, sexual orientation, gender identity, gender expression, age, marital status, family status, disability, pardoned record of offences, or any other basis protected by applicable law or by Company policy. Lyft also strives for a healthy and safe workplace and strictly prohibits harassment of any kind. Accommodation for persons with disabilities will be provided upon request in accordance with applicable law during the application and hiring process. Please contact your recruiter now if you wish to make such a request.

This role will be in-office on a hybrid schedule — Team Members will be expected to work in the office 3 days per week on Mondays, Thursdays and a team-specific third day. Additionally, hybrid roles have the flexibility to work from anywhere for up to 4 weeks per year. #Hybrid

Site Reliability Engineer

2 weeks ago

Montreal, Canada Soho Square Solutions Full time

Site Reliability Engineer (SRE) - ServiceNow, Application InfrastructureThe Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to drive reliability engineering, operations, and customer support services for a ServiceNow SaaS implementation. Reporting to a Site Reliability Engineering & Operations Lead, this role involves...
Site Reliability Engineer

4 weeks ago

Montreal, Canada Soho Square Solutions Full time

Site Reliability Engineer (SRE) - ServiceNow, Application InfrastructureThe Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to drive reliability engineering, operations, and customer support services for a ServiceNow SaaS implementation. Reporting to a Site Reliability Engineering & Operations Lead, this role involves...
Site Reliability Engineer

4 weeks ago

Montreal, Canada Soho Square Solutions Full time

Site Reliability Engineer (SRE) - ServiceNow, Application InfrastructureThe Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to drive reliability engineering, operations, and customer support services for a ServiceNow SaaS implementation. Reporting to a Site Reliability Engineering & Operations Lead, this role involves...
Site Reliability Engineer

1 month ago

Montreal, Quebec, Québec, Canada Soho Square Solutions Full time

Site Reliability Engineer (SRE) - ServiceNow, Application InfrastructureThe Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to drive reliability engineering, operations, and customer support services for a ServiceNow SaaS implementation. Reporting to a Site Reliability Engineering & Operations Lead, this role involves...
Site Reliability Engineer

5 hours ago

Montreal, Canada LanceSoft, Inc. Full time

Site Reliability EngineerMontreal, Quebec, Canada HybridDuration: 12+ monthsResponsibilities: • Are interested in distributed systems and working with highly scalable and reliable services. • Like to work in a fast-moving environment and you aren't afraid to change things to make them better. • Enjoy new technological challenges and solving hard...
AWS Site Reliability Engineer

4 weeks ago

Montreal, Canada SAP SE Full time

p>We help the world run betterAt SAP, we enable you to bring out your best. Our company culture is focused on collaboration and a shared passion to help the world run better. p>The Reliability Engineering organization provides a multitude of products and services related to operations and continuity of business delivery.The Site Reliability Engineering teams...
Site Reliability Engineer

1 day ago

Montreal, Quebec, Canada LanceSoft, Inc. Full time

Unlock a career as a Site Reliability Engineer at LanceSoft, Inc., a cutting-edge technology company based in Montreal, Quebec, Canada. We are seeking an experienced and highly motivated individual to join our team.Job Type: Full-timeDuration: 12+ monthsCompany OverviewLanceSoft, Inc. is a leading technology firm dedicated to delivering innovative solutions...
Site Reliability Engineering Leader

2 days ago

Montreal, Quebec, Canada Royal Bank of Canada Full time

Transform Your Career with a Leadership Role in Site Reliability Engineering We are seeking an experienced Senior Site Reliability Engineer to join our team at the Royal Bank of Canada. As a key member of our Digital Branch SRE organization, you will play a critical role in developing, implementing, and supporting SRE solutions for applications supported by...
AWS Site Reliability Engineer

3 weeks ago

Montreal, Canada SAP SE Full time

p>We help the world run betterAt SAP, we enable you to bring out your best. Our company culture is focused on collaboration and a shared passion to help the world run better. We focus every day on building the foundation for tomorrow and creating a workplace that embraces differences, values flexibility, and is aligned to our purpose-driven and...
AWS Site Reliability Engineer

4 months ago

Montreal, Canada Alltech Consulting Services Full time

Job Description Level 4 The Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to help drive the reliability engineering, operations, and customer support services for Company’s ServiceNow SaaS implementation. Reporting to a Site Reliability Engineering & Operations Lead, this role requires delivering a range of SRE...
Site Reliability Engineer

1 day ago

Montreal, Quebec, G4F, CA LanceSoft, Inc. Full time

Site Reliability EngineerMontreal, Quebec, Canada HybridDuration: 12+ monthsResponsibilities: • Are interested in distributed systems and working with highly scalable and reliable services. • Like to work in a fast-moving environment and you aren't afraid to change things to make them better. • Enjoy new technological challenges and solving hard...
Site Reliability Engineer for Global ServiceNow Implementation

4 weeks ago

Montreal, Quebec, Canada Alltech Consulting Services Full time

We are seeking an experienced Site Reliability Engineer to join our team at Alltech Consulting Services. As a key member of our Application Infrastructure department, you will play a vital role in driving the reliability engineering, operations, and customer support services for our ServiceNow SaaS implementation.The ideal candidate will have experience in...
Site Reliability Engineer

4 weeks ago

Montreal, Canada LanceSoft, Inc. Full time

Location : Montreal (Hybrid 3 days)Duration: 12+ MonthsJob ProfileSystems Reliability Engineering (SRE) is a discipline focused on improving system service availability, observability, scalability, performance, and resilience across *** by applying sound software engineering principles and adopting the latest technology and tooling.Responsibilities:Are...
Site Reliability Engineer

4 weeks ago

Montreal, Canada LanceSoft, Inc. Full time

Location : Montreal (Hybrid 3 days)Duration: 12+ MonthsJob ProfileSystems Reliability Engineering (SRE) is a discipline focused on improving system service availability, observability, scalability, performance, and resilience across *** by applying sound software engineering principles and adopting the latest technology and tooling.Responsibilities:Are...
Site Reliability Engineer

2 weeks ago

Montreal, Canada Experience AI Solutions Full time

Senior Systems Administrator Start Date : as soon as possible Type of employment: permanent Location: Montreal, QC (hybrid model for working in the office) Number of Positions: 1 Language skills : Excellent English language skills Perks: Work for a multinational, award winning, socially responsible company with an operational presence in many...
Site Reliability Engineer

2 weeks ago

Montreal, Canada Experience AI Solutions Full time

Senior Systems AdministratorStart Date: as soon as possibleType of employment: permanentLocation: Montreal, QC (hybrid model for working in the office)Number of Positions: 1Language skills: Excellent English language skillsPerks: Work for a multinational, award winning, socially responsible company with an operational presence in many countries, having been...
Site Reliability Engineer

2 days ago

Montreal, Canada Experience AI Solutions Full time

Senior Systems AdministratorStart Date: as soon as possibleType of employment: permanentLocation: Montreal, QC (hybrid model for working in the office)Number of Positions: 1Language skills: Excellent English language skillsPerks: Work for a multinational, award winning, socially responsible company with an operational presence in many countries, having been...
Site Reliability Engineer

4 weeks ago

Montreal, Quebec, Québec, Canada LanceSoft, Inc. Full time

Location : Montreal (Hybrid 3 days)Duration: 12+ MonthsJob ProfileSystems Reliability Engineering (SRE) is a discipline focused on improving system service availability, observability, scalability, performance, and resilience across *** by applying sound software engineering principles and adopting the latest technology and tooling.Responsibilities:Are...
Technical Site Reliability Engineering

1 month ago

Montreal, Canada Ubisoft Entertainment Full time

h3>Technical Site Reliability Engineering (SRE) LeadFull-timeContract: PermanentFlexible Working Organization: HybridUbisoft’s 19,000 team members, working across more than 30 countries around the world, are bound by a common mission to enrich players’ lives with original and memorable gaming experiences. If you are excited about solving game-changing...
Site Reliability Engineer

2 months ago

Montreal, Canada National Bank Full time

As a Specialist in site reliability engineering on the National Bank Data Protection team, you will ensure the operational reliability of data protection assets. With your experience and knowledge in the operational management of high-availability assets (HA), you will have a positive impact on the Bank's stability and reputation with its internal and...

Americas

Europe

Asia / Oceania

Africa

Site Reliability Engineer