Infrastructure Engineer, Observability Specialist

2 weeks ago


Toronto, Ontario, Canada Lyft Full time

About Lyft

At Lyft, our mission is to revolutionize the way people move around cities by providing a reliable, efficient, and enjoyable transportation experience. To achieve this, we rely on a robust and scalable infrastructure that enables us to process millions of requests every day.

About the Role

We are seeking an experienced Infrastructure Engineer to join our Observability team. As an Observability Engineer, you will be responsible for designing, building, and maintaining the systems that enable us to monitor and analyze our infrastructure performance. Your primary goal will be to ensure that our platform is highly available, scalable, and reliable, and that we can quickly identify and resolve any issues that may arise.

Key Responsibilities

  • Maintain and Analyze Metrics: You will be responsible for maintaining and analyzing metrics from operating systems, control planes, and applications to assist in fault detection and performance enhancement.
  • Develop and Improve Tooling: You will develop and improve tooling and systems that enhance the reliability, scalability, and efficiency of our platform.
  • Collaborate with Engineering Teams: You will collaborate with cross-functional engineering teams to enhance Lyft's observability and meet developers' needs, ensuring alignment with design and production readiness reviews, platform management, and capacity planning.
  • Document Infrastructure Operations: You will maintain and improve our documentation at a world-class level by documenting infrastructure operations processes and insights.
  • Participate in On-Call Rotations: You will participate in our team's on-call rotations, respond to incidents, and support other teams to mitigate customer-impacting events.

Requirements

  • 3+ Years of Experience: You should have at least 3 years of experience working on teams responsible for software development, automation, and systems engineering.
  • Proficiency in Programming Languages: You should be proficient in creating production-ready code in one or more high-level languages, such as Go or Python.
  • Experience with Cloud Infrastructure: You should have experience operating infrastructure in public cloud environments, such as AWS, including familiarity with Managed Services.
  • Familiarity with Kubernetes: You should be familiar with Kubernetes and managing multi-cluster environments in production settings.
  • Experience with Monitoring and Logging Systems: You should have experience using monitoring, alerting, and logging systems such as Prometheus, Grafana, Kibana, and Telegraph.

Benefits

  • Extended Health and Dental Coverage: We offer extended health and dental coverage options, along with life insurance and disability benefits.
  • Mental Health Benefits: We provide mental health benefits to support our employees' well-being.
  • Family Building Benefits: We offer family building benefits to support our employees' family planning needs.
  • Access to a Health Care Savings Account: We provide access to a Health Care Savings Account to help our employees save for medical expenses.
  • Paid Time Off: We offer 15 days paid time off, with an additional day for each year of service.
  • Floating Holidays: We provide 4 Floating Holidays each calendar year prorated based on date of hire.
  • Paid Sick Days: We offer 10 paid sick days per year regardless of province.
  • Paid Parental Leave: We provide 18 weeks of paid parental leave for biological, adoptive, and foster parents.

Lyft's Commitment to Diversity and Inclusion

Lyft is an equal opportunity employer and welcomes applications from diverse candidates. We are committed to creating an inclusive and diverse workplace where everyone feels valued and respected. We strive to provide a healthy and safe workplace and strictly prohibit harassment of any kind. Accommodation for persons with disabilities will be provided upon request in accordance with applicable law during the application and hiring process.



  • Toronto, Ontario, Canada Lyft Full time

    About the RoleWe are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As an Observability team member, you will be responsible for the operation and maintenance of our logging and metrics infrastructure.Key ResponsibilitiesMaintain and analyze metrics from operating systems, control planes, and applications to assist in...


  • Toronto, Ontario, Canada Lyft Full time

    About the RoleWe are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As an Observability team member, you will be responsible for the operation and maintenance of our logging and metrics infrastructure.Key ResponsibilitiesMaintain and analyze metrics from operating systems, control planes, and applications to assist in...


  • Toronto, Ontario, Canada Lyft Full time

    About the RoleWe are seeking an experienced Senior Infrastructure Engineer, Observability Specialist to join our team at Lyft. As a key member of our Infrastructure team, you will be responsible for the operation and maintenance of our logging and metrics infrastructure.Key ResponsibilitiesMaintain and Analyze Metrics: You will be responsible for maintaining...


  • Toronto, Ontario, Canada Lyft Full time

    About the RoleWe are seeking an experienced Senior Infrastructure Engineer, Observability Specialist to join our team at Lyft. As a key member of our Infrastructure team, you will be responsible for the operation and maintenance of our logging and metrics infrastructure.Key ResponsibilitiesMaintain and Analyze Metrics: You will be responsible for maintaining...


  • Toronto, Ontario, Canada Lyft Full time

    Infrastructure Engineer at LyftLyft is on a mission to enhance transportation for people around the world. Our Infrastructure team is dedicated to building software that can tackle problems on a large scale. We believe in sharing our solutions with the community for the benefit of all.As a member of the Observability team at Lyft, you will be responsible for...


  • Toronto, Ontario, Canada Lyft Full time

    About the RoleWe are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As a key member of our team, you will be responsible for the operation and maintenance of our logging and metrics infrastructure. Your expertise will ensure that all teams at Lyft are aware of the operational health of their products, and you will take...


  • Toronto, Ontario, Canada Lyft Full time

    About the RoleWe are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As a key member of our team, you will be responsible for the operation and maintenance of our logging and metrics infrastructure. Your expertise will ensure that all teams at Lyft are aware of the operational health of their products, and you will take...


  • Toronto, Ontario, Canada Lyft Full time

    About LyftAt Lyft, our mission is to revolutionize the way people move around cities by providing a reliable, efficient, and enjoyable transportation experience. To achieve this, we rely on a robust and scalable infrastructure that enables us to process millions of requests every day.About the RoleWe are seeking an experienced Infrastructure Engineer to join...


  • Toronto, Ontario, Canada Lyft Full time

    About LyftAt Lyft, our mission is to revolutionize the way people move around cities by providing a reliable, efficient, and enjoyable transportation experience. To achieve this, we rely on a robust and scalable infrastructure that enables us to process millions of requests every day.About the RoleWe are seeking an experienced Infrastructure Engineer to join...


  • Toronto, Ontario, Canada Lyft Full time

    About the RoleWe are seeking an experienced Senior Infrastructure Engineer, Observability Specialist to join our team at Lyft. As a key member of our Infrastructure team, you will be responsible for the operation and maintenance of our logging and metrics infrastructure.Key ResponsibilitiesMaintain and Analyze Metrics: You will be responsible for maintaining...


  • Toronto, Ontario, Canada Lyft Full time

    About the RoleWe are seeking an experienced Senior Infrastructure Engineer, Observability Specialist to join our team at Lyft. As a key member of our Infrastructure team, you will be responsible for the operation and maintenance of our logging and metrics infrastructure.Key ResponsibilitiesMaintain and Analyze Metrics: You will be responsible for maintaining...


  • Old Toronto, Ontario, Canada Lyft Full time

    About the RoleWe are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As a key member of our team, you will be responsible for the operation and maintenance of our logging and metrics infrastructure.Key ResponsibilitiesTechnical Leadership: Provide technical mentorship within the team and lead by example in developing...


  • Old Toronto, Ontario, Canada Lyft Full time

    About the RoleWe are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As a key member of our team, you will be responsible for the operation and maintenance of our logging and metrics infrastructure.Key ResponsibilitiesTechnical Leadership: Provide technical mentorship within the team and lead by example in developing...


  • Toronto, Ontario, Canada Lyft Full time

    About the RoleWe are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As a key member of our team, you will be responsible for the operation and maintenance of our logging and metrics infrastructure.Key ResponsibilitiesProvide technical mentorship within the team and lead by example in developing robust, scalable, and...


  • Toronto, Ontario, Canada Lyft Full time

    About the RoleWe are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As a key member of our team, you will be responsible for the operation and maintenance of our logging and metrics infrastructure.Key ResponsibilitiesProvide technical mentorship within the team and lead by example in developing robust, scalable, and...


  • Old Toronto, Ontario, Canada Lyft Full time

    At Lyft, we are dedicated to enhancing lives through exceptional transportation solutions. Our Infrastructure team is committed to developing software that addresses challenges at an extensive scale. When we create solutions that we believe can benefit the broader community, such as Envoy Proxy, we share our innovations through open-source contributions.As a...


  • Toronto, Ontario, Canada Lyft Full time

    About the RoleWe are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As a key member of our team, you will be responsible for the operation and maintenance of our logging and metrics infrastructure. Your expertise will ensure that all teams at Lyft are aware of the operational health of their products by monitoring...


  • Toronto, Ontario, Canada Lyft Full time

    About the RoleWe are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As a key member of our team, you will be responsible for the operation and maintenance of our logging and metrics infrastructure. Your expertise will ensure that all teams at Lyft are aware of the operational health of their products by monitoring...


  • Toronto, Ontario, Canada Lyft Full time

    About the RoleWe are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As a key member of our team, you will be responsible for designing, developing, and deploying advanced tooling and systems that enhance the reliability, scalability, and efficiency of our platform.ResponsibilitiesProvide technical mentorship within the...


  • Toronto, Ontario, Canada Lyft Full time

    About the RoleWe are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As a key member of our team, you will be responsible for designing, developing, and deploying advanced tooling and systems that enhance the reliability, scalability, and efficiency of our platform.ResponsibilitiesProvide technical mentorship within the...