Infrastructure Engineer, Observability Expert

2 weeks ago


Toronto, Ontario, Canada Lyft Full time
About the Role

We are seeking an experienced Senior Infrastructure Engineer, Observability Specialist to join our team at Lyft. As a key member of our Infrastructure team, you will be responsible for the operation and maintenance of our logging and metrics infrastructure.

Key Responsibilities
  • Maintain and Analyze Metrics: You will be responsible for maintaining and analyzing metrics from operating systems, control planes, and applications to assist in fault detection and performance enhancement.
  • Design and Develop Tooling: You will design, develop, and deploy tooling and systems that continually improve the reliability, scalability, and efficiency of our platform.
  • Balance Feature Development and Reliability: You will balance feature development speed and reliability with service-level objectives.
  • Operate and Enhance Infrastructure: You will operate and enhance our Infrastructure using industry best practices and tools.
  • Collaborate with Cross-Functional Teams: You will collaborate with cross-functional engineering teams to enhance Lyft's observability and meet developers' needs.
  • Document Infrastructure Operations: You will document Infrastructure operations processes and insights, identify repeatable actions, and ruthlessly automate repetitive tasks.
  • Participate in On-Call Rotations: You will participate in our teams on-call rotations, respond to incidents, and support other teams mitigate customer-impacting events.
Requirements
  • 5+ Years of Experience: You will have 5+ years of experience working on teams responsible for software development, automation, and systems engineering.
  • Bachelor's Degree or Equivalent: You will have a Bachelor's Degree or equivalent experience in Computer Science or a relevant discipline.
  • Proficiency in High-Level Languages: You will have proficiency in creating production-ready code in one or more high-level languages, such as Go, Python.
  • Experience with Public Cloud Environments: You will have experience operating large-scale infrastructure in public cloud environments, such as AWS, including familiarity with Managed Services like Amazon OpenSearch Service and Amazon Managed Service for Prometheus.
  • Demonstrated Expertise in Observability: You will have demonstrated expertise in building observability infrastructure at scale to support robust monitoring and analysis.
  • Experience with Kubernetes and Envoy Proxy: You will have experience with Kubernetes and Envoy Proxy, managing multi-cluster environments in large-scale production settings.
  • Experience with Distributed Storage Technologies: You will have experience with distributed storage technologies such as S3, RDS, DynamoDB, Aurora, and distributed configuration systems such as Zookeeper and etcd.
  • Experience with Monitoring, Alerting, and Logging Systems: You will have experience using monitoring, alerting, and logging systems at massive-scale, such as Prometheus, Grafana, Kibana, Telegraph, and M3.
Benefits
  • Extended Health and Dental Coverage: You will have extended health and dental coverage options, along with life insurance and disability benefits.
  • Mental Health Benefits: You will have mental health benefits.
  • Family Building Benefits: You will have family building benefits.
  • Access to a Health Care Savings Account: You will have access to a Health Care Savings Account.
  • Paid Time Off: You will have 15 days paid time off, with an additional day for each year of service.
  • Floating Holidays: You will have 4 Floating Holidays each calendar year prorated based on date of hire.
  • Paid Sick Days: You will have 10 paid sick days per year regardless of province.
  • Paid Parental Leave: You will have 18 weeks of paid parental leave. Biological, adoptive, and foster parents are all eligible.
About Lyft

Lyft proudly pursues and hires a diverse workforce. We believe that every person has a right to equal employment opportunities without discrimination because of race, ancestry, place of origin, color, ethnic origin, citizenship, creed, sex, sexual orientation, gender identity, gender expression, age, marital status, family status, disability, pardoned record of offenses, or any other basis protected by applicable law or by Company policy.

We also strive for a healthy and safe workplace and strictly prohibit harassment of any kind. Accommodation for persons with disabilities will be provided upon request in accordance with applicable law during the application and hiring process.

This role will be in-office on a hybrid schedule — Team Members will be expected to work in the office 3 days per week on Mondays, Thursdays, and a team-specific third day. Additionally, hybrid roles have the flexibility to work from anywhere for up to 4 weeks per year.



  • Toronto, Ontario, Canada Lyft Full time

    Infrastructure Engineer at LyftLyft is on a mission to enhance transportation for people around the world. Our Infrastructure team is dedicated to building software that can tackle problems on a large scale. We believe in sharing our solutions with the community for the benefit of all.As a member of the Observability team at Lyft, you will be responsible for...


  • Old Toronto, Ontario, Canada Lyft Full time

    About the RoleWe are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As a key member of our team, you will be responsible for the operation and maintenance of our logging and metrics infrastructure.Key ResponsibilitiesTechnical Leadership: Provide technical mentorship within the team and lead by example in developing...


  • Old Toronto, Ontario, Canada Lyft Full time

    About the RoleWe are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As a key member of our team, you will be responsible for the operation and maintenance of our logging and metrics infrastructure.Key ResponsibilitiesTechnical Leadership: Provide technical mentorship within the team and lead by example in developing...


  • Toronto, Ontario, Canada Lyft Full time

    About the RoleWe are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As a key member of our team, you will be responsible for the operation and maintenance of our logging and metrics infrastructure.Key ResponsibilitiesProvide technical mentorship within the team and lead by example in developing robust, scalable, and...


  • Toronto, Ontario, Canada Lyft Full time

    About the RoleWe are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As a key member of our team, you will be responsible for the operation and maintenance of our logging and metrics infrastructure.Key ResponsibilitiesProvide technical mentorship within the team and lead by example in developing robust, scalable, and...


  • Old Toronto, Ontario, Canada Lyft Full time

    About Lyft: At Lyft, we are dedicated to enhancing lives through exceptional transportation solutions. Our commitment to fostering an open, inclusive, and diverse workplace drives our Infrastructure team to innovate and tackle challenges at scale. We take pride in sharing our advancements with the community, exemplified by our open-source projects like Envoy...


  • Old Toronto, Ontario, Canada Lyft Full time

    About Lyft: Lyft is dedicated to enhancing the lives of individuals through superior transportation solutions. Our commitment to fostering an open, inclusive, and diverse workplace is at the core of our mission. Role Overview: As a key member of the Observability team, you will oversee the operation and upkeep of our logging and metrics systems. Your role is...


  • Toronto, Ontario, Canada Lyft Full time

    About the RoleWe are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As an Observability team member, you will be responsible for the operation and maintenance of our logging and metrics infrastructure.Key ResponsibilitiesMaintain and analyze metrics from operating systems, control planes, and applications to assist in...


  • Toronto, Ontario, Canada Lyft Full time

    About the RoleWe are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As an Observability team member, you will be responsible for the operation and maintenance of our logging and metrics infrastructure.Key ResponsibilitiesMaintain and analyze metrics from operating systems, control planes, and applications to assist in...


  • Toronto, Ontario, Canada Lyft Full time

    About the RoleWe are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As a key member of our team, you will be responsible for the operation and maintenance of our logging and metrics infrastructure. Your expertise will ensure that all teams at Lyft are aware of the operational health of their products by monitoring...


  • Toronto, Ontario, Canada Lyft Full time

    About the RoleWe are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As a key member of our team, you will be responsible for the operation and maintenance of our logging and metrics infrastructure. Your expertise will ensure that all teams at Lyft are aware of the operational health of their products by monitoring...


  • Toronto, Ontario, Canada Lyft Full time

    About the RoleWe are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As a key member of our team, you will be responsible for designing, developing, and deploying advanced tooling and systems that enhance the reliability, scalability, and efficiency of our platform.ResponsibilitiesProvide technical mentorship within the...


  • Toronto, Ontario, Canada Lyft Full time

    About the RoleWe are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As a key member of our team, you will be responsible for designing, developing, and deploying advanced tooling and systems that enhance the reliability, scalability, and efficiency of our platform.ResponsibilitiesProvide technical mentorship within the...


  • Toronto, Ontario, Canada Lyft Full time

    About LyftAt Lyft, our mission is to revolutionize the way people move around cities by providing a reliable, efficient, and enjoyable transportation experience. To achieve this, we rely on a robust and scalable infrastructure that enables us to process millions of requests every day.About the RoleWe are seeking an experienced Infrastructure Engineer to join...


  • Toronto, Ontario, Canada Lyft Full time

    About LyftAt Lyft, our mission is to revolutionize the way people move around cities by providing a reliable, efficient, and enjoyable transportation experience. To achieve this, we rely on a robust and scalable infrastructure that enables us to process millions of requests every day.About the RoleWe are seeking an experienced Infrastructure Engineer to join...


  • Toronto, Ontario, Canada Lyft Full time

    About the RoleWe are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As a key member of our team, you will be responsible for the operation and maintenance of our logging and metrics infrastructure. Your expertise will ensure that all teams at Lyft are aware of the operational health of their products, and you will take...


  • Toronto, Ontario, Canada Lyft Full time

    About the RoleWe are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As a key member of our team, you will be responsible for the operation and maintenance of our logging and metrics infrastructure. Your expertise will ensure that all teams at Lyft are aware of the operational health of their products, and you will take...


  • Old Toronto, Ontario, Canada Lyft Full time

    About Lyft: At Lyft, we are dedicated to enhancing lives through superior transportation solutions. Our commitment begins with fostering a diverse, inclusive, and open environment within our community. The Infrastructure team is enthusiastic about developing software that addresses challenges at an extensive scale. We believe in sharing our successful...


  • Toronto, Ontario, Canada Lyft Full time

    About LyftAt Lyft, our mission is to revolutionize the way people move around cities by providing a reliable, efficient, and enjoyable transportation experience. To achieve this, we rely on a robust and scalable infrastructure that enables us to process millions of requests every day.About the RoleWe are seeking an experienced Infrastructure Engineer to join...


  • Toronto, Ontario, Canada Lyft Full time

    About LyftAt Lyft, our mission is to revolutionize the way people move around cities by providing a reliable, efficient, and enjoyable transportation experience. To achieve this, we rely on a robust and scalable infrastructure that enables us to process millions of requests every day.About the RoleWe are seeking an experienced Infrastructure Engineer to join...