Staff Infrastructure Engineer, Observability Specialist

5 days ago


Old Toronto, Ontario, Canada Lyft Full time
About the Role

We are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As a key member of our team, you will be responsible for the operation and maintenance of our logging and metrics infrastructure.

Key Responsibilities
  1. Technical Leadership: Provide technical mentorship within the team and lead by example in developing robust, scalable, and efficient observability solutions.
  2. Cross-Functional Collaboration: Drive cross-functional collaboration with engineering teams to advance Lyft's observability capabilities, ensuring they align with business objectives and developers' requirements.
  3. Roadmap and Strategy: Play a pivotal role in steering the observability roadmap and strategic direction, utilizing a comprehensive understanding of business context to influence key decisions and initiatives.
  4. Infrastructure Development: Design, develop, and deploy advanced tooling and systems that enhance the reliability, scalability, and efficiency of our platform.
  5. Infrastructure Operations: Operate and improve our infrastructure using industry best practices and tools, setting standards for excellence.
  6. Documentation and Automation: Document infrastructure operations processes and insights, identify repeatable actions, and lead the automation of repetitive tasks.
  7. On-Call Support: Participate in our team's on-call rotations, respond to incidents, and provide expert support to other teams in mitigating customer-impacting events.
Requirements
  • 8+ years of experience in roles focused on software development, automation, and systems engineering, with a proven track record of technical leadership.
  • Bachelor's Degree or equivalent experience in Computer Science or a relevant discipline, with a strong foundation in observability principles.
  • Proven expertise in architecting and scaling observability infrastructure to support comprehensive monitoring and analysis in large production environments.
  • Advanced proficiency in creating production-ready code in high-level languages, such as Go or Python.
  • Extensive experience operating large-scale infrastructure in public cloud environments, such as AWS, and with managed services such as Amazon OpenSearch Service and Amazon Managed Service for Prometheus.
  • Deep experience with Kubernetes and Envoy Proxy, managing multi-cluster environments in large-scale production settings.
  • Familiarity with distributed storage technologies such as S3, RDS, DynamoDB, Aurora, and distributed configuration systems such as Zookeeper and etcd.
  • Expertise in deploying and managing monitoring, alerting, and logging systems at massive scale, such as Prometheus, Grafana, Kibana, Telegraf, and M3.
What We Offer
  • Extended health and dental coverage options, along with life insurance and disability benefits.
  • Mental health benefits.
  • Family building benefits.
  • Access to a Health Care Savings Account.
  • In addition to provincially observed holidays, team members get 15 days of paid time off, with an additional day for each year of service.
  • 4 floating holidays each calendar year, prorated based on date of hire.
  • 10 paid sick days per year regardless of province.
  • 18 weeks of paid parental leave. Biological, adoptive, and foster parents are all eligible.


  • Toronto, Ontario, Canada Lyft Full time

    Infrastructure Engineer at Lyft: Lyft is on a mission to enhance transportation for people around the world. Our Infrastructure team is dedicated to building software that can tackle problems on a large scale. We believe in sharing our solutions with the community for the benefit of all. As a member of the Observability team at Lyft, you will be responsible for...


  • Toronto, Ontario, Canada Lyft Full time

    About the Role: We are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As a key member of our team, you will be responsible for the operation and maintenance of our logging and metrics infrastructure. Key Responsibilities: Provide technical mentorship within the team and lead by example in developing robust, scalable, and...


  • Toronto, Ontario, Canada Lyft Full time

    About the Role: We are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As a key member of our team, you will be responsible for designing, developing, and deploying advanced tooling and systems that enhance the reliability, scalability, and efficiency of our platform. Responsibilities: Provide technical mentorship within the...


  • Old Toronto, Ontario, Canada Lyft Full time

    At Lyft, we are dedicated to enhancing lives through exceptional transportation solutions. Our Infrastructure team is committed to developing software that addresses challenges at an extensive scale. When we create solutions that we believe can benefit the broader community, such as Envoy Proxy, we share our innovations through open-source contributions. As a...


  • Toronto, Ontario, Canada Lyft Full time

    About the Role: We are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As an Observability team member, you will be responsible for the operation and maintenance of our logging and metrics infrastructure. Key Responsibilities: Maintain and analyze metrics from operating systems, control planes, and applications to assist in...


  • Toronto, Ontario, Canada Lyft Full time

    About the Role: We are seeking an experienced Senior Infrastructure Engineer, Observability Specialist to join our team at Lyft. As a key member of our Infrastructure team, you will be responsible for the operation and maintenance of our logging and metrics infrastructure. Key Responsibilities: Maintain and Analyze Metrics: You will be responsible for maintaining...


  • Old Toronto, Ontario, Canada Lyft Full time

    About Lyft: At Lyft, we are dedicated to enhancing the lives of individuals through superior transportation solutions. Our Infrastructure team is committed to developing software that addresses challenges at an unprecedented scale. We believe in sharing our innovations with the community, as demonstrated by our open-source projects like Envoy Proxy. Role...


  • Old Toronto, Ontario, Canada Lyft Full time

    About Lyft: Lyft is dedicated to enhancing the lives of individuals through exceptional transportation solutions. Our Infrastructure team is committed to developing software that addresses challenges at a grand scale. We believe in sharing our innovations with the community, as demonstrated by our open-source projects like Envoy Proxy. Role Overview: As a...


  • Toronto, Ontario, Canada Lyft Full time

    About Lyft: At Lyft, our mission is to revolutionize the way people move around cities by providing a reliable, efficient, and enjoyable transportation experience. To achieve this, we rely on a robust and scalable infrastructure that enables us to process millions of requests every day. About the Role: We are seeking an experienced Infrastructure Engineer to join...


  • Toronto, Ontario, Canada Lyft Full time

    About the Role: We are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As a key member of our team, you will be responsible for the operation and maintenance of our logging and metrics infrastructure. Your expertise will ensure that all teams at Lyft are aware of the operational health of their products, and you will take...

