Senior Infrastructure Engineer, Observability

2 months ago


Toronto, Ontario, Canada Lyft Full time
About the Role

We are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As a key member of our team, you will be responsible for the operation and maintenance of our logging and metrics infrastructure. Your expertise will ensure that all teams at Lyft are aware of the operational health of their products by monitoring system availability and taking a holistic view of our platform performance.

Responsibilities
  • Analyze metrics from operating systems, control planes, and applications to assist in fault detection and performance enhancement
  • Design, develop, and deploy tooling and systems that continually improve the reliability, scalability, and efficiency of our platform
  • Balance feature development speed and reliability with service-level objectives
  • Operate and enhance our Infrastructure using industry best practices and tools
  • Collaborate with cross-functional engineering teams to enhance Lyft's observability and meet developers' needs, ensuring alignment with design and production readiness reviews, platform management, and capacity planning
  • Document Infrastructure operations processes and insights, identify repeatable actions, and ruthlessly automate repetitive tasks
  • Participate in our teams' on-call rotations, respond to incidents, and support other teams in mitigating customer-impacting events
Requirements
  • 5+ years of experience working on teams responsible for software development, automation, and systems engineering
  • Bachelor's Degree or equivalent experience in Computer Science or a relevant discipline
  • Proficiency in creating production-ready code in one or more high-level languages, such as Go or Python
  • Experience operating large-scale infrastructure in public cloud environments, such as AWS, including familiarity with managed services like Amazon OpenSearch Service and Amazon Managed Service for Prometheus
  • Demonstrated expertise in building observability infrastructure at scale to support robust monitoring and analysis
  • In-depth experience with Kubernetes and Envoy Proxy, managing multi-cluster environments in large-scale production settings
  • Experience with distributed storage technologies such as S3, RDS, DynamoDB, Aurora, and distributed configuration systems such as Zookeeper and etcd
  • Experience using monitoring, alerting, and logging systems at massive scale, such as Prometheus, Grafana, Kibana, Telegraf, and M3
What We Offer
  • Extended health and dental coverage options, along with life insurance and disability benefits
  • Mental health benefits
  • Family building benefits
  • Access to a Health Care Savings Account
  • In addition to provincially observed holidays, team members get 15 days of paid time off, with an additional day for each year of service
  • 4 Floating Holidays each calendar year prorated based on date of hire
  • 10 paid sick days per year regardless of province
  • 18 weeks of paid parental leave. Biological, adoptive, and foster parents are all eligible
About Lyft

Lyft is a transportation network company that aims to improve people's lives with the world's best transportation. We are committed to creating an open, inclusive, and diverse organization that values our community and strives to make a positive impact on society.

We are an equal opportunity employer and welcome applications from diverse candidates. We believe that every person has the right to equal employment opportunities without discrimination because of race, ancestry, place of origin, color, ethnic origin, citizenship, creed, sex, sexual orientation, gender identity, gender expression, age, marital status, family status, disability, pardoned record of offenses, or any other basis protected by applicable law or by Company policy.


