Staff Infrastructure Engineer, Observability Solutions Specialist

4 weeks ago


Toronto, Ontario, Canada Full time
Job Title: Staff Infrastructure Engineer, Observability

At EnergyJobLine, we're committed to creating a workplace that's as innovative as our technology. As a Staff Infrastructure Engineer, Observability, you'll play a pivotal role in shaping our observability capabilities, ensuring they align with business objectives and developers' requirements.

Responsibilities:
  1. Provide technical mentorship within the team and lead by example in developing robust, scalable, and efficient observability solutions.
  2. Drive cross-functional collaboration with engineering teams to advance our observability capabilities, ensuring they meet business objectives and developers' needs.
  3. Steer the observability roadmap and strategic direction, utilizing a comprehensive understanding of business context to influence key decisions and initiatives.
  4. Design, develop, and deploy advanced tooling and systems that enhance the reliability, scalability, and efficiency of our platform.
  5. Operate and improve our infrastructure using industry best practices and tools, setting standards for excellence.
  6. Document infrastructure operations processes and insights, identify repeatable actions, and lead the automation of repetitive tasks.
  7. Participate in our team's on-call rotations, respond to incidents, and provide expert support to other teams in mitigating customer-impacting events.
Requirements:
  1. 8+ years of experience in roles focused on software development, automation, and systems engineering, with a proven track record of technical leadership.
  2. Bachelor's Degree or equivalent experience in Computer Science or a relevant discipline, with a strong foundation in observability principles.
  3. Proven expertise in architecting and scaling observability infrastructure to support comprehensive monitoring and analysis in large production environments.
  4. Advanced proficiency in writing production-ready code in high-level languages such as Go or Python.
  5. Extensive experience operating large-scale infrastructure in public cloud environments such as AWS, including managed services like Amazon OpenSearch Service and Amazon Managed Service for Prometheus.
  6. Deep experience with Kubernetes and Envoy Proxy, managing multi-cluster environments in large-scale production settings.
  7. Familiarity with distributed storage technologies such as S3, RDS, DynamoDB, and Aurora, and with distributed configuration systems such as ZooKeeper and etcd.
  8. Expertise in deploying and managing monitoring, alerting, and logging systems at massive scale, such as Prometheus, Grafana, Kibana, Telegraf, and M3.
What We Offer:
  1. Extended health and dental coverage options, along with life insurance and disability benefits.
  2. Mental health benefits.
  3. Family building benefits.
  4. Access to a Health Care Savings Account.
  5. In addition to provincially observed holidays, team members get 15 days of paid time off, with an additional day for each year of service.
  6. 4 floating holidays each calendar year, prorated based on date of hire.
  7. 10 paid sick days per year regardless of province.
  8. 18 weeks of paid parental leave. Biological, adoptive, and foster parents are all eligible.

We proudly pursue and hire a diverse workforce. We believe that every person has a right to equal employment opportunities without discrimination because of race, ancestry, place of origin, colour, ethnic origin, citizenship, creed, sex, sexual orientation, gender identity, gender expression, age, marital status, family status, disability, pardoned record of offences, or any other basis protected by applicable law or by Company policy. We also strive for a healthy and safe workplace and strictly prohibit harassment of any kind. Accommodation for persons with disabilities will be provided upon request in accordance with applicable law during the application and hiring process. Please contact your recruiter now if you wish to make such a request.

This role will be in-office on a hybrid schedule: Team Members will be expected to work in the office 3 days per week, on Mondays, Thursdays, and a team-specific third day. Additionally, hybrid roles have the flexibility to work from anywhere for up to 4 weeks per year.



  • Old Toronto, Ontario, Canada Lyft Full time

    Job Title: Staff Infrastructure Engineer, Observability. At Lyft, we're committed to creating an open, inclusive, and diverse organization that empowers our community to improve people's lives with the world's best transportation. As a Staff Infrastructure Engineer on our Observability team, you'll play a pivotal role in building software to solve problems at...


  • Toronto, Ontario, Canada Lyft Full time

    At Lyft, our mission is to improve people's lives with the world's best transportation. To achieve this, we focus on creating an open, inclusive, and diverse organization. Our Infrastructure team is passionate about building software to solve complex problems at massive scale. We share our solutions with the community when we believe they can benefit...


  • Old Toronto, Ontario, Canada Lyft Full time

    About the Role: We are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As a key member of our team, you will be responsible for the operation and maintenance of our logging and metrics infrastructure. Key Responsibilities: Technical Leadership: Provide technical mentorship within the team and lead by example in developing...


  • Toronto, Ontario, Canada Full time

    Job Summary: At Lyft, we're passionate about building software to solve problems at massive scale. As an Observability team member, you'll be responsible for the operation and maintenance of our logging and metrics infrastructure. You'll ensure all teams at Lyft are aware of the operational health of their products by monitoring system availability and take a...


  • Toronto, Ontario, Canada Lyft Full time

    Job Title: Senior Infrastructure Engineer, Observability. At Lyft, we're committed to building a transportation platform that's reliable, efficient, and accessible to everyone. As a Senior Infrastructure Engineer, Observability, you'll play a critical role in ensuring the smooth operation of our infrastructure and services. Responsibilities: Design, develop, and...


  • Toronto, Ontario, Canada TSX Inc. Full time

    TSX Inc. Observability Engineer. Role Summary: The TSX Inc. Observability Engineer will play a crucial role in maintaining and improving the operational health of our applications and infrastructure. This position is responsible for setting up, configuring, and maintaining our monitoring and observability stack to ensure optimal system performance and...


  • Toronto, Ontario, Canada Lyft Full time

    Job Title: Infrastructure Engineer, Observability. At Lyft, we're passionate about building software to solve problems at massive scale. Our Infrastructure team is responsible for the operation and maintenance of our logging and metrics infrastructure. We're seeking an experienced Infrastructure Engineer to join our Observability team and help us improve our...


  • Toronto, Ontario, Canada Lyft Full time

    At Lyft, our mission is to provide the world's best transportation experience. To achieve this, we focus on building a diverse and inclusive organization. Our Infrastructure team is passionate about creating software solutions that solve complex problems at massive scale. We share our knowledge with the community by open-sourcing our ideas. As an...


  • Toronto, Ontario, Canada Lyft Full time

    About the Role: We are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As an Observability team member, you will be responsible for the operation and maintenance of our logging and metrics infrastructure. Key Responsibilities: Maintain and analyze metrics from operating systems, control planes, and applications to assist in...


  • Observability Engineer

    2 months ago


    Toronto, Ontario, Canada TSX Inc. Full time

    Job Title: Observability Engineer. At TSX Inc., we're seeking an experienced Observability Engineer to join our team. As an Observability Engineer, you will play a crucial role in maintaining and improving the operational health of our applications and infrastructure. Key Responsibilities: Develop and maintain robust monitoring solutions using Splunk, Splunk...

  • Observability Engineer

    2 months ago


    Toronto, Ontario, Canada TSX Inc. Full time

    Job Title: Observability Engineer. At TSX Inc., we're seeking a skilled Observability Engineer to join our team. As an Observability Engineer, you will play a crucial role in maintaining and improving the operational health of our applications and infrastructure. Key Responsibilities: Develop and maintain robust monitoring solutions using Splunk, Splunk...


  • Toronto, Ontario, Canada Lyft Full time

    About the Role: We are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As a key member of our team, you will be responsible for the operation and maintenance of our logging and metrics infrastructure. Your expertise will ensure that all teams at Lyft are aware of the operational health of their products by monitoring...

