Observability Engineer

3 days ago


Toronto, Ontario, Canada TSX Inc. Full time
Job Title: Observability Engineer

At TSX Inc., we're seeking a skilled Observability Engineer to join our team. As an Observability Engineer, you will play a crucial role in maintaining and improving the operational health of our applications and infrastructure.

Key Responsibilities:
  • Develop and maintain robust monitoring solutions using Splunk, Splunk Observability, Grafana, AWS CloudWatch, and Prometheus.
  • Implement, maintain, and consult on the observability and monitoring framework that supports the needs of multiple internal stakeholders.
  • Create and manage dashboards and visualizations to provide actionable insights into system health, performance, and operational efficiency.
  • Help manage the Event, Incident, and Operations Escalation Management Policies.
  • Grow and evangelize the capabilities of our observability tools and platforms.
  • Collaborate with development and operations teams to integrate observability tools into the development lifecycle for continuous improvement.
  • Translate business requirements into technical solutions applying best practices and standards that meet the strategic business goal.
  • Conduct performance analysis, diagnose issues, and provide solutions to enhance system reliability and scalability.
  • Document observability best practices and maintain configuration documentation.
  • Provide 2nd and 3rd level systems support.
  • Liaise with vendors and other IT personnel for problem resolution.
Requirements:
  • Proven experience with key observability and monitoring tools such as Splunk, Splunk Observability, Otel, Grafana, AWS CloudWatch, and Prometheus.
  • Strong understanding of cloud environments, preferably AWS, including deployment, management, and operations.
  • Proficient in creating and managing monitoring dashboards and setting up alerts to monitor all phases of the environment.
  • Solid background in scripting and automation using languages such as Python, Bash, or similar.
  • Excellent problem-solving skills, with the ability to handle complex troubleshooting and make critical system-related decisions.
  • Familiarity with configuration languages such as Ansible, and Terraform.
  • Linux Operating System knowledge (RedHat Linux preferred).
  • Experience with Source Control Systems and familiar with basic branching and merging strategies. (Git, GitLab, Github, Bitbucket)
  • Strong communication skills, capable of effectively articulating technical challenges and solutions to stakeholders.
Preferred Qualifications:
  • Bachelor's degree in Computer Science, Engineering, or a related technical field.
  • 5+ years of experience in systems engineering/administration, platform/cloud/devops engineering, or a related field.
  • Relevant certifications in Splunk, AWS, or similar technologies.
  • Experience with additional observability and monitoring tools is a plus.
About TSX Inc.

TSX Inc. is a leading global exchange company that operates a diverse portfolio of exchanges, clearinghouses, and other financial market infrastructure businesses. We are committed to creating and sustaining a collegial work environment in which all individuals are treated with dignity and respect and one which reflects the diversity of the community in which we operate.

We provide accommodations for applicants and employees who require it.



  • Toronto, Ontario, Canada TSX Inc. Full time

    About the RoleThe TSX Inc. Observability Engineer will play a crucial role in maintaining and improving the operational health of our applications and infrastructure. This position is responsible for setting up, configuring, and maintaining our monitoring and observability stack to ensure optimal system performance and reliability.Key ResponsibilitiesDevelop...


  • Toronto, Ontario, Canada TSX Inc. Full time

    About the RoleThe TSX Inc. Observability Engineer will play a crucial role in maintaining and improving the operational health of our applications and infrastructure. This position is responsible for setting up, configuring, and maintaining our monitoring and observability stack to ensure optimal system performance and reliability.Key ResponsibilitiesDevelop...


  • Toronto, Ontario, Canada Data Theorem Full time

    iOS Engineer Data Theorem is a pioneering company dedicated to safeguarding the world's data. Our engineer-first culture empowers every employee to drive product innovation and direction. We're seeking exceptional talent to join our team and take ownership of projects that resonate with them. As an iOS engineer, you will be responsible for enhancing Data...


  • Toronto, Ontario, Canada Data Theorem Full time

    Data Theorem is an exciting company focused on creating a more secure world for data. Rooted in a strong engineer first culture, every employee has an impact on product and direction. We are searching for exceptional talent pursuing an opportunity to grow and take ownership of the projects that resonate most with them.As an iOS engineer, you will be...


  • Toronto, Ontario, Canada Rackspace Technology Full time

    Job Summary:Rackspace Technology is seeking a highly skilled Observability Engineer with expertise in OpenTelemetry to join our team. As an Observability Engineer, you will be responsible for designing and implementing observability and monitoring solutions on the Google Cloud Platform (GCP), leveraging OpenTelemetry frameworks.Key Responsibilities:Design...


  • Toronto, Ontario, Canada Rackspace Technology Full time

    Job Summary:Rackspace Technology is seeking a highly skilled Observability Engineer with expertise in OpenTelemetry to join our team. As an Observability Engineer, you will be responsible for designing and implementing observability and monitoring solutions on the Google Cloud Platform (GCP), leveraging OpenTelemetry frameworks.Key Responsibilities:Design...


  • Toronto, Ontario, Canada mccainfood Full time

    Job Title: Senior Engineering Manager, SRE & ObservabilityWe are seeking a highly skilled Senior Engineering Manager to lead our SRE & Observability team. As a key member of our Global Technology department, you will be responsible for designing, implementing, and monitoring enterprise-grade secure fault-tolerant SRE and Observability infrastructure.Key...


  • Toronto, Ontario, Canada mccainfood Full time

    Job Title: Senior Engineering Manager, SRE & ObservabilityWe are seeking a highly skilled Senior Engineering Manager to lead our SRE & Observability team. As a key member of our Global Technology department, you will be responsible for designing, implementing, and monitoring enterprise-grade secure fault-tolerant SRE and Observability infrastructure.Key...


  • Toronto, Ontario, Canada mccainfood Full time

    Job SummaryWe are seeking a highly skilled Senior Engineering Manager to lead our Site Reliability Engineering (SRE) and Observability team at McCain Foods. As a key member of our Global Technology department, you will be responsible for designing, implementing, and monitoring enterprise-grade secure fault-tolerant SRE and Observability infrastructure.Key...


  • Toronto, Ontario, Canada mccainfood Full time

    Job SummaryWe are seeking a highly skilled Senior Engineering Manager to lead our Site Reliability Engineering (SRE) and Observability team at McCain Foods. As a key member of our Global Technology department, you will be responsible for designing, implementing, and monitoring enterprise-grade secure fault-tolerant SRE and Observability infrastructure.Key...


  • Toronto, Ontario, Canada Lyft Full time

    About the RoleWe are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As an Observability team member, you will be responsible for the operation and maintenance of our logging and metrics infrastructure.Key ResponsibilitiesMaintain and analyze metrics from operating systems, control planes, and applications to assist in...


  • Toronto, Ontario, Canada Lyft Full time

    About the RoleWe are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As an Observability team member, you will be responsible for the operation and maintenance of our logging and metrics infrastructure.Key ResponsibilitiesMaintain and analyze metrics from operating systems, control planes, and applications to assist in...


  • Toronto, Ontario, Canada mccainfood Full time

    Job SummaryWe are seeking a highly skilled Senior Engineering Manager to lead our Site Reliability Engineering (SRE) and Observability team at McCain Foods. As a key member of our Global Technology department, you will be responsible for designing, implementing, and monitoring enterprise-grade secure fault-tolerant SRE and Observability infrastructure.Key...


  • Toronto, Ontario, Canada mccainfood Full time

    Job SummaryWe are seeking a highly skilled Senior Engineering Manager to lead our Site Reliability Engineering (SRE) and Observability team at McCain Foods. As a key member of our Global Technology department, you will be responsible for designing, implementing, and monitoring enterprise-grade secure fault-tolerant SRE and Observability infrastructure.Key...


  • Toronto, Ontario, Canada Theorem, LLC Full time

    **About Theorem, LLC**Theorem, LLC is a pioneering company dedicated to creating a more secure digital landscape. Our engineer-driven culture empowers every team member to make a meaningful impact on our products and direction.We're seeking exceptional talent to join our team and take ownership of projects that align with their passions.**Job Summary**As an...


  • Toronto, Ontario, Canada Theorem, LLC Full time

    **About Theorem, LLC**Theorem, LLC is a pioneering company dedicated to creating a more secure digital landscape. Our engineer-driven culture empowers every team member to make a meaningful impact on our products and direction.We're seeking exceptional talent to join our team and take ownership of projects that align with their passions.**Job Summary**As an...


  • Toronto, Ontario, Canada Lyft Full time

    Infrastructure Engineer at LyftLyft is on a mission to enhance transportation for people around the world. Our Infrastructure team is dedicated to building software that can tackle problems on a large scale. We believe in sharing our solutions with the community for the benefit of all.As a member of the Observability team at Lyft, you will be responsible for...


  • Toronto, Ontario, Canada Lyft Full time

    About the RoleWe are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As a key member of our team, you will be responsible for the operation and maintenance of our logging and metrics infrastructure. Your expertise will ensure that all teams at Lyft are aware of the operational health of their products by monitoring...


  • Toronto, Ontario, Canada Lyft Full time

    About the RoleWe are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As a key member of our team, you will be responsible for the operation and maintenance of our logging and metrics infrastructure. Your expertise will ensure that all teams at Lyft are aware of the operational health of their products by monitoring...


  • Toronto, Ontario, Canada Lyft Full time

    About the RoleWe are seeking an experienced Infrastructure Engineer to join our Observability team at Lyft. As a key member of our team, you will be responsible for the operation and maintenance of our logging and metrics infrastructure. Your expertise will ensure that all teams at Lyft are aware of the operational health of their products, and you will take...