SRE Lead

2 days ago


Halifax, Canada Haleon Full time

Welcome to Haleon. We’re a purpose-driven, world-class consumer company putting everyday health in the hands of millions. In just three years since our launch, we’ve grown, evolved and are now entering an exciting new chapter – one filled with bold ambitions and enormous opportunity.Our trusted portfolio of brands – including Sensodyne®, Panadol®, Advil®, Voltaren®, Theraflu®, Otrivin®, and Centrum® – lead in resilient and growing categories. What sets us apart is our unique blend of deep human understanding and trusted science.Now it’s time to fully realise the full potential of our business and our people. We do this through our Win as One strategy. It puts our purpose – to deliver better everyday health with humanity – at the heart of everything we do. It unites us, inspires us, and challenges us to be better every day, driven by our agile, performance-focused culture.Purpose of the Role:As an SRE Lead, in this newly created role, you will shape the future of Site Reliability Engineering( SRE) within our Commercial Tech organization.You will provide technical leadership and strategic direction in all aspects of site reliability engineering — from designing and implementing observability frameworks, automation, and incident response processes, to ensuring seamless delivery and stability of large-scale systems. You will play a pivotal role in shaping best practices, guiding cross-functional teams, and embedding reliability into every stage of the engineering lifecycle.Role responsibilities:This role will provide YOU the opportunity to lead key activities to progress YOUR career. These responsibilities include some of the following:Drive reliability, scalability, and performance across critical technology platforms to ensure seamless digital experiences.Lead the design and implementation of modern observability practices, with a particular focus on Datadog.Act as a bridge between development and operations, championing automation, resilience engineering, and incident management.Align reliability goals with business objectives while proactively identifying, troubleshooting, and resolving complex system issues.Build customized dashboards and configure advanced alerts (multi‑condition, anomaly detection, composite monitors).Use Application Performance Monitoring (APM) to trace distributed systems and implement log pipelines for troubleshooting.Leverage Datadog APIs for automation and CI/CD integration; connect with cloud providers (AWS, Azure, GCP), containers (Kubernetes, Docker), and serverless functions.Apply Datadog analytics for capacity planning, performance tuning, and cost optimization.Integrate Datadog with security monitoring, compliance dashboards, and business KPIs.Lead incident management using real‑time data to reduce MTTR.Coach teams on effective Datadog usage, establish observability standards and act as the go‑to expert for monitoring strategy.Define Datadog tagging standards to ensure consistent metadata, traceability, and cost allocation.Establish a framework for Datadog cost attribution, enabling transparency and accountability for monitoring expenses.Develop a Target Operating Model for observability, including ownership guidelines and a “who to contact” matrix.Create a structured logging strategy that identifies valuable logs, reduces noise, and ensures compliance with data privacy.Design a proactive alerting strategy to minimize end‑user incidents, reduce false positives, and prioritize actionable alerts.Set appropriate service and error thresholds for SLOs/SLAs and monitors, clearly defining failure criteria to align with business expectations.Basic Qualifications:We are looking for professionals with these required skills to achieve our goals:Min. Bachelor’s degree in computer science, Engineering, or related field8+ years in Site Reliability Engineering, DevOps, or Infrastructure roles, with at least 3 years in a leadership capacityDeep hands-on experience with Datadog for observability, monitoring, alerting, and performance optimizationExtensive knowledge of cloud platforms (AWS, Azure, or GCP) and container orchestration (Kubernetes, Docker)Proficiency in automation and configuration management tools (Terraform, Ansible, Chef, or equivalent)Solid understanding of CI/CD pipelines, distributed systems, and microservices architectureFamiliarity with scripting/programming languages (Python, Go, Shell, etc.) for automation and toolingExpertise in defining and managing SLIs, SLOs, and SLAsExperience in incident response, root cause analysis, and postmortem practicesSignificant background in performance tuning, capacity planning, and resilience engineeringDemonstrated ability to lead cross-functional teams and mentor engineersExcellent communication skills to collaborate across matrixed organizations with product, engineering, and business stakeholdersStrategic mindset with the ability to align reliability initiatives with organizational goalsEffective time management skillsExcellent written and verbal communications skills in EnglishExperience with agile/scrum techniquesPreferred Qualifications:If you have the following characteristics, it would be a plus:Master’s degree in computer science, Engineering, or related fieldExperience with other observability tools (Prometheus, Grafana, Splunk, etc.)Knowledge of ITIL practices and modern service management frameworksExposure to regulated industries (healthcare, pharma, consumer health)Our Win as One Frameworkis a simple, stretching definition of our future direction. It includes our purpose, ambitions, strategic drivers, and behaviors that will enable us to Win as One. This framework guides our decision-making, strategy, and culture. The successful candidate will demonstrate the following capabilities:Focused on consumer/ shopperCollaborates for better impactUnlocks value at paceFocused on growing him/herself and othersOpportunities for growth None of us should ever feel like we are standing still. Instead, we want Haleon to be a place where we feel like we are always progressing.Improving everyday health takes dedication. Energy. Effort. So we look to reward your contribution with a benefits package that includes:Career at one of the leading global healthcare companies Contract of employmentReward package (annual bonus that reflects Haleon’s and individual’s performance & awards for outstanding performance, recognition awards for additional achievements and engagement) Life insurance and pension planPrivate medical package with additional preventive healthcare services for employees and eligible persons.Sports cards (Multisport)Family benefits (extra parental leave, caregiver’s policy) Health and wellbeing programmes that take care of you physically and mentally< our philosophy to hybrid work (flexible approach)Possibilities of development within the role and company’s structureExtensive support of work life balance (flexible working solutions including working from home possibilities, health & wellbeing activities) Supportive community and integration events Modern office with creative rooms Remuneration: 19 650- 27 050 PLN gross/month, depending on the level of experience and competenciesLocation – this role is based in: Poland, Poznań#Li-Hybrid Job Posting End Date2026-02-21Equal OpportunitiesHaleon are committed to mobilising our purpose in a way that represents the diverse consumers and communities who rely on our brands every day. It guides us in creating an inclusive culture, where different backgrounds and views are valued and respected – all in support of understanding and best serving the needs of our consumers and unleashing the full potential of our people. It’s important to us that Haleon is a place where all our employees feel they truly belong. During the application process, we may ask you to share some personal information, which is entirely voluntary. This information ensures we meet certain regulatory and reporting obligations and supports the development, refinement, and execution of our inclusion and belonging programmes that are open to all Haleon employees. The personal information you provide will be kept confidential, used only for legitimate business purposes, and will never be used in making any employment decisions, including hiring decisions.Adjustment or Accommodations RequestIf you require a reasonable adjustment or accommodation or other assistance to apply for a job at Haleon at any stage of the application process, please let your recruiter know by providing them with a description of specific adjustments you are requesting. We’ll provide all reasonable adjustments to support you throughout the recruitment process and treat all information you provide us in confidence. Note to candidatesThe Haleon recruitment team will contact you using a Haleon email account (@haleon.com). If you are not sure whether the email you received is from Haleon, please get in touch.



  • Halifax, Canada Affirm Full time

    OverviewAffirm is reinventing credit to make it more honest and friendly, giving consumers the flexibility to buy now and pay later without any hidden fees or compounding interest.ResponsibilitiesSite Reliability Engineering at Affirm is a small, yet crucial, team that helps our Engineering partners to “Operate What They Own” with excellence to protect...


  • Halifax, Canada Affirm Full time

    Overview Affirm is reinventing credit to make it more honest and friendly, giving consumers the flexibility to buy now and pay later without any hidden fees or compounding interest. Responsibilities Site Reliability Engineering at Affirm is a small, yet crucial, team that helps our Engineering partners to “Operate What They Own” with excellence to...


  • Halifax, Canada Humankind Global Recruitment Full time

    Join to apply for the Site Reliability Engineer role at Humankind Global Recruitment . Linking exceptional talent with leading companies. Our client is a dynamic Information Technology services company that partners with leading global organizations to deliver innovative, high-quality IT solutions. We are looking for a Site Reliability Engineer. As a Site...


  • Halifax, Canada mthree Recruiting Portal Full time

    Want to work in technology at an investment bank Graduate training ongoing support opportunities at leading global employers the Alumni graduate program gives you everything you need. (And dont worry theres no training bond. No exit fees no hidden catches). Here at mthree we pair great graduates with brilliant global businesses. Our clients include tier one...

  • Site Release Manager

    2 weeks ago


    Halifax, Canada Cognizant Full time

    **Site Release Manager**: **In this role, you will**: - Analyze, design, code, test, and deploy new user stories and product features with high quality (security, reliability, operations) to production. Understands the software development lifecycle and leverages critical thinking skills to properly evaluate features and functionality. - Guides early-career...


  • Halifax, Canada Instacart Full time

    We're transforming the grocery industry At Instacart, we invite the world to share love through food because we believe everyone should have access to the food they love and more time to enjoy it together. Where others see a simple need for grocery delivery, we see exciting complexity and endless opportunity to serve the varied needs of our community. We...


  • Halifax, Canada Instacart Full time

    We're transforming the grocery industry At Instacart, we invite the world to share love through food because we believe everyone should have access to the food they love and more time to enjoy it together. Where others see a simple need for grocery delivery, we see exciting complexity and endless opportunity to serve the varied needs of our community. We...


  • Halifax, Canada AXIS Insurance Full time

    This is your opportunity to join AXIS Capital - a trusted global provider of specialty lines insurance and reinsurance. We stand apart for our outstanding client service, intelligent risk taking and superior risk adjusted returns for our shareholders. We also proudly maintain an entrepreneurial, disciplined and ethical corporate culture. As a member of AXIS,...

  • Senior SRE

    4 weeks ago


    Halifax, Canada Event Temple Full time

    A technology company is seeking a Senior Site Reliability Engineer to lead their web platform reliability efforts and application scalability strategy. This remote role requires expertise in web application architecture, performance optimization, and CI/CD management. Candidates should have over 5 years of experience and be ready to tackle a fast-paced...


  • Halifax, Canada Royal Bank of Canada Full time

    **What is the Opportunity?** Global Functions Technology (GFT) is part of RBC’s Technology and Operations division. GFT’s impact is far-reaching as we collaborate with partners from across the company to deliver innovative and transformative IT solutions. Our clients represent Risk, Finance, HR, CAO, Audit, Legal, Compliance, Financial Crime, Capital...