Site Reliability Engineer

2 days ago


Toronto, Canada Kyndryl Full time

Position: Site Reliability Engineer
Client: Financial Services - Capital Markets Technology
Duration: 12-month contract with potential extensions
Location: Toronto, Canada - 2 to 3 days onsite per week
Language: English
Hours: 37.5 hours/week

Our client is looking for a Site Reliability Engineer (SRE) to enhance the reliability, performance, and efficiency of mission-critical batch workloads across Capital Markets Technology. The SRE will serve as a technical lead focused on automation, application development, systems performance engineering, and observability using Dynatrace.

This position is pivotal in driving operational excellence and maturing reliability practices across the organization.

Qualifications

  • Expert-level Python skills, including performance tuning, concurrency (async/multiprocessing), testing, and packaging.
  • Strong Linux systems engineering expertise (kernel tuning, networking, process management, filesystem optimization).
  • Proven experience optimizing batch workloads for performance, reliability, and cost efficiency.
  • Deep knowledge of Dynatrace for observability (dashboards, KPIs, tagging, alerts, anomaly detection).
  • Hands-on experience with Apache Airflow (DAG design, scheduler tuning, SLA management).
  • Strong understanding of distributed systems concepts: retries, idempotency, backpressure, data integrity.
  • Experience with CI/CD pipelines (GitHub Actions, Azure DevOps, Jenkins) and Infrastructure as Code (Terraform, Ansible).
  • Familiarity with containers and orchestration tools (Docker, Kubernetes).
  • Excellent incident management, troubleshooting, and communication skills.

Responsibilities

  • Reliability & Performance: Engineer resilient and performant batch processing pipelines by reducing runtime and minimizing failures.
  • Observability: Implement and maintain Dynatrace dashboards, alerts, and runbooks to ensure deep visibility into system health.
  • Systems Engineering: Configure and tune Linux and Windows environments for optimal reliability and speed.
  • Automation & Orchestration: Design and refine Airflow DAGs, automate deployments with CI/CD pipelines, and reduce operational toil through code.
  • Incident Management: Lead incident response, conduct root-cause analysis, and implement improvements based on post-mortems and SLOs.
  • Security & Compliance: Ensure all reliability and automation processes adhere to security best practices and regulatory compliance standards.

Please note that this is a contract position with one of our clients, not a full-time employment role with Kyndryl Canada.

Seniority level: Mid-Senior level
Employment type: Contract
Job function: Information Technology
Industries: IT Services and IT Consulting




  • (s): Canada : Ontario : Toronto Scotiabank Global Site Full time

    Requisition ID: 245210. Join a purpose-driven winning team, committed to results, in an inclusive and high-performing culture. The Team: Global Banking and Markets Engineering (GBME) is the fast-moving, award-winning technology engine that powers Scotiabank's Corporate, Investment Banking and Capital Markets businesses. The Role: GBME is searching for a Site...


  • (s): Canada : Ontario : Toronto Scotiabank Global Site Full time

    Requisition ID: 244026. Join a purpose-driven winning team, committed to results, in an inclusive and high-performing culture. Overview: As a Site Reliability Engineer (SRE), you will join the Digital Engineering Operations team, responsible for ensuring the operations and reliability of Scotiabank digital applications. You will have the opportunity to drive...


  • (s): Canada : Ontario : Toronto Scotiabank Global Site Full time

    Requisition ID: 244027. Join a purpose-driven winning team, committed to results, in an inclusive and high-performing culture. Overview: As a Site Reliability Engineer (SRE), you will join the Digital Engineering Operations team, responsible for ensuring the operations and reliability of Scotiabank digital applications. You will have the opportunity to drive...


  • (s): Canada : Ontario : Toronto Scotiabank Global Site Full time

    Requisition ID: 247129. Join a purpose-driven winning team, committed to results, in an inclusive and high-performing culture. As an SRE, you will implement, measure, and gather insights from Operational Level Indicators, identifying areas for service improvement covering availability, performance, resilience, incidents, and chronic problems. You will implement...


  • Toronto, Ontario, Canada Aarorn Technologies Inc Full time

    Job Title: Site-Reliability Engineer (SRE) Location: Toronto, ON (3x onsite a week) Employment Type: Contract Job Description: We are seeking a highly skilled Site Reliability Engineer (SRE) to enhance the reliability, performance, and efficiency of mission-critical batch workloads within Capital Markets Technology. In this role, you will serve as the technical...


  • Toronto, Canada Global Technical Talent Full time

    Primary Job Title Site Reliability Engineer IV Alternate / Related Job Titles Site Reliability Engineer Senior SRE IT Reliability Engineer Systems Integration Engineer Location & Onsite Flexibility Toronto, ON — Hybrid (4 days onsite) Office Address: 66 Wellington Street West, 19th Floor, Toronto, ON Contract Details Position Type: Contract Contract...


  • Toronto, Canada Moneris Full time

    Your Moneris Career - The Opportunity: As the Site Reliability Engineer (SRE), you will play a crucial role in ensuring the reliability, performance, and scalability of our systems. You will work closely with development and operations teams to build and maintain robust infrastructure, automate processes, and improve overall system health. Location: You will be...


  • Toronto, Canada Aarorn Technologies Inc Full time

    Job Title: Site Reliability Engineer Location: Toronto, ON (3x onsite a week) Employment Type: Contract Pay Rate: CAD$40/HR INC Job Description We are seeking a skilled Site Reliability Engineer (SRE) to enhance the reliability, scalability, and performance of our systems and applications. The ideal candidate will have strong experience in automation, cloud...


  • Toronto, Ontario, Canada Global Technical Talent, an Inc. 5000 Company Full time

    Primary Job Title: Site Reliability Engineer IV Alternate / Related Job Titles: Site Reliability Engineer, Senior SRE, IT Reliability Engineer, Systems Integration Engineer Location & Onsite Flexibility: Toronto, ON - Hybrid (4 days onsite) Office Address: 66 Wellington Street West, 19th Floor, Toronto, ON Contract Details Position Type: Contract Contract...