Site Reliability Engineer

3 days ago


Toronto, Ontario, Canada Resonaite Full time $120,000 - $180,000 per year

Our client in the professional services sector is seeking an SRE to enhance service resiliency, automation, and operational excellence across critical cloud and on-prem workloads.

This role focuses on stability engineering, cloud migration readiness, observability, continuous improvement, and L2 production support.

Location: Hybrid 3d Toronto

Duration: 6 months + possible extension

Responsibilities

  • Drive service stability, automation, and optimization initiatives, ensuring compliance with SLAs and operational best practices.
  • Support AWS cloud workload migrations by validating readiness, ensuring observability coverage, and developing Day-2 runbook automation.
  • Collaborate with DevSecOps and Architecture teams to operationalize cloud-native and migrated applications using automated deployment, monitoring, and recovery pipelines.
  • Implement scalable solutions to reduce manual intervention, improve deployment efficiency, and enable self-healing and auto-scaling.
  • Use tools such as Splunk, Dynatrace, and Grafana to optimize performance, implement anomaly detection, and proactively address production issues.
  • Analyze trends from testing and production environments, conduct root-cause investigations, and recommend corrective actions to Agile squads.
  • Maintain clear technical and operational documentation including runbooks, SOPs, post-mortems, and architecture overviews.
  • Provide leadership in vulnerability remediation, security alignment, and technology lifecycle management.
  • Participate in a 24/7 rotating on-call schedule, providing L2 support and rapid response to production incidents.

Required Skills & Qualifications

  • Strong AppOps experience supporting highly resilient and high-performance workloads on
    AWS
    .
  • Proficiency with Git, PowerShell, Python, Ansible, Terraform, Docker, and microservices patterns.
  • Hands-on experience with Splunk, Dynatrace, Grafana, and ServiceNow.
  • Solid understanding of
    AWS ECS architecture
    , autoscaling, load balancing, and VPC integration.
  • Strong knowledge of Agile methodologies, SDLC processes, release management, and incident/problem/change management.
  • Ability to analyze data, identify issues early, and mitigate risks in production environments.
  • Excellent communication skills for working across technical and business stakeholders.

Nice to Have

  • SRE certification
  • Experience with OpenShift Kubernetes
  • Familiarity with DevOps concepts and CI/CD
  • Knowledge of Oracle or PostgreSQL
  • AWS certifications
  • Financial Services or Payments experience
  • ITIL certification


  • Toronto, Ontario, Canada Procom Full time $80,000 - $120,000 per year

    Site Reliability Engineer (SRE)/ Ingénieur Fiabilité des SitesOn behalf of our banking client, Procom is seeking a Site Reliability Engineer (SRE) for a 12-month contract role. This position is a hybrid role, 3 days a week onsite at our client's Montréal, Quebec office.Site Reliability Engineer - Job Description:The Site Reliability Engineer is...


  • Toronto, Ontario, Canada Maneva Full time US$80,000 - US$120,000 per year

    About ManevaManeva builds and deploys edge AI solutions powering real-time intelligence for industrial environments. Our systems run on distributed edge compute devices (NVIDIA Jetson platforms), integrate with local network cameras, PLCs, sensors, and other on-premise equipment, and securely communicate with cloud services via client- or site-based VPNs....


  • Toronto, Ontario, Canada Tecsys Inc. Full time $85,000 - $130,000 per year

    Having recognized the advantages of remote work, including employee morale, productivity, reduced commuting on employee wellbeing and the environment, we are proud to be a digital-first company. The technologies and programs in which we invested have provided a fantastic foundation to this end. Our digital-first work environment, together with our...


  • Toronto, Ontario, Canada Apptoza Inc. Full time $30,000 - $120,000 per year

    HI,Hope you are doing Great,If you are fine with below JD please share me your Updated resume ASAP.Site Reliability EngineerLocation: TORONTO (ONSITE)Duration: 6 monthsExp Required: 10 YearsJob Description: Job Title : SRETechnical/Functional Skills• 8+ years of overall IT experience.• Advanced Linux / Unix support experience required.• Strong shell...


  • Toronto, Ontario, Canada Moneris Full time $80,000 - $120,000 per year

    Your Moneris Career - The OpportunityWe are looking for a Site Reliability Engineer (SRE) to join our dynamic team. As an SRE, you will help ensure the reliability, performance, and scalability of our systems. You will work with development and operations teams to build and maintain robust infrastructure, automate processes, and improve overall system...


  • Toronto, Ontario, Canada Xplor Full time $125,000 - $150,000

    Company Description Take a seat on the Xplor rocketship and join us as Site Reliability Engineer to help people succeed across the world.From dropping your kids off at childcare, getting something at home repaired, going to the gym or a fitness studio, to picking up your dry cleaning — our software, payments, and commerce-enabling solutions help everyday...


  • Toronto, Ontario, Canada Pixomondo Full time $120,000 - $180,000 per year

    We're seeking an experienced Site Reliability Engineer to join our team and lead infrastructure automation, CI/CD workflows, and deployment operations for a custom web platform. You'll be working with a modern DevOps stack including GitHub Actions, GCP, Kubernetes, Terraform, PostgreSQL, CodeDeploy, and Cloudflare to ensure our platform is robust, scalable,...


  • Toronto, Ontario, Canada Kablamo Full time $90,000 - $120,000 per year

    Reports to: Technical Support ManagerLocation: Toronto (Hybrid)Role Type: Full timeLevel: Intermediate/MidIntroductionKablamo is a fast-growing cloud digital product development company. Founded in 2017 in Australia, the business has grown quickly over the last several years, including the expansion of the team to Canada in 2021. We are proud to have...


  • Toronto, Ontario, Canada McCain Foods Full time $102,700 - $137,000 per year

    Position Title:Site Reliability EngineerPosition Type:Regular - Full-TimePosition Location:Toronto HQRequisition ID:36904Our Global Technology team's goal is to leverage technology and data to drive profitable growth, focus on enhancing customer experience and to further our purpose of 'Celebrating real connections through delicious, planet-friendly food'....


  • Toronto, Ontario, Canada AceStack Full time $120,000 - $200,000 per year

    Job Title: Lead Site Reliability Engineer – Banking Domain (Wealth Management Preferred)Location: Toronto Downtown, ON (Onsite – 5 Days/Week)Duration: ContractExperience: 14+ YearsAbout the Role:We are looking for a highly skilled Site Reliability Engineering (SRE) Lead with a strong background in the Banking domain, ideally within Wealth Management. The...