Current jobs related to Site Reliability Engineer - Toronto - Resonaite


  • (s): Canada : Ontario : Toronto Scotiabank Global Site Full time $105,000 - $170,000 per year

    Requisition ID: 244026Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture.Overview: As a Site Reliability Engineer (SRE), you will join the Digital Engineering Operations team, responsible for ensuring the operations and reliability of Scotiabank digital applications. You will have the opportunity to drive...


  • (s): Canada : Ontario : Toronto Scotiabank Global Site Full time US$80,000 - US$140,000 per year

    Requisition ID: 244027Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture.Overview: As a Site Reliability Engineer (SRE), you will join the Digital Engineering Operations team, responsible for ensuring the operations and reliability of Scotiabank digital applications. You will have the opportunity to drive...


  • Toronto, Ontario, Canada Procom Full time $80,000 - $120,000 per year

    Site Reliability Engineer (SRE)/ Ingénieur Fiabilité des SitesOn behalf of our banking client, Procom is seeking a Site Reliability Engineer (SRE) for a 12-month contract role. This position is a hybrid role, 3 days a week onsite at our client's Montréal, Quebec office.Site Reliability Engineer - Job Description:The Site Reliability Engineer is...


  • Toronto, Canada Maneva Full time

    About Maneva Maneva builds and deploys edge AI solutions powering real-time intelligence for industrial environments. Our systems run on distributed edge compute devices (NVIDIA Jetson platforms), integrate with local network cameras, PLCs, sensors, and other on-premise equipment, and securely communicate with cloud services via client- or site-based VPNs....


  • Toronto, Canada Maneva Full time

    About ManevaManeva builds and deploys edge AI solutions powering real-time intelligence for industrial environments. Our systems run on distributed edge compute devices (NVIDIA Jetson platforms), integrate with local network cameras, PLCs, sensors, and other on-premise equipment, and securely communicate with cloud services via client- or site-based VPNs....


  • Toronto, Canada Maneva Full time

    About Maneva Maneva builds and deploys edge AI solutions powering real-time intelligence for industrial environments. Our systems run on distributed edge compute devices (NVIDIA Jetson platforms), integrate with local network cameras, PLCs, sensors, and other on-premise equipment, and securely communicate with cloud services via client- or site-based VPNs....


  • Toronto, Canada Maneva Full time

    About Maneva Maneva builds and deploys edge AI solutions powering real-time intelligence for industrial environments. Our systems run on distributed edge compute devices (NVIDIA Jetson platforms), integrate with local network cameras, PLCs, sensors, and other on-premise equipment, and securely communicate with cloud services via client- or site-based VPNs....


  • Toronto, Canada Maneva Full time

    About Maneva Maneva builds and deploys edge AI solutions powering real-time intelligence for industrial environments. Our systems run on distributed edge compute devices (NVIDIA Jetson platforms), integrate with local network cameras, PLCs, sensors, and other on-premise equipment, and securely communicate with cloud services via client- or site-based VPNs....


  • Toronto, Canada Maneva Full time

    About Maneva Maneva builds and deploys edge AI solutions powering real‑time intelligence for industrial environments. Our systems run on distributed edge compute devices (NVIDIA Jetson platforms), integrate with local network cameras, PLCs, sensors, and other on‑premise equipment, and securely communicate with cloud services via client‑ or site‑based...


  • Toronto, Canada Maneva Full time

    About ManevaManeva builds and deploys edge AI solutions powering real‑time intelligence for industrial environments. Our systems run on distributed edge compute devices (NVIDIA Jetson platforms), integrate with local network cameras, PLCs, sensors, and other on‑premise equipment, and securely communicate with cloud services via client‑ or site‑based...

Site Reliability Engineer

4 weeks ago


Toronto, Canada Resonaite Full time

Site Reliability Engineer (DevOps/Release) Our client in the professional services sector is seeking an SRE to enhance service resiliency, automation, and operational excellence across critical cloud and on‑prem workloads. This role focuses on stability engineering, cloud migration readiness, observability, continuous improvement, and L2 production support. Location: Hybrid 3d Toronto Responsibilities Drive service stability, automation, and optimization initiatives, ensuring compliance with SLAs and operational best practices. Support AWS cloud workload migrations by validating readiness, ensuring observability coverage, and developing Day-2 runbook automation. Collaborate with DevSecOps and Architecture teams to operationalize cloud‑native and migrated applications using automated deployment, monitoring, and recovery pipelines. Implement scalable solutions to reduce manual intervention, improve deployment efficiency, and enable self‑healing and auto‑scaling. Use tools such as Splunk, Dynatrace, and Grafana to optimize performance, implement anomaly detection, and proactively address production issues. Analyze trends from testing and production environments, conduct root‑cause investigations, and recommend corrective actions to Agile squads. Maintain clear technical and operational documentation including runbooks, SOPs, post‑mortems, and architecture overviews. Provide leadership in vulnerability remediation, security alignment, and technology lifecycle management. Participate in a 24/7 rotating on‑call schedule, providing L2 support and rapid response to production incidents. Required Skills & Qualifications Strong AppOps experience supporting highly resilient and high‑performance workloads on AWS. Proficiency with Git, PowerShell, Python, Ansible, Terraform, Docker, and microservices patterns. Hands‑on experience with Splunk, Dynatrace, Grafana, and ServiceNow. Solid understanding of AWS ECS architecture, autoscaling, load balancing, and VPC integration. Strong knowledge of Agile methodologies, SDLC processes, release management, and incident/problem/change management. Ability to analyze data, identify issues early, and mitigate risks in production environments. Excellent communication skills for working across technical and business stakeholders. Nice to Have SRE certification Experience with OpenShift Kubernetes Familiarity with DevOps concepts and CI/CD Knowledge of Oracle or PostgreSQL AWS certifications Financial Services or Payments experience ITIL certification Seniority level Mid‑Senior level Employment type Contract Job function Information Technology Industries Professional Services #J-18808-Ljbffr