SRE

3 weeks ago


Toronto, Canada HRC Global Services Full time

Overview Get AI-powered advice on this job and more exclusive features. Responsibilities Provide hands-on SRE with 24x7 SRE support, including incident management, problem management, root cause analysis, monitoring, alerting, and maintenance of infrastructure, compliance Track, audit, monitor and implement on technical work streams Act as portfolio SME (Subject Matter Expert) – understand & document common components, core functionalities, infrastructure of supported applications Be an escalation point in the on-call rotation, and support our maintenance, scheduled work, support and release deployment requirements Lead in incident management and problem management for applications in scope and RCA Action items fulfillment/ownership Focus on continuous improvement and technical standards – drive improvements in productivity, monitoring, tooling and best practices Manage technology currency (server patching, certificate renewal, compliance, etc.) with keen eye on automating opportunities Drive best-in-class technical solutions by tracking closely industry leading solutions and applying to RBC environment and needs Leverage the value in unit, department, and enterprise wide teams to develop better solutions and achieve a cross enterprise mindset Develop SRE solutions (monitoring and alerting, machine learning anomaly detection, self-healing and reliability testing) Apply design-thinking and agile mindset in working with SREs, Scrum Masters and Incident Leads Contribute to and leverage best practices in SRE Simplifies development by building repeatable solutions to manual tasks Supports unit's goals to adopt automation solutions for applications in scope Production Support Perform production support role, including off-hours support and rotational on-call support to be compensated accordingly with overtime pay, lieu time, and on-call allowance Assist in incident management and problem management for applications in scope Evaluate continuously – what went well, what went wrong, what can be done to improve and prevent in future Maintain technology currency (perform server patching, certificate renewal, etc.) with keen eye on automating opportunities Ensure availability and uptime of applications in scope, as per service level objectives Ensure compliance of all systems and applications in scope, including maintaining segregation of duties Technical Consultation Support initiatives outside of application or squad level scope Consult on products built to other teams in and enterprise Innovation and Learning Stay abreast of technology change and learn constantly, through official training assignments and self-assigned learning Provide demos to team at large of new technology findings Must have A Bachelor's degree in Computer Science or related technical field (Example: Mathematics/Engineering/Physics), or equivalent practical experience. Advanced knowledge of the following SRE practices and technologies 4-5 years of experience in related field Python, YAML, Shell scripting Azure, Linux Dynatrace, Prometheus, PagerDuty, Moog, Splunk, Elastic, Azure monitor Chaos Engineering MQ, Kafka Perform production support role, including off-hours support In-depth hands-on experience in a variety of SRE tools (Ansible, Azure Automation, Catchpoint) Good to have: Kafka, Dynatrace, and related technologies with less-than-a-year experience examples listed in original (if applicable) Seniority level Mid-Senior level Employment type Full-time Job function Engineering and Information Technology Industries Human Resources Services Referrals increase your chances of interviewing at HRC Global Services by 2x #J-18808-Ljbffr


  • SRE

    4 weeks ago


    Toronto, Canada HRC Global Services Full time

    OverviewGet AI-powered advice on this job and more exclusive features.ResponsibilitiesProvide hands-on SRE with 24x7 SRE support, including incident management, problem management, root cause analysis, monitoring, alerting, and maintenance of infrastructure, complianceTrack, audit, monitor and implement on technical work streamsAct as portfolio SME (Subject...

  • SRE

    4 weeks ago


    Toronto, Canada HRC Global Services Full time

    OverviewGet AI-powered advice on this job and more exclusive features.ResponsibilitiesProvide hands-on SRE with 24x7 SRE support, including incident management, problem management, root cause analysis, monitoring, alerting, and maintenance of infrastructure, complianceTrack, audit, monitor and implement on technical work streamsAct as portfolio SME (Subject...

  • SRE

    5 days ago


    Toronto, Canada J&M Group Full time

    Join to apply for the SRE role at J&M Group Overview Role Description: 5+ years of experience as a Site reliability Engineer. Understanding of Observability concepts. Knowledge of Dynatrace and Ansible. Location: Toronto, Ontario, Canada Qualifications Competencies: Digital : Ansible, Digital : Site Reliability Engineering (SRE) Experience (Years): 6-8...

  • SRE

    3 days ago


    Toronto, Canada J&M Group Full time

    Join to apply for the SRE role at J&M Group Overview Role Description: 5+ years of experience as a Site reliability Engineer. Understanding of Observability concepts. Knowledge of Dynatrace and Ansible. Location: Toronto, Ontario, Canada Qualifications Competencies: Digital : Ansible, Digital : Site Reliability Engineering (SRE) Experience (Years): 6-8...


  • Toronto, Ontario, Canada JP Techno Park Full time $110,000 - $150,000 per year

    We're currently hiring for a Senior DevOps / Site Reliability Engineer (SRE) with 10+ years of experience in SRE, DevOps, or technical operations supporting production systems. We're looking for someone who has: Experience leading go-live and operational readiness for real-time or high-stakes platforms (fraud, risk, or payments preferred). Strong...

  • Senior SRE Engineer

    4 weeks ago


    Toronto, Canada Iris Software Inc. Full time

    Iris's client, one of the world's largest multinational Investment banking and financial services corporations, is looking to hire a Senior SRE – Tech Lead for a Long-Term opportunity.Work location: Toronto, ON (Hybrid Onsite – 4 days a week)Job description:Strong experience in Azure Cloud, AKS (Azure Kubernetes Service), and production operations.Proven...

  • Senior SRE Engineer

    4 weeks ago


    Toronto, Canada Iris Software Inc. Full time

    Iris's client, one of the world's largest multinational Investment banking and financial services corporations, is looking to hire a Senior SRE – Tech Lead for a Long-Term opportunity. Work location: Toronto, ON (Hybrid Onsite – 4 days a week) Job description: - Strong experience in Azure Cloud, AKS (Azure Kubernetes Service), and production...

  • Senior SRE Engineer

    3 weeks ago


    Toronto, Canada Iris Software Inc. Full time

    Iris's client, one of the world's largest multinational Investment banking and financial services corporations, is looking to hire a Senior SRE - Tech Lead for a Long-Term opportunity. Work location : Toronto, ON (Hybrid Onsite - 4 days a week) Job description: Strong experience in Azure Cloud, AKS (Azure Kubernetes Service), and production operations....

  • Senior SRE Engineer

    4 weeks ago


    Toronto, Canada Iris Software Inc. Full time

    Iris's client, one of the world's largest multinational Investment banking and financial services corporations, is looking to hire a Senior SRE – Tech Lead for a Long-Term opportunity. Work location : Toronto, ON (Hybrid Onsite – 4 days a week) Job description: Strong experience in Azure Cloud, AKS (Azure Kubernetes Service), and production operations....


  • Toronto, Ontario, Canada Infosprint Technologies Full time $150,000 - $200,000 per year

    SRE DevOps ArchitectToronto- HybridWe're currently hiring for a Senior Architect, DevOps / Site Reliability Engineer (SRE) with 10+ years of experience in SRE, DevOps, or technical operations supporting production systems.We're looking for someone who has: Experience leading go-live and operational readiness for real-time or high-stakes platforms (fraud,...