Site Reliability Engineer

3 weeks ago


Old Toronto, Canada Rogers Communications, Inc. Full time
Site Reliability Engineer

Are you ready to take your career to new heights and be a part of a dynamic team at Rogers Sports & Media? We believe in creativity, innovation, and collaboration in everything we do, and we are looking for people who share this mindset to join us. With a monthly reach of 30 million Canadians, you can help shape the future of sports, news, e-commerce, and entertainment. At Rogers, we value diversity and inclusivity and believe that every voice matters. Join us today and be a part of a team that is redefining the future of media.

Rogers Sports and Media is seeking a Site Reliability Engineer (SRE) to build and maintain a robust monitoring system for its cloud and on-prem systems. As a key player in delivering Canadian audiences a diverse content portfolio, from the thrilling Stanley Cup playoffs to the latest Bachelor episode, you'll be at the forefront of technology innovation in the media space. The SRE will use tools like Prometheus, Loki, Zabbix and Grafana to create alerts and dashboards, and will collaborate with teams to ensure system reliability. The ideal candidate has strong technical skills, including scripting, cloud platforms, and monitoring tools, along with excellent communication and problem-solving abilities.

What You'll Do:

  1. Design, develop, and implement monitoring solutions using industry-standard tools.
  2. Create and maintain dashboards, alerts, and reports for system performance visibility.
  3. Integrate monitoring into the software development lifecycle.
  4. Collaborate with stakeholders to define and meet monitoring requirements.
  5. Develop and maintain alerting strategies for timely incident detection.
  6. Participate in incident response and post-mortem analysis.
  7. Automate monitoring and alerting processes for efficiency.
  8. Document monitoring solutions, processes, and best practices.
  9. Develop support playbooks to enable Tier 1 to solve common issues without assistance.
  10. Provide monitoring tool and practice training and support.
  11. Work within a diverse group of DevOps engineers to enhance skills and learn new specialties.

What You'll Bring:

  • Post-secondary education in Computer Science, Information Technology, or related field.
  • 5+ years of experience in Site Reliability Engineering, DevOps, or System Administration.
  • Proven experience with monitoring tools (Prometheus, Grafana, Loki, Zabbix, DataDog, NewRelic, SolarWinds).
  • A strong foundation in cloud platforms, including hands-on experience with AWS and Azure, and proficiency in utilizing cloud monitoring services like Amazon CloudWatch and Azure Monitor.
  • Experience with on-prem monitoring tools and strategies to ensure comprehensive coverage across hybrid environments.
  • Proficiency in scripting languages (Python, Bash).
  • Strong networking, system administration, and infrastructure management knowledge.
  • Experience with logging and distributed tracing tools.
  • Strong problem-solving skills and attention to detail.
  • Excellent communication and collaboration skills.
  • Ability to work in a fast-paced environment.
  • Proficiency in English, with the ability to explain technical concepts clearly and sometimes to non-technical people.

Schedule: Full time
Shift: Day
Length of Contract: Not Applicable (Regular Position)
Work Location: 1 Mount Pleasant (083), Toronto, ON
Travel Requirements: None
Posting Category/Function: Technology & Information Technology
Requisition ID: 313072

At Rogers, we believe the key to a strong business is a diverse workforce where equity and inclusion are core to making everyone feel like they belong. We do this by embracing our diversity, celebrating our different perspectives, and working towards creating environments that empower our people to bring their whole selves to work. Everyone who applies for a job will be considered. We recognize the business value in creating a workplace where each team member has the tools to reach their full potential by removing any barriers for equal participation. We work with our candidates who are experiencing a disability throughout the recruitment process to ensure that they have what they need to be at their best. You matter to us For any questions, please visit the Recruitment Process FAQ.

Successful candidates will be required to complete a background check as part of the hiring process.

Being a Rogers team member comes with some great perks & benefits including:

· Health & well-being benefits
· Donation matching
· Paid time off for volunteering
· Wealth Accumulation including: Pension plan & Employee stock options
· Generous employee discounts
· Leadership development, Mentorship, and Coaching programs

*available for full-time and part-time permanent employees, some restrictions apply

#J-18808-Ljbffr

  • Old Toronto, Canada CB Canada Full time

    Site Reliability Engineer On behalf of our client in the Banking Sector, PROCOM is looking for a Site Reliability Engineer. Site Reliability Engineer – Job Description Azure cloud Jira and confluence CICD Experience with automating (provisioning, configuration management, deployment) and integrating Azure PaaS solutions (Azure App services, Azure...


  • Old Toronto, Canada CB Canada Full time

    Site Reliability Engineer On behalf of our client in the Banking Sector, PROCOM is looking for a Site Reliability Engineer. Site Reliability Engineer – Job Description Azure cloud Jira and confluence CICD Experience with automating (provisioning, configuration management, deployment) and integrating Azure PaaS solutions (Azure App services, Azure...


  • Old Toronto, Canada Reperio Human Capital Full time

    ```html Site Reliability Engineer 100421 Location: Ireland/UK Salary: €70K+ Type: Permanent, Full-time We're seeking experienced Site Reliability Engineers who excel at ensuring the reliability and scalability of production systems, and possess extensive experience with monitoring and automation tools. Responsibilities: Ensure the reliability,...


  • Old Toronto, Canada Reperio Human Capital Full time

    ```html Site Reliability Engineer 100421 Location: Ireland/UK Salary: €70K+ Type: Permanent, Full-time We're seeking experienced Site Reliability Engineers who excel at ensuring the reliability and scalability of production systems, and possess extensive experience with monitoring and automation tools. Responsibilities: Ensure the reliability,...


  • Old Toronto, Canada CB Canada Full time

    Site Reliability Engineer On behalf of our client in the Banking Sector, PROCOM is looking for a Site Reliability Engineer. Site Reliability Engineer – Job Description Azure cloud Jira and Confluence CICD Experience with automating (provisioning, configuration management, deployment) and integrating Azure PaaS solutions (Azure App services, Azure...


  • Old Toronto, Canada CB Canada Full time

    Site Reliability Engineer On behalf of our client in the Banking Sector, PROCOM is looking for a Site Reliability Engineer. Site Reliability Engineer – Job Description Azure cloud Jira and Confluence CICD Experience with automating (provisioning, configuration management, deployment) and integrating Azure PaaS solutions (Azure App services, Azure...


  • Old Toronto, Ontario, CA Reperio Human Capital Full time

    ```html Site Reliability Engineer 100421 Location: Ireland/UK Salary: €70K+ Type: Permanent, Full-time We're seeking experienced Site Reliability Engineers who excel at ensuring the reliability and scalability of production systems, and possess extensive experience with monitoring and automation tools. Responsibilities: Ensure the reliability,...


  • Old Toronto, Ontario, CA CB Canada Full time

    Site Reliability Engineer On behalf of our client in the Banking Sector, PROCOM is looking for a Site Reliability Engineer. Site Reliability Engineer – Job Description Azure cloud Jira and Confluence CICD Experience with automating (provisioning, configuration management, deployment) and integrating Azure PaaS solutions (Azure App services, Azure...


  • Old Toronto, Canada Thomson Reuters Full time

    (Canada) Site Reliability Engineer (Contract) Contract (9 months 4 days) Published 3 days ago New Relic Data Dog Site Reliability Engineer - in the Service Management OrganizationDo you have experience in IT Service Management, working with cloud providers, software development, and technology infrastructure?The Site Reliability Engineer will...


  • Old Toronto, Canada Thomson Reuters Full time

    (Canada) Site Reliability Engineer (Contract) Contract (9 months 4 days) Published 3 days ago New Relic Data Dog Site Reliability Engineer - in the Service Management OrganizationDo you have experience in IT Service Management, working with cloud providers, software development, and technology infrastructure?The Site Reliability Engineer will...


  • Old Toronto, Canada Thomson Reuters Full time

    (Canada) Site Reliability Engineer (Contract) Contract (5 months 29 days) Published 8 months ago CLOSED GCP Site Reliability Engineer - in the Service Management OrganizationDo you have experience in IT Service Management, working with cloud providers, software development, and technology infrastructure?The Site Reliability Engineer will analyze...


  • Old Toronto, Canada Thomson Reuters Full time

    (Canada) Site Reliability Engineer (Contract) Contract (5 months 29 days) Published 8 months ago CLOSED GCP Site Reliability Engineer - in the Service Management OrganizationDo you have experience in IT Service Management, working with cloud providers, software development, and technology infrastructure?The Site Reliability Engineer will analyze...


  • Old Toronto, Canada TD Bank Full time

    Site Reliability EngineerSite Reliability EngineerWork Location: CanadaHours: 37.5Line of Business: Technology SolutionsPay Details: We’re committed to providing fair and equitable compensation to all our colleagues. As a candidate, we encourage you to have an open dialogue with a member of our HR Team and ask compensation related questions, including pay...


  • Old Toronto, Canada TD Bank Full time

    Site Reliability EngineerSite Reliability EngineerWork Location: CanadaHours: 37.5Line of Business: Technology SolutionsPay Details: We’re committed to providing fair and equitable compensation to all our colleagues. As a candidate, we encourage you to have an open dialogue with a member of our HR Team and ask compensation related questions, including pay...


  • Toronto, Canada CB Canada Full time

    Site Reliability Engineer On behalf of our client in the Banking Sector, PROCOM is looking for a Site Reliability Engineer. Site Reliability Engineer – Job Description Azure cloud Jira and confluence CICD Experience with automating (provisioning, configuration management, deployment) and integrating Azure PaaS solutions (Azure App services, Azure...


  • Old Toronto, Canada eTeam Full time

    Remote Work Duration 4 months - Preference is to find candidates who are willing to be converted to full-time employees. The conversion decision will be made based on performance. Job Description Role Description: Defining and measuring reliability goals—SLIs, SLOs, and error budgets for user journey. Designing for and implementing observability (ELK,...


  • Old Toronto, Canada eTeam Full time

    Remote Work Duration 4 months - Preference is to find candidates who are willing to be converted to full-time employees. The conversion decision will be made based on performance. Job Description Role Description: Defining and measuring reliability goals—SLIs, SLOs, and error budgets for user journey. Designing for and implementing observability (ELK,...


  • Old Toronto, Canada Rogers Full time

    Site Reliability Engineer Are you ready to take your career to new heights and be a part of a dynamic team at Rogers Sports & Media? We believe in creativity, innovation, and collaboration in everything we do, and we are looking for people who share this mindset to join us. With a monthly reach of 30 million Canadians, you can help shape the future of...


  • Old Toronto, Canada Rogers Full time

    Site Reliability Engineer Are you ready to take your career to new heights and be a part of a dynamic team at Rogers Sports & Media? We believe in creativity, innovation, and collaboration in everything we do, and we are looking for people who share this mindset to join us. With a monthly reach of 30 million Canadians, you can help shape the future of...


  • Old Toronto, Ontario, CA Thomson Reuters Full time

    (Canada) Site Reliability Engineer (Contract) Contract (9 months 4 days) Published 3 days ago New Relic Data Dog Site Reliability Engineer - in the Service Management OrganizationDo you have experience in IT Service Management, working with cloud providers, software development, and technology infrastructure?The Site Reliability Engineer will analyze...