Site Reliability Engineer

1 week ago


Old Toronto, Ontario, Canada Rogers Communications, Inc. Full time
Job Summary

Rogers Communications, Inc. is seeking a highly skilled Site Reliability Engineer - Cloud Monitoring Specialist to join our dynamic team. As a key player in delivering a diverse content portfolio to Canadian audiences, you will be at the forefront of technology innovation in the media space.

Key Responsibilities
  • Design, develop, and implement monitoring solutions using industry-standard tools such as Prometheus, Loki, Zabbix, and Grafana.
  • Create and maintain dashboards, alerts, and reports for system performance visibility.
  • Integrate monitoring into the software development lifecycle.
  • Collaborate with stakeholders to define and meet monitoring requirements.
  • Develop and maintain alerting strategies for timely incident detection.
  • Participate in incident response and post-mortem analysis.
  • Automate monitoring and alerting processes for efficiency.
  • Document monitoring solutions, processes, and best practices.
  • Develop support playbooks to enable Tier 1 to solve common issues without assistance.
  • Provide training and support on monitoring tools and practices.
  • Work within a diverse group of DevOps engineers to enhance skills and learn new specialties.
Requirements
  • Post-secondary education in Computer Science, Information Technology, or a related field.
  • 5+ years of experience in Site Reliability Engineering, DevOps, or System Administration.
  • Proven experience with monitoring tools (Prometheus, Grafana, Loki, Zabbix, Datadog, New Relic, SolarWinds).
  • A strong foundation in cloud platforms, including hands-on experience with AWS and Azure, and proficiency with cloud monitoring services such as Amazon CloudWatch and Azure Monitor.
  • Experience with on-prem monitoring tools and strategies to ensure comprehensive coverage across hybrid environments.
  • Proficiency in scripting languages (Python, Bash).
  • Strong networking, system administration, and infrastructure management knowledge.
  • Experience with logging and distributed tracing tools.
  • Strong problem-solving skills and attention to detail.
  • Excellent communication and collaboration skills.
  • Ability to work in a fast-paced environment.
  • Proficiency in English, with the ability to explain technical concepts clearly, including to non-technical audiences.
What We Offer
  • A dynamic and collaborative work environment.
  • Opportunities for professional growth and development.
  • A comprehensive benefits package.
  • Generous employee discounts.
  • Leadership development, mentorship, and coaching programs.
How to Apply

Please submit your application through our website. We thank all applicants for their interest; however, only those selected for an interview will be contacted.



  • Toronto, Ontario, Canada Bourse de Montreal Inc. Full time

    Job Title: Site Reliability Engineer. At Bourse de Montreal Inc., we're seeking a highly skilled Site Reliability Engineer to join our Global Technology Services (GTS) team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our technology infrastructure. Key Responsibilities: Evaluate new...


  • Toronto, Ontario, Canada Lorven Technologies Full time

    Job Title: Site Reliability Engineer. Location: Toronto, CA. Duration: Long term. We are seeking a highly skilled Site Reliability Engineer to join our team at Lorven Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud-based infrastructure. Key Responsibilities: Design and implement scalable...


  • Old Toronto, Ontario, Canada PagerDuty, Inc. Full time

    PagerDuty empowers diverse teams to perform essential tasks that drive business success through the PagerDuty Operations Cloud. We are in search of a Senior Site Reliability Engineer to become a vital member of our SRE-Platform team. In this capacity, you will play a crucial role in developing, sustaining, and scaling the Kubernetes infrastructure that...


  • Old Toronto, Ontario, Canada PagerDuty, Inc. Full time

    PagerDuty empowers diverse teams to execute essential tasks that drive business success through the PagerDuty Operations Cloud. We are looking for a Senior Site Reliability Engineer to become a vital member of our SRE-Platform team. In this capacity, you will play a significant role in developing, sustaining, and enhancing the Kubernetes infrastructure that...


  • Old Toronto, Ontario, Canada PagerDuty, Inc. Full time

    PagerDuty empowers diverse teams to drive essential operations that propel business growth through the PagerDuty Operations Cloud. We are in search of a Senior Site Reliability Engineer to become a vital member of our SRE-Platform team. In this capacity, you will play a crucial role in developing, sustaining, and enhancing the Kubernetes infrastructure that...


  • Old Toronto, Ontario, Canada Akamai Full time

    Are you driven by the desire to enhance operational processes? Do you thrive in a multicultural team of engineering professionals? Join our elite Site Reliability team at Akamai. We focus on designing, developing, and managing applications and infrastructure that underpin Akamai's Compute offerings. Our expertise lies in creating and sustaining rapid,...


  • Old Toronto, Ontario, Canada SoundHound Inc Full time

    About SoundHound AI: At SoundHound AI, we envision a world where every individual can seamlessly interact with technology through natural conversation. Our innovative Voice AI solutions cater to various sectors, including automotive and food services, empowering brands to connect with their audiences in meaningful ways. Role Overview: We are seeking a...


  • Toronto, Ontario, Canada Rogers Communications Full time

    Job Title: Site Reliability Engineer. Rogers Sports & Media is seeking a skilled Site Reliability Engineer to join our dynamic team. As a key player in delivering Canadian audiences a diverse content portfolio, you'll be at the forefront of technology innovation in the media space. Key Responsibilities: Design, develop, and implement monitoring solutions using...


  • Toronto, Ontario, Canada ISG Search Inc Full time

    Job Title: Site Reliability Engineering. Job Summary: We are seeking a skilled Site Reliability Engineer to support our various platforms for a 6-month contract. This role involves database administration, automation, and troubleshooting of investment applications in a fast-paced, global environment. Key Responsibilities: Support and...


  • Old Toronto, Ontario, Canada SoundHound Inc Full time

    About SoundHound AI: SoundHound AI is dedicated to enabling seamless interaction with technology through natural language. Our innovative Voice AI solutions cater to various industries, enhancing user experiences and brand engagement. Role Overview: As a vital member of our Site Reliability Engineering (SRE) team, you will be instrumental in developing robust...


  • Old Toronto, Ontario, Canada SoundHound Inc Full time

    About SoundHound AI: SoundHound AI is dedicated to enabling seamless interactions between individuals and technology through natural language. Our innovative Voice AI solutions cater to diverse applications, including automotive systems and restaurant services, empowering brands to engage with their customers in meaningful ways. Role Overview: This position...


  • Toronto, Ontario, Canada ISG Search Inc Full time

    We are seeking a highly skilled Site Reliability Engineer to support our various platforms on a 6-month contract basis. This role involves database administration, automation, and troubleshooting of various investment applications in a fast-paced, global environment. Key Responsibilities: Support and manage investment application systems, databases (MS SQL,...


  • Toronto, Ontario, Canada Rogers Full time

    About the Role: Rogers Sports & Media is seeking a skilled Site Reliability Engineer to build and maintain a robust monitoring system for its cloud and on-prem systems. As a key player in delivering Canadian audiences a diverse content portfolio, you'll be at the forefront of technology innovation in the media space. Key Responsibilities: Design, develop, and...


  • Toronto, Ontario, Canada RBC - Royal Bank Full time

    Job Summary: We are seeking a highly skilled Site Reliability Engineer to join our team at RBC. As a key member of our Intelligent Operations program, you will be responsible for executing technical planning and successful implementation of complex enterprise-wide initiatives. Your expertise in Observability platforms, particularly Dynatrace, will be...