Senior Developer, Site Reliability

2 weeks ago


Toronto, Ontario, Canada Radio-Canada Full time

Position Title
Senior Developer, Site Reliability (Digital Strategy And Product) (English Services)

Status Of Employment
Contractee Long-Term (Fixed Term)

Position Language Requirement
Language Skills:

Work at CBC/Radio-Canada
At CBC/Radio-Canada, we create content that informs, entertains and connects Canadians on multiple platforms. Our successes and accomplishments are driven by embodying and upholding values, which include creativity, integrity, inclusiveness and relevance.

Do you think you have the ability and drive to keep up with this exciting, ever-changing industry? Whether it be in front of the camera, on air, online or behind the scenes, you would be joining a team that thrives on making connections and telling stories that are important to Canadians.

Unposting Date
:59 PM

Working At CBC
Senior Developer, Site Reliability (Digital Strategy And Product)
At the CBC, we all have a story to tell. What's yours?
If you share our passion for storytelling, Canada and having a positive impact on your community, CBC's Digital Strategy and Product team is where you want to be.

CBC serves one of the country's largest digital audiences and one of the few that is built and owned in Canada, by Canadians. Our product suite includes , CBC News app, CBC Gem and CBC Listen.

Our product vision is to make it effortless for all people and communities in Canada to tell and discover the stories that connect us. Whether delivering the latest breaking news, international hit series like North of North, deep dives into the issues of your town or neighbourhood, award-winning podcasts like Someone Knows Something, or nation-building events like the Olympics, our products ensure that these stories reach Canadians on their phones, TVs, desktops or smart speakers whenever they need or want us.

You have the opportunity to play a part in informing, enlightening and entertaining Canadians through innovative storytelling formats and content discovery. We are an empowered, outcome-driven, innovative hub for CBC, critical to the future growth of Canada's public media company and the wider content ecosystem. We believe that we are most impactful when our teams reflect the breadth of experience of the audiences we serve, and we are committed to an inclusive and equitable workplace to realize that belief. When you join our mission, you are contributing not only to the growth of the CBC but also to the future of our country.

Why is this role important?
As our national public broadcast organization, we are committed to providing quality content to our audience securely with a high rate of availability. To help us achieve this, we are looking for a Senior Developer, Site Reliability who has experience configuring monitoring solutions and participating in incident management and resolution. You will also have the opportunity to learn or expand your knowledge of many different technologies while working with all our digital teams to add or improve load testing, monitoring, alerting and incident response. You will be part of building a comprehensive observability solution to help support our digital teams across a large and complex environment.

As part of an integral team, you will have the chance to collaborate with our product development teams to guarantee high availability for all of our systems, whether they are external (audience-facing) or internal. You will make a direct contribution to the observability of performance for the website, streaming platforms, and global infrastructure.

This position will also play an important role in our incident management process, helping teams to complete postmortem investigations into any major incident at CBC.

If you're passionate about Canada and you love technology, learning and bringing out the best in others, you'll love working at CBC.

CBC/Radio-Canada is the largest broadcaster in Canada and this team is at the forefront of bringing observability to all our digital offerings. We work on improving site reliability by increasing monitoring/alerting, implementing lessons learned from postmortem investigations and working with digital teams to improve their performance testing. We strive to detect potential problems as early as possible to help our digital teams be more proactive in their response to potential issues.

Here's Why We Should Work Together
Our digital teams' values - collaboration, inclusion, learning, and continuous improvement - embody who we are as a people-focused, digital-forward employer. We follow Agile principles and the empowered product operating model. Our dedicated managers work closely with every individual to ensure we are leveraging their strengths, championing their ideas and supporting their pursuit of new skills and their desired career progression.

Here at CBC Digital Strategy & Products, your well-being is critical to our success. It is essential that work be a safe space where our employees are able to share their authentic selves with one another and to push each other to challenge conventions.

How You Will Make An Impact
You are an experienced developer who will primarily work on site reliability, helping to build and configure our performance monitoring solution for all our digital teams and products, and enabling our digital teams to be less reactive and more proactive in dealing with potential problems.

Collaborating with digital team leads and product owners to determine the essential metrics that need to be captured for each product or system, working with them to determine the proper thresholds for effective alerting, and ensuring teams have an action plan in place for each alert that may be triggered.

Open communication and dialogue with team members on an ongoing basis, being supportive and receptive to feedback and questions; partnering with the Senior Engineering Manager to recognize and address teams' training opportunities and performance challenges together.

Understanding the importance of accessibility and knowing what it takes to meet the needs and inclusivity of all users.

Having an opportunity to join a company with a mission, value set, and tech-forward approach that aligns with your own; a place where knowledge-sharing guides your learning.

Wanting to be part of a fun team, engaged in a continuous learning culture, where you can take on new challenges and be a significant contributor to engaging our pan-Canadian audience.

What You Bring To Our Team

  • Minimum of 2 years of relevant experience.
  • Strong experience working with monitoring tools, both enterprise and open source (Prometheus, Grafana, Sentry, ELK).
  • Strong experience implementing alerting solutions and setting thresholds.
  • Knowledge of one or more cloud platforms such as AWS, Azure or GCP.
  • Experience with coding or scripting languages (C#, Python, PowerShell, PowerCLI, Bash).
  • Good understanding of CI/CD pipelines.
  • Knowledge of containerization (Docker, Kubernetes).
  • Experience with Atlassian applications (Confluence, Jira).
  • Able to follow technical solution diagrams.
  • Experience with a variety of operating systems (Linux, Windows, macOS).
  • Basic understanding of the various components of a full stack solution.

Nice To Have

  • Past experience supporting a full stack solution is an asset.
  • Experience with Infrastructure as Code tools (Terraform, Puppet, Chef).
  • Familiarity with current trends and industry best practices.
  • Self-starter with strong ability to lead through influence.
  • Excellent team player.
  • Strong communication skills.
  • Ability to multitask and deal with concurrent and/or conflicting priorities.
  • Ability to work with remote teams.
  • Bilingualism (English and French) is an asset.

Candidates may be subject to skills and knowledge testing.

We thank all applicants for their interest, but only candidates selected for an interview will be contacted.

As part of our recruitment process, candidates who advance to the next step will be asked to complete a background check. This includes:

  • A mandatory criminal record check.
  • Other background checks may be conducted based on the operational requirements of the position.

CBC/Radio-Canada is committed to being a leader in reflecting our country's diversity. That's because we can only create and tell the stories that connect Canadians, by having a workforce that mirrors the ever-changing makeup of our country. That's why we, as an employer, value equal opportunity and nurture an inclusive workplace where our individual differences are not only recognized and valued, but also extend to and pervade all the services we provide as Canada's public broadcaster. For more information, visit the Diversity and Inclusion section of our website. If you have accommodation needs at this stage of the recruitment process, please inform us as soon as possible by sending an e-mail to

You are invited to consult and familiarize yourself with our Code of Conduct, which can be found on our corporate website. All employees must adhere to the Code as a condition of employment. We also invite you to take a look at our policy on conflicts of interest. In the event that you become an employee, it will be important to inform us, as quickly as possible, of any situation that, because of your hiring, constitutes or could appear to constitute a conflict of interest.

Primary Location:
Broadcast Centre 205 Wellington St. W., Toronto, Ontario, M5V 3G7

Number Of Openings
1

Work Schedule
Full time


