Site Reliability Engineering Specialist
2 weeks ago
Site Reliability Engineering Specialist – Telesat Telesat (NASDAQ and TSX: TSAT) is a leading global satellite operator, providing reliable and secure satellite-delivered communications solutions worldwide to broadcast, telecommunications, corporate and government customers for over 50 years. Backed by a legacy of engineering excellence, reliability and industry‑leading customer service, Telesat has grown to be one of the largest and most successful global satellite operators. Telesat Lightspeed, our revolutionary Low Earth Orbit (LEO) satellite network, scheduled to begin service in 2027, will revolutionize global broadband connectivity for enterprise users by delivering a combination of high capacity, security, resiliency and affordability with ultra‑low latency and fiber‑like speeds. The company’s state‑of‑the‑art fleet consists of 14 GEO satellites, the Canadian payload on ViaSat‑1 and one LEO 3 demonstration satellite. We are seeking a Site Reliability Engineering Specialist to ensure the reliability, performance, and scalability of our infrastructure. The ideal candidate will have extensive experience in cloud environments, automation, and monitoring, with a strong focus on incident response and system optimisation. Excellent problem‑solving skills and a proactive approach to maintaining system health are essential. Responsibilities Work closely with Telesat's cloud engineers to deploy and maintain our Kubernetes‑based infrastructure Help maintain high availability, uptime and resiliency of our infrastructure Perform day‑to‑day operational tasks such as upgrades and patching of the Kubernetes platform Automate operational tasks Monitor the health of the platform and applications using Telesat's observability platform Improve observability, define and measure SLOs Collaborate with development teams to resolve application issues Go on‑call and respond to automated alerts and execute playbooks Identify gaps in processes, as well as build or improve tools to support incident management Facilitate incident response and conduct root cause analysis Education and Experience Required Bachelor's Degree in Computer Science or a related field Minimum nine years of experience in IT operations with a focus on reliability, uptime, availability and performance At least five years of hands‑on provable experience with Microsoft Azure including deployment, management, and monitoring Expertise in automation and configuration management tools with demonstrable experience using tools such as Terraform and Ansible to automate infrastructure and application deployment Strong understanding of monitoring and observability tools with proven experience in monitoring tools such as Prometheus, Grafana, Nagios or Splunk, and the ability to implement and maintain observability solutions CNCF Certified Kubernetes Administrator (CKA) would be considered an asset for this role Security clearance requirement: The successful candidate must be able to work in Canada and obtain clearance under the Canadian Controlled Goods program (CGP). Seniority level: Mid‑Senior; Employment type: Full‑time. At Telesat, we take pride in being an equal opportunity employer that values equality in the workplace. We are committed to providing the best candidate experience possible including any required accommodations at every stage of our interview process. All qualified applicants that have been selected for an interview that require accommodations, are advised to inform the Telesat Talent team accordingly. We will work with you to meet your needs. All accommodation information provided will be treated as confidential. We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us. #J-18808-Ljbffr
-
Site Reliability Engineering Specialist
2 weeks ago
Ottawa, Canada Telesat Full timeTelesat (NASDAQ and TSX: TSAT) is a leading global satellite operator, providing reliable and secure satellite-delivered communications solutions worldwide to broadcast, telecommunications, corporate and government customers for over 50 years. Backed by a legacy of engineering excellence, reliability and industry-leading customer service, Telesat has grown...
-
Site Reliability Engineer
5 days ago
Ottawa, Canada Apptoza Inc. Full timeHI, Hope you are doing Great, If you are fine with below JD please share me your Updated resume ASAP. Site Reliability Engineer Location: TORONTO (ONSITE) Duration: 6 months Exp Required: 10 Years Job Description: Job Title : SRE Technical/Functional Skills • 8+ years of overall IT experience. • Advanced Linux / Unix support experience...
-
Site Reliability Engineer
3 weeks ago
Ottawa, Canada Tecsys Inc. Full timeJob Description Job Description Having recognized the advantages of remote work, including employee morale, productivity, reduced commuting on employee wellbeing and the environment, we are proud to be a digital-first company. The technologies and programs in which we invested have provided a fantastic foundation to this end. Our digital-first work...
-
Site Reliability Engineer
3 weeks ago
Ottawa, Canada Tecsys Inc. Full timeJob Description Job Description Having recognized the advantages of remote work, including employee morale, productivity, reduced commuting on employee wellbeing and the environment, we are proud to be a digital-first company. The technologies and programs in which we invested have provided a fantastic foundation to this end. Our digital-first work...
-
Site Reliability Engineer
3 weeks ago
Ottawa, Canada Tecsys Inc. Full timeJob Description Job Description Having recognized the advantages of remote work, including employee morale, productivity, reduced commuting on employee wellbeing and the environment, we are proud to be a digital-first company. The technologies and programs in which we invested have provided a fantastic foundation to this end. Our digital-first work...
-
Site Reliability Specialist
4 days ago
Ottawa, Canada Innovapost Full time**Who is Innovapost?**: Great question! We are the technology arm of the Canada Post Group of companies. This includes Canada Post, Purolator, and SCI. By joining us you will be able to make a positive impact on how every Canadian deliver and receives their packages and mail. Next time you see your neighbor picking up their mail and receiving a package, you...
-
Site Reliability Engineer
2 weeks ago
Ottawa, Ontario, Canada TECSYS Inc. Full time $120,000 - $140,000 per yearHaving recognized the advantages of remote work, including employee morale, productivity, reduced commuting on employee wellbeing and the environment, we are proud to be a digital-first company. The technologies and programs in which we invested have provided a fantastic foundation to this end. Our digital-first work environment, together with our...
-
Ottawa, Canada Telesat Full timeTelesat (NASDAQ and TSX: TSAT) is a leading global satellite operator, providing reliable and secure satellite-delivered communications solutions worldwide to broadcast, telecommunications, corporate and government customers for over 50 years. Backed by a legacy of engineering excellence, reliability and industry-leading customer service, Telesat has grown...
-
Reliability Engineer
4 weeks ago
Ottawa, Canada Snc-Lavalin Full timeReliability Engineer page is loaded## Reliability Engineerlocations: CA.ON.Ottawa.3110 Albion Road Northtime type: Full timeposted on: Posted Yesterdayjob requisition id: R-132603### **Job Description****Reliability Engineer****Come join us in reshaping the future with AtkinsRéalis. AtkinsRéalis is dedicated in engineering a better future for our...
-
Reliability Engineer
4 weeks ago
Ottawa, Canada Snc-Lavalin Full timeReliability Engineer page is loaded## Reliability Engineerlocations: CA.ON.Ottawa.3110 Albion Road Northtime type: Full timeposted on: Posted Yesterdayjob requisition id: R-132603### **Job Description****Reliability Engineer****Come join us in reshaping the future with AtkinsRéalis. AtkinsRéalis is dedicated in engineering a better future for our...