Senior Site Reliability Engineer
2 days ago
The Sr. SRE will be responsible for the reliability, scalability, and performance of systems supporting classified government projects in an air-gapped deployment. This role leverages advanced monitoring and DevOps tools to ensure uptime and compliance in a disconnected environment.Key ResponsibilitiesDesign and maintain highly reliable systems using RKE2, Kubernetes, Ingress, Kong, Artifactory, and Sonar.Implement observability solutions with Prometheus, Grafana, Splunk, and Elastic to monitor system health in an air-gapped setting.Ensure compliance and performance optimization across multi-tenant deployments.Conduct code quality analysis and security assessments using Sonar.Collaborate with the Lead and Infra/Security Specialists to resolve incidents and improve system resilience.Develop and maintain documentation for system configurations and recovery procedures in a classified environment.Required Skills and QualificationsExpertise in RKE2, Kubernetes, Ingress, Kong, Artifactory, Prometheus, Grafana, Splunk, Elastic, and Sonar.Strong background in site reliability engineering and system observability.Experience working in air-gapped environments with a focus on classified data protection.Proficiency in troubleshooting and optimizing complex, multi-tenant infrastructures.Preferred QualificationsSRE or DevOps certifications (e.g., CKAD, CKA).Prior experience with government or defense-related SRE roles.Seniority levelSeniority levelMid-Senior levelEmployment typeEmployment typeFull-timeJob functionJob functionEngineering and Information TechnologyIndustriesIT Services and IT ConsultingReferrals increase your chances of interviewing at Orion Innovation by 2xGet notified about new Site Reliability Engineer jobs in Quebec, Canada.Drone Operations and Ground Equipment System EngineerGreater Montreal Metropolitan Area 3 days agoSenior Site Reliability Engineer- Central PlatformsPython and Kubernetes Software Engineer - Data, AI/ML & AnalyticsFreelance Software Developer (Python Engineer) - AI TrainerPython and Kubernetes Software Engineer - Data, AI/ML & AnalyticsPython and Kubernetes Software Engineer - Data, AI/ML & AnalyticsPython and Kubernetes Software Engineer - Data, AI/ML & AnalyticsPython and Kubernetes Software Engineer - Data, AI/ML & AnalyticsPython Software Engineer - Ubuntu Hardware Certification TeamStaff Software Engineer, Social Media & Client MarketingGreater Montreal Metropolitan Area 4 days agoWe’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI. #J-18808-Ljbffr
-
Senior Site Reliability Engineer
4 weeks ago
Quebec, Canada Orion Innovation Full timeThe Sr. SRE will be responsible for the reliability, scalability, and performance of systems supporting classified government projects in an air-gapped deployment. This role leverages advanced monitoring and DevOps tools to ensure uptime and compliance in a disconnected environment. Key Responsibilities Design and maintain highly reliable systems using RKE2,...
-
Senior Site Reliability Engineer
2 weeks ago
Quebec, Canada Canonical Full timeSenior Site Reliability Engineer Canonical is a leading provider of open‑source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation and IoT. Our customers include the world's leading public...
-
Senior Site Reliability Engineer
2 days ago
Quebec, Canada Canonical Full timeSenior Site Reliability Engineer Canonical is a leading provider of open‑source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation and IoT. Our customers include the world's leading public...
-
Senior Site Reliability Engineer
4 days ago
Montréal-Ouest, Quebec, Canada Orion Innovation Full timeOrion Innovation is a premier, award-winning, global business and technology services firm. Orion delivers game-changing business transformation and product development rooted in digital strategy, experience design, and engineering, with a unique combination of agility, scale, and maturity. We work with a wide range of clients across many industries...
-
Senior Site Reliability Engineer
6 days ago
Eastern Canada|Ontario|Montreal|Quebec|Nova Scotia|Vancouver|Calgary|Winnipeg|British Columbia|Manitoba|Edmonton|Saskatoon|Ottawa|Saint John|White Rock|Kitchener|Halifax|Coquitlam|Burnaby|St. John's Targeted Talent Full timeWe are looking for an experienced Senior Site Reliability Engineer for our client. This is a permanent position that is remote to start with later relocation to Calgary or Winnipeg. Our client is a global enterprise company with a product that you've likely used. Experience with coding/software development, along with Site Reliability will be the key...
-
Site Reliability Engineer
2 days ago
Quebec, Canada ALLTECH CONSULTING SVC INC Full timeJob Description:Technology/Role/Department at our Company Enterprise Technology & Services (ETS) delivers shared technology services for the Firm supporting all business applications and end users. ETS provides capabilities for all stages of the Firm’s software development lifecycle, enabling productive coding, functional and integration testing,...
-
Site Reliability Engineer
3 days ago
Quebec, Canada ALLTECH CONSULTING SVC INC Full timeJob Description: Technology/Role/Department at our Company Enterprise Technology & Services (ETS) delivers shared technology services for the Firm supporting all business applications and end users. ETS provides capabilities for all stages of the Firm’s software development lifecycle, enabling productive coding, functional and integration testing,...
-
Site Reliability Engineer
10 hours ago
Quebec, Canada ALLTECH CONSULTING SVC INC Full timeJob Description: Technology/Role/Department at our Company Enterprise Technology & Services (ETS) delivers shared technology services for the Firm supporting all business applications and end users. ETS provides capabilities for all stages of the Firm’s software development lifecycle, enabling productive coding, functional and integration testing,...
-
Site Reliability Expert
3 weeks ago
Quebec, Canada La Maison Simons Full timeJoin to apply for the Site Reliability Expert (SRE) role at La Maison Simons Are you looking to join our Information Technology team in a unique role that contributes to the optimal maintenance of our production environment? Join the Simons family as a Site Reliability Engineer (SRE). The person in this role plays a key part in ensuring the smooth operation...
-
Site Reliability Expert
2 days ago
Quebec, Canada La Maison Simons Full timeJoin to apply for the Site Reliability Expert (SRE) role at La Maison Simons Are you looking to join our Information Technology team in a unique role that contributes to the optimal maintenance of our production environment? Join the Simons family as a Site Reliability Engineer (SRE). The person in this role plays a key part in ensuring the smooth operation...