Senior Site Reliability Engineer
19 hours ago
Overview Senior Site Reliability Engineer (SRE) with Kubernetes and Rancher. Full-time role focused on building and maintaining highly resilient, secure systems, including in air-gapped environments. Responsibilities System Architecture & Management: Design, architect, and maintain highly reliable, multi-tenant systems using Kubernetes and related tools (RKE2). Includes components such as Ingress, Kong, Artifactory, and Sonar. Observability & Monitoring: Implement and manage observability solutions with Prometheus, Grafana, Splunk, and Elastic to ensure deep visibility into system health and performance, including in air-gapped settings. Compliance & Optimization: Ensure deployments meet stringent compliance standards and are optimized for performance and security. Code Quality & Security: Perform regular code quality analysis and security assessments using Sonar to identify and mitigate vulnerabilities. Incident Response: Collaborate with leads and specialized teams to resolve incidents quickly and improve resilience and recovery procedures. Documentation: Create and maintain documentation for system configurations, runbooks, and disaster recovery plans for managing systems in sensitive environments. Required Skills and Qualifications 8+ years of Site Reliability Experience. Experience with Kubernetes and Rancher. Technical Expertise: Proficiency with RKE2, Kubernetes, Ingress, Kong, Artifactory, Prometheus, Grafana, Splunk, Elastic, and Sonar. SRE & Observability: Strong background in Site Reliability Engineering and implementing comprehensive observability strategies. Secure Environments: Experience in air-gapped or zero-connectivity environments and protecting classified data. Troubleshooting: Ability to troubleshoot and optimize complex, multi-tenant infrastructures under pressure. Preferred Qualifications Relevant SRE or DevOps certifications (e.g., CKAD, CKA). Experience in government or defense-related SRE roles. Experience with Rancher and its ecosystem. Seniority level Mid-Senior level Employment type Full-time Job function Engineering and Information Technology Industries IT Services and IT Consulting #J-18808-Ljbffr
-
Senior Site Reliability Engineer
5 days ago
, , Canada Thinkific Full timeJoin to apply for the Senior Site Reliability Engineer role at Thinkific Join to apply for the Senior Site Reliability Engineer role at Thinkific Are you an experienced Site Reliability Engineer looking for a new challenge? We’re looking for a Senior Site Reliability Engineer to join us at Thinkific. We’re looking for a Senior Site Reliability Engineer...
-
Senior Site Reliability Engineer
3 weeks ago
, , Canada Akamai Technologies Full timeSenior Site Reliability Engineer Join Akamai Technologies as we build a reliable, secure, and scalable Internet. We are looking for a Senior Site Reliability Engineer to help us solve complex performance and reliability challenges. Job Description Are you passionate about cutting‑edge technology and ready to tackle some of the Internet’s most difficult...
-
Senior Site Reliability Engineer
5 days ago
, , Canada DuckDuckGo Full time6 days ago Be among the first 25 applicants Get AI-powered advice on this job and more exclusive features. Who We AreHi, we're DuckDuckGo, the online protection company and remote-first team of 300+ on a mission to raise the standard of trust online. Founded in 2008 and profitable since 2014, our annual revenue now exceeds $100 million USD. Millions use our...
-
Senior Site Reliability Engineer
3 weeks ago
, , Canada Orion Innovation Full timeJob Description: Senior Site Reliability Engineer (SRE) with Kubernetes & Rancher Location: Canada - Remote (Working EST hours) Job Type: Full-time About the Role Are you an exceptional Site Reliability Engineer with a passion for building and maintaining highly resilient and secure systems? We are seeking a Senior SRE to join our team and play a critical...
-
Senior Site Reliability Engineer
4 weeks ago
, BC, Canada GoDaddy Full timeLocation and Work Arrangement Location Details: Canada - Remote. This is a remote position, so you’ll be working remotely from your home. You may occasionally visit a GoDaddy office to meet with your team for events or meetings. Join Our Team GoDaddy's Infrastructure Engineering team is looking for a Senior Site Reliability Engineer with a focus on...
-
Senior Site Reliability Engineer
4 weeks ago
, , Canada Targeted Talent Full timeOverview We are looking for an experienced Senior Site Reliability Engineer for our client. This is a permanent position that is remote to start with later relocation to Calgary or Winnipeg . Our client is a global enterprise company with a product that you've likely used. Experience with coding/software development, along with Site Reliability will be the...
-
Senior Site Reliability Engineer
5 days ago
, , Canada TextNow Full timeThis range is provided by TextNow. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Base pay range CA$113,400.00/yr - CA$162,000.00/yr We believe communication belongs to everyone. We exist to democratize phone service. TextNow is evolving the way the world connects and that\'s because we\'re made up of...
-
Staff Site Reliability Engineer
4 weeks ago
, BC, Canada Branch Full timeOverview At Branch, we’re transforming how brands and users interact across digital platforms. Our mobile marketing and deep linking solutions deliver seamless experiences that increase ROI, decrease wasted spend, and eliminate siloed attribution. Our team values ownership, collaboration, and a motto: Build Together, Grow Together, Win Together. As a Staff...
-
Senior Site Reliability Engineer
3 weeks ago
, , Canada Orion Innovation Full timeWe are seeking a highly specialized and experienced Senior Site Reliability Engineer (SRE) to drive the reliability, performance, and automation of our core platform. This role requires an exceptional blend of deep programming expertise in both Ruby and Go , coupled with hands‑on mastery of Linux systems, advanced networking concepts (specifically IPSec),...
-
Site Reliability Engineer
3 weeks ago
, , Canada Orion Innovation Full timeSenior Site Reliability Engineer (SRE) with Kubernetes & Rancher Location: Canada - Remote (Working EST hours) Job Type: Full-time About the Role Are you an exceptional Site Reliability Engineer with a passion for building and maintaining highly resilient and secure systems? We are seeking a Senior SRE to join our team and play a critical role in managing...