Senior SRE — Scale AI Infra with Kubernetes
3 weeks ago
An AI infrastructure company in Canada is looking for a Senior Site Reliability Engineer to build and maintain systems supporting data and AI workloads. You will automate system operations and develop tooling for incident management. Ideal candidates have a deep understanding of distributed systems, experience with cloud technologies (AWS/GCP), and a passion for solving complex problems. This role offers competitive pay and the chance to work in a fast-paced, innovative environment.
#J-18808-Ljbffr
-
SRE Engineer
1 week ago
Markham, Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Surrey, Victoria, London, Halton Hills, Regina, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada kloia Full timeJoin to apply for the SRE Engineer role at kloia Description Kloia is a recognized AWS Premier Consulting Partner and CNCF member with a focus on Application Modernization and Digital Transition projects. Our teams are growing rapidly, and we’re hiring a Site Reliability Engineer primarily for our managed services provided to customers, as well as for...
-
Senior SRE: Kubernetes
1 week ago
Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Surrey, Victoria, London, Halton Hills, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Camunda Full timeA leading automation platform provider in Toronto seeks a Senior Site Reliability Engineer to design and maintain its Kubernetes-based infrastructure. The ideal candidate will have over 5 years of experience in SRE, with strong skills in cloud infrastructure and monitoring tools. This role involves collaborating with teams to innovate automation processes...
-
Senior SRE: Kubernetes, Security
3 weeks ago
Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Orion Innovation Full timeA leading IT services company is looking for a Senior Site Reliability Engineer with expertise in Kubernetes and Rancher to ensure the reliability of mission-critical systems. This remote position, available in Canada, requires over 8 years of experience in SRE roles, particularly in secure environments. The role offers competitive compensation and the...
-
Senior SRE: AI/ML GPU HPC Infra
2 weeks ago
Toronto, Canada Boson AI Full timeA technology company in Toronto is seeking a Senior Site Reliability Engineer to manage and optimize a cutting-edge GPU cluster. The role involves hands-on lifecycle management of HPC infrastructure, troubleshooting, and developing automation for operational efficiency. Candidates should have over 5 years of experience in SRE or HPC and be proficient in...
-
Senior SRE: AI/ML GPU HPC Infra
2 weeks ago
Toronto, Canada Boson AI Full timeA technology company in Toronto is seeking a Senior Site Reliability Engineer to manage and optimize a cutting-edge GPU cluster. The role involves hands-on lifecycle management of HPC infrastructure, troubleshooting, and developing automation for operational efficiency. Candidates should have over 5 years of experience in SRE or HPC and be proficient in...
-
Senior SRE: AI/ML GPU HPC Infra
2 weeks ago
Toronto, Canada Boson AI Full timeA technology company in Toronto is seeking a Senior Site Reliability Engineer to manage and optimize a cutting-edge GPU cluster. The role involves hands-on lifecycle management of HPC infrastructure, troubleshooting, and developing automation for operational efficiency. Candidates should have over 5 years of experience in SRE or HPC and be proficient in...
-
Senior SRE: Scale Multi-Cloud Infra
3 weeks ago
Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, London, Halton Hills, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada HRB Full timeA leading entertainment company seeks a Senior Site Reliability Engineer to enhance the performance and reliability of its infrastructure. The role involves managing cloud technologies, employing Kubernetes orchestration, and collaborating with teams for continuous improvement while leveraging AI technologies. Experience with Terraform, Oracle EBS, and...
-
Remote Site Reliability Engineer
3 weeks ago
Montreal, Toronto, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada DevOps projects Full timeA rapidly growing AI start-up is seeking a Remote Site Reliability Engineer to ensure the stability, scalability, and security of its platform. You will architect infrastructure using AWS, design CI/CD pipelines, and enhance observability with advanced tools. Ideal candidates should have 3+ years of experience in SRE or DevOps and expertise in AWS, Linux,...
-
Senior SRE: Kubernetes, Rancher
3 weeks ago
Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Orion Innovation Full timeA cutting-edge technology firm is seeking a Senior Site Reliability Engineer to manage mission-critical infrastructure. This fully remote position requires 8+ years of experience and expertise in Kubernetes and observability tools like Prometheus and Grafana. The ideal candidate will thrive in challenging air-gapped environments and have a passion for system...
-
Lead SRE/DevOps Engineer
2 weeks ago
Vancouver, Toronto, Montreal, Calgary, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Surrey, Victoria, London, Halton Hills, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada JobGet Full timeAbout JobGet As the #1 app focused on everyday workers, JobGet is redefining the future of hiring. Founded in 2019, JobGet began as the only mobile-first hiring platform for everyday workers. Since then, we’ve grown by joining forces with Snagajob, the largest hourly job board in the U.S., followed by Seasoned, the leading platform for restaurant hiring....