SRE Engineer
7 days ago
Join to apply for the SRE Engineer role at kloia Description Kloia is a recognized AWS Premier Consulting Partner and CNCF member with a focus on Application Modernization and Digital Transition projects. Our teams are growing rapidly, and we’re hiring a Site Reliability Engineer primarily for our managed services provided to customers, as well as for internal projects to build a scalable and reliable platform of common services. What does SRE do? In Kloia, the SRE Team focuses on eliminating toil in production workloads. Our main goal is to achieve 24x7 SLA with a support system and team that ‘Follow-the-Sun’. Key responsibilities include participating in design and development, making trade-offs between performance, cost, security, and reliability, and supporting the system in production as a reliable escalation point. As an SRE, you will: Eliminate toil through automation, re-architecting, and refactoring. Approach incidents with an “Automate Everything” mindset. Collaborate with software engineers to troubleshoot incidents. Drive complex infrastructure changes with transparency and zero downtime. Design and implement self-healing, reliable, and scalable infrastructure in a cloud-native environment. Guide and unblock developers across teams to push their products forward. Define SLOs and error quotas for production services. Support our dev-ops culture, including participation in the follow-the-sun on-call rota. Position: SRE (Site Reliability Engineer) Location: Remote - LATAM / APAC Level: Junior/Medior What does an average day look like? Proactively support production workloads, troubleshoot to find root causes, and write or review postmortems. Identify infrastructure and observability weaknesses. Technical challenges include: Optimizing resource allocation in Kubernetes for application performance. Including API Gateway monitoring in APM for full observability. Reducing database query hits. Guiding development team on data layer caching. Our stack is cloud-native, including AWS, Terraform, Docker/Kubernetes, Helm, ELK, Instana, OpsGenie, Node.js, Java, Typescript, Python. We expect candidates to have a deep understanding of Linux-based distributed systems at scale and relevant experience. Who should apply? This role suits those eager to work with cutting-edge cloud infrastructure at scale, passionate about automation, and capable of explaining complex concepts simply. Career benefits: Exposure to new technologies, working on products with global reach, and opportunities to develop both development and operations skills. We encourage continuous learning with initiatives like hack days and training. Requirements: Excellent communication skills Deep knowledge of Linux distributed systems at scale Experience with AWS or other cloud providers Experience with SQL/NoSQL databases at scale Experience with service lifecycle and monitoring Experience as a software or platform engineer / SRE Experience with DevOps practices Good understanding of Docker Automation mindset Nice to have: Knowledge of Kubernetes Experience with Terraform or other Infrastructure as Code tools Benefits include: Remote work flexibility Home office budget Hackathon days Access to AWS and CNCF/Kubernetes training and certifications R&D focus Social activities like weekly Lunch & Learn, Fridays, socials, and online games #J-18808-Ljbffr
-
DevOps/SRE Engineer
4 weeks ago
Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Targeted Talent Full timeOverview We are looking for an experienced DevOps/SRE Engineer for our client. This is a permanent position that can either be remote or in-office at Toronto! Our client is a large fintech firm with a product that you\'ve likely used many times before. Qualifications You have hands-on experience with enterprise-grade infrastructures, operations and / or...
-
Lead SRE/DevOps Engineer
2 weeks ago
Vancouver, Toronto, Montreal, Calgary, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Surrey, Victoria, London, Halton Hills, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada JobGet Full timeAbout JobGet As the #1 app focused on everyday workers, JobGet is redefining the future of hiring. Founded in 2019, JobGet began as the only mobile-first hiring platform for everyday workers. Since then, we’ve grown by joining forces with Snagajob, the largest hourly job board in the U.S., followed by Seasoned, the leading platform for restaurant hiring....
-
Remote DevOps/SRE Engineer for Fintech Platform
2 weeks ago
Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, London, Halton Hills, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Targeted Talent Full timeA leading fintech firm is seeking an experienced DevOps/SRE Engineer for a permanent position that offers both remote and office work options in Toronto. The role demands hands-on experience with enterprise-grade infrastructure and a solid background in DevOps practices. Candidates should have expertise in CI/CD, Kubernetes, and cloud solutions, alongside a...
-
Site Reliability Engineer
3 weeks ago
Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Hopper Full timeSite Reliability Engineer (SRE) - Platform Infrastructure team (100% Remote - Canada) Join Hopper as a Senior Site Reliability Engineer in the Platform Infrastructure team, building and operating the cloud foundation that powers products used by millions of travelers worldwide. What You'll Do: Help evolve a large‑scale, multi‑region infrastructure...
-
Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Targeted Talent Full timeA leading recruitment firm is seeking an experienced DevOps/SRE Engineer for a permanent role that offers the possibility of remote work or in-office at Toronto. The ideal candidate should have strong experience in enterprise-grade infrastructures and DevOps practices, with expertise in Kubernetes, Docker, and Microsoft Azure. This position comes with...
-
Remote SRE Engineer
2 weeks ago
Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Surrey, Victoria, London, Halton Hills, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Dayforce US, Inc. Full timeA leading global human capital management company is seeking a Site Reliability Engineer to improve system reliability and automate day-to-day processes. This role offers opportunities for remote work and collaboration on innovative technologies. The ideal candidate has 2-4 years of experience in SRE or similar roles, familiarity with cloud platforms,...
-
Senior DevOps
3 weeks ago
Vancouver, Toronto, Montreal, Calgary, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Rivalry Full timeA leading esports technology company is seeking a Senior DevOps / SRE Engineer to architect and optimize their LAMP stack. This senior role involves performance monitoring, developing automation solutions, and collaborating with a talented team passionate about gaming and esports. The ideal candidate has over 4 years of experience and a deep understanding of...
-
Senior SRE: Drive Reliability
4 weeks ago
Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Akamai Technologies Full timeA leading tech firm in Canada is seeking a Senior Site Reliability Engineer to tackle complex performance challenges. In this role, you will lead SRE teams, provide mentorship, and partner with various teams to enhance the reliability of the platform. Candidates should have a relevant degree and strong skills in statistical analysis, coding, and Internet...
-
Senior SRE: Cloud Reliability
4 weeks ago
Ottawa, Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Veeva Systems Full timeA leading life sciences technology company is looking for a Senior Software Engineer - SRE to join its Vault Platform team in Ottawa. In this role, you will ensure the scalability and reliability of enterprise applications. Candidates should have over 5 years of experience in Java development and a strong background in open-source technologies. The position...
-
Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada OpenTable Full timeA leading restaurant technology platform in Toronto seeks a Site Reliability Engineer II to enhance observability systems. This remote role will transition to a hybrid model, requiring strong communication and engineering collaboration skills. Ideal candidates have solid experience in SRE roles, with knowledge of DNS, AWS, and observability tools. A...