SRE Engineer

2 weeks ago


Markham, Canada kloia Full time

Description

Kloia is a recognized AWS Premier Consulting Partner and CNCF member with a focus on Application Modernization and Digital Transition projects. Our teams are growing rapidly, and we’re hiring a Site Reliability Engineer primarily for our managed services provided to customers, as well as for internal projects to build a scalable and reliable platform of common services.

What does SRE do?

At Kloia, the SRE Team focuses on eliminating toil in production workloads. Our main goal is to achieve a 24x7 SLA with a support system and team that ‘Follow-the-Sun’. Key responsibilities include participating in design and development, making trade-offs between performance, cost, security, and reliability, and supporting the system in production as a reliable escalation point.

As an SRE, you will:

  • Eliminate toil through automation, re-architecting, and refactoring.
  • Approach incidents with an “Automate Everything” mindset.
  • Collaborate with software engineers to troubleshoot incidents.
  • Drive complex infrastructure changes with transparency and zero downtime.
  • Design and implement self-healing, reliable, and scalable infrastructure in a cloud-native environment.
  • Guide and unblock developers across teams to push their products forward.
  • Define SLOs and error quotas for production services.
  • Support our DevOps culture, including participation in the follow-the-sun on-call rota.

Position: SRE (Site Reliability Engineer)
Location: Remote - LATAM / APAC
Level: Junior/Medior

What does an average day look like?

Proactively support production workloads, troubleshoot to find root causes, and write or review postmortems. Identify infrastructure and observability weaknesses.

Technical challenges include:

  • Optimizing resource allocation in Kubernetes for application performance.
  • Including API Gateway monitoring in APM for full observability.
  • Reducing database query hits.
  • Guiding the development team on data layer caching.

Our stack is cloud-native, including AWS, Terraform, Docker/Kubernetes, Helm, ELK, Instana, OpsGenie, Node.js, Java, TypeScript, Python. We expect candidates to have a deep understanding of Linux-based distributed systems at scale and relevant experience.

Who should apply?

This role suits those eager to work with cutting-edge cloud infrastructure at scale, passionate about automation, and capable of explaining complex concepts simply.

Career benefits: Exposure to new technologies, working on products with global reach, and opportunities to develop both development and operations skills. We encourage continuous learning with initiatives like hack days and training.

Requirements:

  • Excellent communication skills
  • Deep knowledge of Linux distributed systems at scale
  • Experience with AWS or other cloud providers
  • Experience with SQL/NoSQL databases at scale
  • Experience with service lifecycle and monitoring
  • Experience as a software or platform engineer / SRE
  • Experience with DevOps practices
  • Good understanding of Docker
  • Automation mindset

Nice to have:

  • Knowledge of Kubernetes
  • Experience with Terraform or other Infrastructure as Code tools

Benefits include:

  • Remote work flexibility
  • Home office budget
  • Hackathon days
  • Access to AWS and CNCF/Kubernetes training and certifications
  • R&D focus
  • Social activities like weekly Lunch & Learn, Fridays, socials, and online games


  • SRE Engineer

    1 week ago


    Markham, Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Surrey, Victoria, London, Halton Hills, Regina, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada kloia Full time

    Kloia is a recognized AWS Premier Consulting Partner and CNCF member with a focus on Application Modernization and Digital Transition projects. Our teams are growing rapidly, and we’re hiring a Site Reliability Engineer primarily for our managed services provided to customers, as well as for...


  • Markham, Canada CarltonOne Full time

    A global B2B technology leader is seeking an experienced SRE Manager to lead their Site Reliability Engineering team. The ideal candidate will have extensive experience in cloud infrastructure and incident management, alongside strong leadership skills. Responsibilities include defining SRE strategy, leading a team, and implementing monitoring solutions....


  • DevOps/SRE Engineer

    4 weeks ago


    Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Targeted Talent Full time

    Overview: We are looking for an experienced DevOps/SRE Engineer for our client. This is a permanent position that can be either remote or in-office in Toronto! Our client is a large fintech firm with a product that you've likely used many times before. Qualifications: You have hands-on experience with enterprise-grade infrastructures, operations and / or...


  • Vancouver, Toronto, Montreal, Calgary, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Surrey, Victoria, London, Halton Hills, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada JobGet Full time

    About JobGet As the #1 app focused on everyday workers, JobGet is redefining the future of hiring. Founded in 2019, JobGet began as the only mobile-first hiring platform for everyday workers. Since then, we’ve grown by joining forces with Snagajob, the largest hourly job board in the U.S., followed by Seasoned, the leading platform for restaurant hiring....


  • Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, London, Halton Hills, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Targeted Talent Full time

    A leading fintech firm is seeking an experienced DevOps/SRE Engineer for a permanent position that offers both remote and office work options in Toronto. The role demands hands-on experience with enterprise-grade infrastructure and a solid background in DevOps practices. Candidates should have expertise in CI/CD, Kubernetes, and cloud solutions, alongside a...


  • Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Hopper Full time

    Site Reliability Engineer (SRE) - Platform Infrastructure team (100% Remote - Canada) Join Hopper as a Senior Site Reliability Engineer in the Platform Infrastructure team, building and operating the cloud foundation that powers products used by millions of travelers worldwide. What You'll Do: Help evolve a large‑scale, multi‑region infrastructure...


  • Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Targeted Talent Full time

    A leading recruitment firm is seeking an experienced DevOps/SRE Engineer for a permanent role that offers the possibility of remote work or in-office work in Toronto. The ideal candidate should have strong experience in enterprise-grade infrastructures and DevOps practices, with expertise in Kubernetes, Docker, and Microsoft Azure. This position comes with...