Site Reliability Engineer

3 weeks ago


Toronto ON, Canada emagine Consulting Full time

Work Model:

Remote

Business Trips:

Occasional to Copenhagen

Assignment Type:

B2B

Project Length:

Long-term

Start Date:

ASAP

Project Language:

English

About the Role:

A unique opportunity to join a dynamic, ambitious, international company as a Site Reliability Engineer, working alongside many skilled colleagues. You will join a distributed team that develops a product platform that helps other product teams deliver cloud-native functionality in a consistent manner.

Responsibilities:
  1. Define and maintain containers for Kubernetes (both in Azure and local developer environments).
  2. Create Helm charts used for deploying our product in Azure.
  3. Be responsible for our CI/CD processes on GitHub Actions, focusing on quality, efficiency, and automation.
  4. Develop and maintain our authentication and authorization functionality (OpenID Connect and OAuth2).
  5. Be responsible for logging, telemetry, and driving improvements in CI/CD and observability.
  6. Maintain internal deployments used by developers.
  7. Enhance the quality and cadence of release processes.
  8. Collaborate with the development team to improve the deployment platform.
Must Have:

5+ years of experience in a similar position working on a SaaS product.

Hands-on experience with cloud solutions in production, either as a cloud software developer who has worked on a SaaS solution, or as a cloud-ops engineer who has been responsible for operating a SaaS solution.

Hands-on experience with Kubernetes.

Experience with logging and tracing tools for effective troubleshooting and debugging.

Experience in optimizing system performance, scalability, and efficiency to handle growing workloads.

Expertise in incident management, including the ability to diagnose and resolve incidents quickly and efficiently.

Knowledge of:

  • Infrastructure as Code principles.
  • Monitoring tools like Prometheus, Grafana, or similar solutions to ensure visibility into system performance and health.
  • Security best practices and the ability to incorporate security considerations into the design and operation of systems.
  • Reliability engineering principles (e.g., Service Level Objectives (SLOs), Service Level Indicators (SLIs), and Error Budgets).

Strong communication skills to effectively collaborate with cross-functional teams, including developers, operations, and other stakeholders.

Ability to document processes, procedures, and system architecture comprehensively.

Strong analytical and problem-solving skills, with the ability to diagnose complex issues and implement effective solutions.

Willingness to adapt to evolving technologies and industry best practices, with a commitment to continuous learning.

We Offer:
  • Long-term cooperation.
  • Transparently built relations based on trust and fair play.
  • Medicover card and Multisport card on preferential terms.
  • Internal referral bonus.

  • Toronto, Canada CB Canada Full time

    Site Reliability Engineer On behalf of our client in the Banking Sector, PROCOM is looking for a Site Reliability Engineer. Site Reliability Engineer – Job Description Azure cloud Jira and confluence CICD Experience with automating (provisioning, configuration management, deployment) and integrating Azure PaaS solutions (Azure App services, Azure...


  • Toronto, ON, Canada Thomson Reuters Full time

    (Canada) Site Reliability Engineer (Contract) Contract (9 months 4 days) Published 3 days ago New Relic Data Dog Site Reliability Engineer - in the Service Management Organization Do you have experience in IT Service Management, working with cloud providers, software development, and technology infrastructure? The Site Reliability Engineer will analyze...


  • Toronto, ON, Canada Autodesk Full time

    Position Overview Autodesk, the leading Design and Make Software Company, is looking for a Principal Site Reliability Engineer to join the Autodesk Platform Services Engineering team in Toronto, Canada. In this role, you will help build trusted services of APS (Autodesk Platform Services) measured by Service Level Objectives (SLOs) and Mean Time to...


  • Old Toronto, Canada CB Canada Full time

    Site Reliability Engineer On behalf of our client in the Banking Sector, PROCOM is looking for a Site Reliability Engineer. Site Reliability Engineer – Job Description Azure cloud Jira and confluence CICD Experience with automating (provisioning, configuration management, deployment) and integrating Azure PaaS solutions (Azure App services, Azure...


  • Toronto, ON, Canada OnX Canada Full time

    OnX is looking for a Site Reliability Engineer for one of our clients in Toronto. Client: Financial Services Location: Toronto, mostly remote Duration: 6 months with potential extension JBoss in middleware experience is super important Responsibilities: Following the senior technician's plans to build out lower environments with functioning software stacks including...


  • Toronto, ON, Canada Infotek Consulting Services Inc. Full time

    Infotek Consulting is searching for a Site Reliability Engineer - this is a remote opportunity with some travel involved. Job Description: Our EPM (Event and Performance Management) team is an availability, performance and reliability management discipline that supports the optimization of the operational experience and behavior of a digital agent - human or...


  • Toronto, Canada Infotek Consulting Services Inc. Full time

    Infotek Consulting is searching for a Site Reliability Engineer - this is a remote opportunity with some travel involved Job Description: Our EPM (Event and Performance Management) team is availability, performance and reliability management discipline that supports the optimization of the operati


  • Toronto, Ontario, Canada Zortech Solutions Full time

    Hi, hope you are doing great. This is Priya Rajput from Zortech Solutions, reaching out about an exciting job opening. Kindly have a look at the job description and reply with your feedback. My mail ID is or call me on . Role: Site Reliability Engineer Location: Toronto, ON - Onsite Duration: Full-time Permanent Skills and Responsibilities:...


  • Toronto, ON, Canada Hour Consulting Full time

    Our client, a fast growing Fintech Startup is on a mission to redefine how to protect user identity, providing users secure control over personal information through a privacy compliant network. This approach creates higher customer interaction and sales conversions, while improving overall security for both customers and businesses. They are a...


  • Toronto, ON, Canada OnX Canada Full time

    OnX is looking for a Site Reliability Engineer for one our clients in Toronto. Client: Financial Services Location: Toronto, mostly remote Duration: 6 months with potential extension JBoss in middleware experience is super important Responsibilities: Following the senior technicians plans to build out lower environments with functioning software...


  • Toronto, Canada OnX Canada Full time

    OnX is looking for a Site Reliability Engineer for one our clients in Toronto. Client: Financial Services Location: Toronto, mostly remote Duration: 6 months with potential extension JBoss in middleware experience is super important Responsibilities: Following the senior technicians plans to buil


  • Toronto, ON, Canada Nityo Infotech Full time

    Job Responsibilities: Objectives of this Role Run the IKP clusters by monitoring availability and taking a holistic view of system health Build tools and automation to manage platform infrastructure and services Improve reliability, quality, and time to upgrade cluster and service versions Measure and optimize system performance and resource...


  • Toronto, Canada Autodesk Full time

    Position Overview Autodesk, the leading Design and Make Software Company, is looking for a Principal Site Reliability Engineer to join the Autodesk Platform Services Engineering team in Toronto, Canada. In this position, you will help build trusted services of APS (Autodesk Platform Services) as measured by Service Level Objectives (SLOs) and Mean...


  • Old Toronto, Canada Thomson Reuters Full time

    (Canada) Site Reliability Engineer (Contract) Contract (9 months 4 days) Published 3 days ago New Relic Data Dog Site Reliability Engineer - in the Service Management Organization Do you have experience in IT Service Management, working with cloud providers, software development, and technology infrastructure? The Site Reliability Engineer will...


  • Toronto, ON, Canada Tata Consultancy Services Full time

    TCS is an equal opportunity employer, and embraces diversity in race, nationality, ethnicity, gender, age, physical ability, neurodiversity, and sexual orientation, to create a workforce that reflects the societies we operate in. Our continued commitment to Culture and Diversity is reflected in our people stories across our workforce implemented through...


  • Ajax, ON, Canada Gradient IT Full time

    We are looking for a passionate Site Reliability Engineer with a deep-rooted foundation in DevSecOps and Open Source Technology. The engineer should be passionate about automation and building highly scalable and available services in the cloud. You will help lead a team of engineers to build tooling, automation, and support Spinnaker on behalf of our...


  • Old Toronto, Canada Autodesk Full time

    Position Overview Autodesk, the leading Design and Make Software Company, is looking for a Principal Site Reliability Engineer to join the Autodesk Platform Services Engineering team in Toronto, Canada. In this role, you will help build trusted services of APS (Autodesk Platform Services) measured by Service Level Objectives (SLOs) and Mean Time to Recovery...


  • Toronto, ON, Canada Paymentus Full time

    Summary Paymentus leads the North American marketplace in electronic bill payment solutions and is looking for high performers to join our development team building SaaS Fintech solutions across a range of industries. You will contribute to a massively scalable data platform, that is built on top of a world class enterprise platform, supporting thousands of...