Site Reliability Engineer

3 weeks ago


Old Toronto, Canada Nityo Infotech Full time
Job Responsibilities:
Objectives of this Role
  • Run the IKP clusters by monitoring availability and taking a holistic view of system health
  • Build tools and automation to manage platform infrastructure and services
  • Improve reliability, quality, and time to upgrade cluster and service versions
  • Measure and optimize system performance and resource utilization, and plan for future capacity
  • Build dashboards and visualizations to graph system health
  • Define system alerts and automate responses where possible
  • Provide operational support and engineering for multiple software development teams
Daily and Monthly Responsibilities
  • Gather and analyze metrics from cluster components and services to assist in performance tuning and fault finding
  • Partner with Core Engineering and Services Engineering teams to improve services through rigorous testing and release procedures
  • Participate in system design consulting, platform management, and capacity planning
  • Create sustainable systems and services through automation and uplifts
  • Balance feature development speed and reliability with well-defined service level objectives
Work Experience

Typical candidates will have 5 to 10 years of experience in the technology field, preferably in software engineering.

Education

Bachelor’s degree in Computer Science or related field.

Experience Required: 5 - 10 Years

Industry Type: IT

Employment Type: Contract

Location: China


  • Old Toronto, Canada CB Canada Full time

    Site Reliability Engineer On behalf of our client in the Banking Sector, PROCOM is looking for a Site Reliability Engineer. Site Reliability Engineer – Job Description Azure cloud Jira and confluence CICD Experience with automating (provisioning, configuration management, deployment) and integrating Azure PaaS solutions (Azure App services, Azure...


  • Old Toronto, Canada Thomson Reuters Full time

    (Canada) Site Reliability Engineer (Contract) Contract (9 months 4 days) Published 3 days ago New Relic Data Dog Site Reliability Engineer - in the Service Management Organization Do you have experience in IT Service Management, working with cloud providers, software development, and technology infrastructure? The Site Reliability Engineer will...


  • Old Toronto, Canada eTeam Full time

    Remote Work Duration 4 months - Preference is to find candidates who are willing to be converted to full-time employees. The conversion decision will be made based on performance. Job Description Role Description: Defining and measuring reliability goals—SLIs, SLOs, and error budgets for user journey. Designing for and implementing observability (ELK,...


  • Toronto, Canada CB Canada Full time

    Site Reliability Engineer On behalf of our client in the Banking Sector, PROCOM is looking for a Site Reliability Engineer. Site Reliability Engineer – Job Description Azure cloud Jira and confluence CICD Experience with automating (provisioning, configuration management, deployment) and integrating Azure PaaS solutions (Azure App services, Azure...


  • Old Toronto, Canada eTeam Full time

    Remote work Duration - 4 months - Preference is to find candidates who are willing to be converted to full time employee. The conversion decision will be made based on performance. Job Description: Role Desc: Defining and measuring reliability goals—SLIs, SLOs, and error budgets for user journey Designing for and implementing observability:...


  • Toronto, ON, Canada CB Canada Full time

    Site Reliability Engineer On behalf of our client in the Banking Sector, PROCOM is looking for a Site Reliability Engineer. Site Reliability Engineer – Job Description Azure cloud Jira and confluence CICD Experience with automating (provisioning, configuration management, deployment) and integrating Azure PaaS solutions (Azure App services, Azure...


  • Old Toronto, Canada Lightspeed Full time

    Hi there! Thanks for stopping by. Are you actively looking for a new opportunity? Or just checking the market? Well… you might just be in the right place! We’re looking for a Principal Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and...


  • Old Toronto, Canada Akamai Full time

    Site Reliability Engineer II Do you have a passion for cutting edge technologies and tackling system problems? Are you a self-starting professional who thrives in a dynamic environment? Join our Site Reliability team. Our Team builds and delivers highly secure network security frameworks to protect our customers. We collaborate to create next-generation...


  • Toronto, Ontario, Canada Zortech Solutions Full time

    Hi, hope you are doing great. This side Priya Rajput from Zortech Solutions trying to reach you for an exciting job opening, kindly have a look to job description and revert me with your positive feedback. My mail ID is or call me on . Role: Site Reliability Engineer Location: Toronto, ON - Onsite Duration: Fulltime Permanent Skills and Responsibilities:...


  • Old Toronto, Canada ClickHouse Full time

    We are committed to providing our customers with reliable and secure services so we are building out our newly formed Site Reliability Engineering team. As one of the first joiners to our Reliability Engineering Team at ClickHouse, you will be responsible for building and leading processes to ensure the reliability, availability, scalability, and performance...


  • Old Toronto, Canada emagine Consulting Full time

    Work Model: Remote Business Trips: Occasional to Copenhagen Assignment Type: B2B Project Length: Long-term Start Date: ASAP Project Language: English About the Role: A unique opportunity to join as a Site Reliability Engineer to the dynamic, ambitious, and international company where you will work with a lot of skilled colleagues. You will join the dispersed...


  • Toronto, Canada Autodesk Full time

    Position Overview Autodesk, the leading Design and Make Software Company, is looking for a Principal Site Reliability Engineer to join the Autodesk Platform Services Engineering team in Toronto, Canada. In this position, you will help build trusted services of APS (Autodesk Platform Services) as measured by Service Level Objectives (SLOs) and Mean...


  • Old Toronto, Canada RBC - Royal Bank Full time

    Job Summary This role will be responsible for assisting in the development, implementation, and support of Site Reliability Engineering (SRE) solutions for applications supported by the Branch Technology in Digital organization. The incumbent will need introductory knowledge and experience working in an application development and/or technology operations...


  • Old Toronto, Canada Akamai Full time

    Do you have a passion for cutting edge technologies and tackling system problems? Are you a self-starting professional who thrives in a dynamic environment? Join our Site Reliability team. Our Team builds and delivers highly secure network security frameworks to protect our customers. We collaborate to create next-generation initiatives supporting...


  • Toronto, ON, Canada Thomson Reuters Full time

    (Canada) Site Reliability Engineer (Contract) Contract (9 months 4 days) Published 3 days ago New Relic Data Dog Site Reliability Engineer - in the Service Management Organization Do you have experience in IT Service Management, working with cloud providers, software development, and technology infrastructure? The Site Reliability Engineer will analyze...


  • Old Toronto, Canada RBC - Royal Bank Full time

    Job Summary RBC is seeking an expert engineer for our Finance and Audit Technology group. You will be heavily involved in shaping the future technology landscape of RBC by playing a crucial role in our site reliability transformation initiative. Working closely with our engineering and operations teams to ensure top-notch performance, scalability, and...


  • Old Toronto, Canada RBC - Royal Bank Full time

    Job Summary Job Description What is the opportunity? RBC is seeking an expert engineer for our Finance and Audit Technology group. You will be heavily involved in shaping the future technology landscape of RBC by playing a crucial role in our site reliability transformation initiative. Working closely with our engineering and operations teams to ensure...


  • Toronto, Canada eTeam Full time

    Remote work Duration - 4 months - Preference is to find candidates who are willing to be converted to full time employee. The conversion decision will be made based on performance. Job Description: Role Desc: Defining and measuring reliability goals (SLIs, SLOs, and error budgets) for user journey Designing for and implementing...