Senior Site Reliability Engineer

3 weeks ago


Toronto ON, Canada Akamai Full time

Join our Origin Service SRE Team

Do you like collaborating across teams to solve complex problems? Do you enjoy solving large-scale distributed content delivery challenges? Our team is responsible for monitoring and measuring the reliability of our suite of origin service products and platform. In collaboration with Engineering and Product teams, we focus on improving the performance and reliability of the products we support.

Partner with the best

As a Senior Site Reliability Engineer, you will work closely with cross-functional teams to understand and improve the performance, availability and reliability of Akamai's origin service products. You'll define key performance indicators (KPIs), advance the state of monitoring, alerting and operational responses, and investigate complex performance issues.

As a Senior Site Reliability Engineer, you will be responsible for:

  1. Proactively monitoring, investigating, and analyzing performance and availability by (co)designing, managing, and tracking product-related SLIs/SLOs
  2. Solving problems and avoiding recurrence by developing tools and prototypes that proactively monitor service performance and availability
  3. Working closely with product engineers to advocate for reliable and scalable system design for supportability, resilience and reliability
  4. Leveraging skills in data analysis, network diagnostics and debugging tools to characterize performance and recommend improvements
  5. Collaborating with internal teams to help troubleshoot and resolve escalations and incidents for our customers

Do what you love

To be successful in this role you will:

  1. Have relevant experience and a Bachelor's degree in Computer Science, Engineering, or a related field
  2. Show experience in one of the scripting or procedural languages (Python, Perl, shell, C/C++, Java, etc.)
  3. Show knowledge of Oracle SQL for troubleshooting data integrity, reporting and data analysis
  4. Show fluency working in a UNIX/Linux computing environment
  5. Have excellent communication and organizational skills, and be able to articulate technical information in an easy-to-understand way
  6. Be a self-starter, proactively taking the initiative to improve results while keeping multiple projects on target

Work in a way that works for you

FlexBase, Akamai's Global Flexible Working Program, is based on the principles that are helping us create the best workplace in the world. When our colleagues said that flexible working was important to them, we listened. We also know flexible working is important to many of the incredible people considering joining Akamai. FlexBase gives 95% of employees the choice to work from their home, their office, or both (in the country advertised). This permanent workplace flexibility program is consistent and fair globally, to help us find incredible talent, virtually anywhere. We are happy to discuss working options for this role and encourage you to speak with your recruiter in more detail when you apply.

  • Mississauga, ON, Canada Mimecast Canada Limited Full time

    Senior Site Reliability Engineer. Locations: Canada - Mississauga - Remote. Time type: Full time. Posted 4 days ago. Job requisition ID: R4613. Help Build the Next Generation of Cloud-Scalable AI-Based Security Products Have a passion for software security? Excel...


  • Toronto, ON, Canada Thomson Reuters Full time

    (Canada) Site Reliability Engineer (Contract) Contract (9 months 4 days) Published 3 days ago New Relic Data Dog Site Reliability Engineer - in the Service Management Organization Do you have experience in IT Service Management, working with cloud providers, software development, and technology infrastructure? The Site Reliability Engineer will analyze...


  • Toronto, ON, Canada CB Canada Full time

    Site Reliability Engineer On behalf of our client in the Banking Sector, PROCOM is looking for a Site Reliability Engineer. Site Reliability Engineer – Job Description Azure cloud Jira and confluence CICD Experience with automating (provisioning, configuration management, deployment) and integrating Azure PaaS solutions (Azure App services, Azure...


  • Mississauga, ON, Canada Mimecast Full time

    Senior Site Reliability Engineer Help Build the Next Generation of Cloud-Scalable AI-Based Security Products Have a passion for software security? Excel at implementing public cloud at scale? Desire to apply Machine Learning to solve complex problems? This may well be the role for you. Our Communication and Collaboration Security products are cutting-edge...


  • Toronto, ON, Canada Akamai Full time

    Are you passionate about cutting edge technology? Do solving some of the Internet's most difficult content delivery challenges interest you? Join our Compute Site Reliability team! Our team is responsible for monitoring and measuring the reliability of our suite of Compute products and platform. In collaboration with Engineering and Product teams, we focus...


  • Toronto, ON, Canada ClickHouse Full time

    We are committed to providing our customers with reliable and secure services so we are building out our newly formed Site Reliability Engineering team. As one of the first joiners to our Reliability Engineering Team at ClickHouse, you will be responsible for building and leading processes to ensure the reliability, availability, scalability, and...


  • Toronto, Canada CB Canada Full time

    Site Reliability Engineer On behalf of our client in the Banking Sector, PROCOM is looking for a Site Reliability Engineer. Site Reliability Engineer – Job Description Azure cloud Jira and confluence CICD Experience with automating (provisioning, configuration management, deployment) and integrating Azure PaaS solutions (Azure App services, Azure...


  • Old Toronto, Canada Thomson Reuters Full time

    (Canada) Site Reliability Engineer (Contract) Contract (9 months 4 days) Published 3 days ago New Relic Data Dog Site Reliability Engineer - in the Service Management Organization Do you have experience in IT Service Management, working with cloud providers, software development, and technology infrastructure? The Site Reliability Engineer will...


  • Toronto, ON, Canada eTeam Full time

    Remote Work Duration 4 months - Preference is to find candidates who are willing to be converted to full-time employees. The conversion decision will be made based on performance. Job Description Role Description: Defining and measuring reliability goals—SLIs, SLOs, and error budgets for user journey. Designing for and implementing observability (ELK,...


  • Old Toronto, Canada CB Canada Full time

    Site Reliability Engineer On behalf of our client in the Banking Sector, PROCOM is looking for a Site Reliability Engineer. Site Reliability Engineer – Job Description Azure cloud Jira and confluence CICD Experience with automating (provisioning, configuration management, deployment) and integrating Azure PaaS solutions (Azure App services, Azure...


  • Toronto, Canada Autodesk Full time

    Position Overview Autodesk, the leading Design and Make Software Company, is looking for a Principal Site Reliability Engineer to join the Autodesk Platform Services Engineering team in Toronto, Canada. In this position, you will help build trusted services of APS (Autodesk Platform Services) as measured by Service Level Objectives (SLOs) and Mean...


  • Toronto, ON, Canada eTeam Full time

    Remote work Duration - 4 months - Preference is to find candidates who are willing to be converted to full time employee. The conversion decision will be made based on performance. Job Description: Role Desc: Defining and measuring reliability goals—SLIs, SLOs, and error budgets for user journey Designing for and implementing observability: (ELK,...


  • Toronto, Canada BMO Full time

    Application Deadline: 04/29/2024. Address: 33 Dundas Street West. This role is Hybrid (1-2 days per week in the office). The Director - Site Reliability Engineering will lead a team that will work with application teams, infrastructure teams, and business partners to continuously improve the stability, reliability and efficiency of Finance and Enterprise Risk...


  • Mississauga, ON, Canada Roche Full time

    The Position Senior Site Reliability Engineer (Kubernetes Platform) - Digital Products and Enablement The 21st century needs a 21st century healthcare system. To help build this, Roche is not only developing highly personalized medicine and advanced diagnostics, but also heavily investing into software and digital solutions. To speed up medical...


  • Toronto, ON, Canada Autodesk Full time

    Position Overview Virtual and augmented reality are transforming design and creation through new immersive and collaborative experiences to improve how major segments like entertainment, architecture, engineering, construction, and manufacturing converge. Many industries are being transformed by the growth of XR technology, creating new ways of working to...


  • Toronto, ON, Canada Lightspeed Full time

    Hi there! Thanks for stopping by. Are you actively looking for a new opportunity? Or just checking the market? Well… you might just be in the right place! We’re looking for a Principal Site Reliability Engineer to join our NuORDER by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and...


  • Toronto, Ontario, Canada Zortech Solutions Full time

    Hi, hope you are doing great. This is Priya Rajput from Zortech Solutions, reaching out about an exciting job opening. Kindly have a look at the job description and reply with your feedback. Role: Site Reliability Engineer. Location: Toronto, ON (Onsite). Duration: Full-time, Permanent. Skills and Responsibilities:...