Senior Site Reliability Engineer

1 month ago


Toronto ON, Canada Akamai Full time

Are you intrigued by planetary scale, distributed, intelligent systems? Do you like collaborating across teams to solve complex problems? Join our highly skilled Site Reliability Engineering team. Our team designs, develops, and manages applications and infrastructure that support Akamai's Compute products and services. We do this while maintaining Akamai's mission at the forefront of what we do: make life better for billions of people, billions of times a day.

Partner with the best

As a Senior SRE in the VHP team, you will be at the forefront of Akamai Connected Cloud compute host technologies. Our team is responsible for the host Linux platform from the hardware to the guest VM images kernel, OS, custom KVM/QEMU virtualization layer, guest images, and working closely with next-generation HW teams to ensure new hardware programs succeed in our datacenters.

As a Senior Site Reliability Engineer, you will be responsible for:

  1. Developing, testing, and distributing changes to software, services, and tools the VHP team is responsible for.
  2. Designing and implementing enhancements to VHP observability infrastructure in order to identify and correct problems before they impact our customers.
  3. Developing subject matter expertise in VHP components.
  4. Identifying and implementing automation best practices for existing products and processes.
  5. Collaborating with our support, operations, and engineering teams to investigate and troubleshoot complex problems.
  6. Participating in on-call rotations, guiding restoration and repair of service-impacting issues.

Do what you love

To be successful in this role you will:

  1. Possess expert level experience in Linux internals, system administration, and a deep understanding of underlying hardware.
  2. Possess advanced level experience with the Linux kernel, OS, and optimization of their configurations for KVM/QEMU virtualization.
  3. Possess advanced level experience with designing, developing, and deploying software and infrastructure at scale.
  4. Possess advanced level experience in a DevOps, Development, or SysAdmin role working with large scale distributed systems.
  5. Experience with tools like SaltStack for managing infrastructure at scale.
  6. Have great communication and interpersonal skills.
  7. Have relevant experience and a Bachelor's diploma in Computer Engineering, Computer Science, or equivalent.

Work in a way that works for you

FlexBase, Akamai's Global Flexible Working Program, is based on the principles that are helping us create the best workplace in the world. When our colleagues said that flexible working was important to them, we listened. We also know flexible working is important to many of the incredible people considering joining Akamai. FlexBase gives 95% of employees the choice to work from their home, their office, or both (in the country advertised). This permanent workplace flexibility program is consistent and fair globally, to help us find incredible talent, virtually anywhere. We are happy to discuss working options for this role and encourage you to speak with your recruiter in more detail when you apply.

What makes Akamai a great place to work

Connect with us on social and see what life at Akamai is like We power and protect life online, by solving the toughest challenges, together. At Akamai, we're curious, innovative, collaborative, and tenacious. We celebrate diversity of thought, and we hold an unwavering belief that we can make a meaningful difference. Our teams use their global perspectives to put customers at the forefront of everything they do, so if you are people-centric, you'll thrive here.

Working for you

At Akamai, we will provide you with opportunities to grow, flourish, and achieve great things. Our benefit options are designed to meet your individual needs for today and in the future. We provide benefits surrounding all aspects of your life:

  1. Your health
  2. Your finances
  3. Your family
  4. Your time at work
  5. Your time pursuing other endeavors

Our benefit plan options are designed to meet your individual needs and budget, both today and in the future.

About us

Akamai powers and protects life online. Leading companies worldwide choose Akamai to build, deliver, and secure their digital experiences, helping billions of people live, work, and play every day. With the world's most distributed compute platform from cloud to edge, we make it easy for customers to develop and run applications, while we keep experiences closer to users and threats farther away.

Join us

Are you seeking an opportunity to make a real difference in a company with a global reach and exciting services and clients? Come join us and grow with a team of people who will energize and inspire you Akamai Technologies is an Affirmative Action, Equal Opportunity Employer that values the strength that diversity brings to the workplace. All qualified applicants will receive consideration for employment and will not be discriminated against on the basis of gender, gender identity, sexual orientation, race/ethnicity, protected veteran status, disability, or other protected group status. #LI-Remote

If no date is displayed, applications are being accepted on an ongoing basis until the job is filled.

#J-18808-Ljbffr

  • Mississauga, ON, Canada Mimecast Canada Limited Full time

    Senior Site Reliability Engineer page is loaded Senior Site Reliability Engineer Apply locations Canada - Mississauga - Remote time type Full time posted on Posted 4 Days Ago job requisition id R4613 Senior Site Reliability Engineer Help Build the Next Generation of Cloud-Scalable AI-Based Security Products Have a passion for software security? Excel...


  • Mississauga, ON, Canada Mimecast Canada Limited Full time

    Senior Site Reliability Engineer page is loaded Senior Site Reliability Engineer Apply locations Canada - Mississauga - Remote time type Full time posted on Posted 4 Days Ago job requisition id R4613 Senior Site Reliability Engineer Help Build the Next Generation of Cloud-Scalable AI-Based Security Products Have a passion for software security? Excel...


  • Toronto, ON, Canada Thomson Reuters Full time

    (Canada) Site Reliability Engineer (Contract) Contract (9 months 4 days) Published 3 days ago New Relic Data Dog Site Reliability Engineer - in the Service Management Organization Do you have experience in IT Service Management, working with cloud providers, software development, and technology infrastructure? The Site Reliability Engineer will analyze...


  • Old Toronto, Canada Lloyds Banking Group Full time

    Job Description - Senior Site Reliability EngineerJOB TITLE: Senior Site Reliability Engineer (SRE)LOCATION: Halifax, Leeds or ManchesterHOURS: Full-timeWORKING PATTERN: Our work style is hybrid, which involves spending at least two days per week, or 40% of our time, at one of our office sites.Who are Lloyds Banking Group and where does this role sit?If you...


  • Old Toronto, Canada Lloyds Banking Group Full time

    Job Description - Senior Site Reliability EngineerJOB TITLE: Senior Site Reliability Engineer (SRE)LOCATION: Halifax, Leeds or ManchesterHOURS: Full-timeWORKING PATTERN: Our work style is hybrid, which involves spending at least two days per week, or 40% of our time, at one of our office sites.Who are Lloyds Banking Group and where does this role sit?If you...


  • Toronto, ON, Canada Autodesk Full time

    Position Overview Autodesk, the leading Design and Make Software Company, is looking for a Principal Site Reliability Engineer to join the Autodesk Platform Services Engineering team in Toronto, Canada. In this role, you will help build trusted services of APS (Autodesk Platform Services) measured by Service Level Objectives (SLOs) and Mean Time to...


  • Toronto, ON, Canada Jobber Full time

    Jobber exists to help people in small businesses be successful. We work with small home service businesses, like your local plumbers, painters, and landscapers, to transform the way service is delivered through technology. With Jobber they can quote, schedule, invoice, and collect payments from their customers, while providing an easy and professional...


  • Toronto, ON, Canada OnX Canada Full time

    OnX is looking for a Site Reliability Engineer for one our clients in Toronto. Client: Financial ServicesLocation: Toronto, mostly remoteDuration: 6 months with potential extensionJBoss in middleware experience is super importantResponsibilities:Following the senior technicians plans to build out lower environments with functioning software stacks including...


  • Toronto, ON, Canada OnX Canada Full time

    OnX is looking for a Site Reliability Engineer for one our clients in Toronto. Client: Financial ServicesLocation: Toronto, mostly remoteDuration: 6 months with potential extensionJBoss in middleware experience is super importantResponsibilities:Following the senior technicians plans to build out lower environments with functioning software stacks including...


  • Toronto, ON, Canada OnX Canada Full time

    OnX is looking for a Site Reliability Engineer for one our clients in Toronto. Client: Financial ServicesLocation: Toronto, mostly remoteDuration: 6 months with potential extensionJBoss in middleware experience is super importantResponsibilities:Following the senior technicians plans to build out lower environments with functioning software stacks including...


  • Toronto, ON, Canada OnX Canada Full time

    OnX is looking for a Site Reliability Engineer for one our clients in Toronto. Client: Financial ServicesLocation: Toronto, mostly remoteDuration: 6 months with potential extensionJBoss in middleware experience is super importantResponsibilities:Following the senior technicians plans to build out lower environments with functioning software stacks including...


  • Mississauga, ON, Canada Mimecast Full time

    Senior Site Reliability Engineer Help Build the Next Generation of Cloud-Scalable AI-Based Security Products Have a passion for software security? Excel at implementing public cloud at scale? Desire to apply Machine Learning to solve complex problems? This may well be the role for you. Our Communication and Collaboration Security products are cutting-edge...


  • toronto, Canada OnX Canada Full time

    OnX is looking for a Site Reliability Engineer for one our clients in Toronto. Client: Financial Services Location: Toronto, mostly remote Duration: 6 months with potential extension JBoss in middleware experience is super important Responsibilities: Following the senior technicians plans to buil


  • toronto, Canada OnX Canada Full time

    OnX is looking for a Site Reliability Engineer for one our clients in Toronto. Client: Financial Services Location: Toronto, mostly remote Duration: 6 months with potential extension JBoss in middleware experience is super important Responsibilities: Following the senior technicians plans to buil


  • Toronto, ON, Canada OnX Canada Full time

    OnX is looking for a Site Reliability Engineer for one our clients in Toronto. Client: Financial Services Location: Toronto, mostly remote Duration: 6 months with potential extension JBoss in middleware experience is super important Responsibilities: Following the senior technicians plans to build out lower environments with functioning software...


  • Toronto, ON, Canada OnX Canada Full time

    OnX is looking for a Site Reliability Engineer for one our clients in Toronto. Client: Financial Services Location: Toronto, mostly remote Duration: 6 months with potential extension JBoss in middleware experience is super important Responsibilities: Following the senior technicians plans to build out lower environments with functioning software...


  • Old Toronto, Canada Practice Better Full time

    About us:Practice Better is a leading all-in-one practice management software solution transforming how health & wellness professionals run their practices and support their clients. The company serves 15,000+ customers in over 70+ countries across the globe, and processes hundreds of millions annually in payments on behalf of customers. Over 65% of growth...


  • Old Toronto, Canada Practice Better Full time

    About us:Practice Better is a leading all-in-one practice management software solution transforming how health & wellness professionals run their practices and support their clients. The company serves 15,000+ customers in over 70+ countries across the globe, and processes hundreds of millions annually in payments on behalf of customers. Over 65% of growth...


  • Toronto, Canada Thomson Reuters Full time

    Thomson Reuters is seeking a Senior Site Reliability Engineer to join our Service Management, Technology team. This role calls for an individual who is capable of analyzing customer problems of high complexity and assessing the scope of impact, while mitigating customer impact of issues and executing work arounds. Willingness to learn is an important aspect...


  • Toronto, Canada Thomson Reuters Full time

    Thomson Reuters is seeking a Senior Site Reliability Engineer to join our Service Management, Technology team. This role calls for an individual who is capable of analyzing customer problems of high complexity and assessing the scope of impact, while mitigating customer impact of issues and executing work arounds. Willingness to learn is an important aspect...