Principal Site Reliability Administrator

5 months ago


Waterloo, Canada OpenText Full time

**OPENTEXT - THE INFORMATION COMPANY**

As the Information Company, our mission at OpenText is to create software solutions and deliver services that redefine the future of digital. Be part of a winning team that leads the way in Enterprise Information Management.

**The Opportunity**:
**You Are Great At**:

- Developing and maintaining automation tools and frameworks for provisioning, configuring, and deploying software systems, following infrastructure as code (IaC) principles.
- Building and maintaining monitoring and alerting systems that proactively identify and address system issues, ensuring high availability and performance of production systems.
- Conducting system capacity and performance analysis, identifying and resolving bottlenecks, and recommending optimizations.
- Participating in incident response and on-call rotations to troubleshoot and resolve system failures, minimizing downtime and customer impact.
- Implementing and enforcing best practices for security, compliance, and data protection, working closely with security and compliance teams.
- Continuously evaluating and improving operational processes and procedures, driving automation and efficiency to enhance the overall reliability of our systems.
- Mentoring and providing technical guidance to junior SREs and cross-functional teams, promoting knowledge sharing and fostering a culture of collaboration.
- Staying up to date with industry trends and emerging technologies, and evaluating their potential to improve system reliability and performance.

**What it Takes**:

- Bachelor's or higher degree in Computer Science, Engineering, or a related field.
- 10+ years of experience in a Site Reliability Engineering or similar role.
- Strong programming and scripting skills in languages like Python, Go, or Ruby.
- Proficiency in cloud platforms (AWS, Azure, Google Cloud) and container technologies (Kubernetes, Docker).
- Hands-on experience with on-prem infrastructure such as VMware.
- Experience with infrastructure automation tools such as Terraform, Ansible, or Chef.
- Deep understanding of system monitoring and logging tools (e.g., Prometheus, Grafana, ELK Stack).
- Deep understanding of networking principles and experience with network protocols and tools (such as TCP/IP, DNS, and BGP) and service mesh technologies.
- Proven track record of designing and maintaining highly available and scalable systems.
- Excellent problem-solving skills and the ability to analyze complex systems to identify and resolve issues.
- Strong communication and collaboration skills, with the ability to work effectively in cross-functional teams.
- GCP, AWS, and CKA certifications are desirable.


