Lead Site Reliability Administrator
2 weeks ago
OPENTEXT - THE INFORMATION COMPANY
OpenText is a global leader in information management, where innovation, creativity, and collaboration are the key components of our corporate culture. As a member of our team, you will have the opportunity to partner with the most highly regarded companies in the world, tackle complex issues, and contribute to projects that shape the future of digital transformation.
AI-First. Future-Driven. Human-Centered.
At OpenText, AI is at the heart of everything we do—powering innovation, transforming work, and empowering digital knowledge workers. We're hiring talent that AI can't replace to help us shape the future of information management. Join us.
YOUR IMPACT
The role of the Site Reliability Administrator is to build solutions that enhance the availability, performance, and stability of OpenText services, and to automate away repetitive work, as part of a cloud DevOps organization.
This role would be a great fit for someone with creative and innovative problem-solving skills. You will develop and implement solutions that operate at scale. Our teams are empowered and expected to improve our products to truly deliver a reliable experience to customers.
WHAT THE ROLE OFFERS
- Use technical knowledge, creativity, and company practices to reduce the occurrence of incidents by developing proactive monitoring and alerting.
- Provide attention to incidents according to Service Level Agreements.
- Provide continuous feedback to development teams on system stability, defect analysis, and system enhancements.
- Develop runbooks and patterns to sustain applications in a production environment.
- Participate in technical discussions and drive transition-to-sustain activities with the development teams.
- Work with IT business and development partners to gather input for new capabilities in displaying, monitoring, and alerting on key performance indicators (KPIs) by tracking business transactions (BT) in real time.
- Take ownership and accountability for the incident resolution process, participating in RCA and SWAT investigations.
- Plan for validation and verification of changes deployed by infrastructure and development teams.
- Participate in day-to-day, real-time, advanced-level technical support and troubleshooting of issues reported by the user/customer base.
- Work rotating shifts as needed.
- Participate in the on-call rotation, as 24x7x365 support is required.
WHAT YOU NEED TO SUCCEED
- Strong expertise in Linux systems, with the ability to understand, maintain, and develop scripts (e.g., Shell, Python, Perl, JavaScript).
- Hands-on experience with cloud infrastructure (Google Cloud, AWS, Azure) and PaaS technologies such as Kubernetes, Cloud Foundry, and BOSH.
- Solid understanding and operational experience with containerization (Docker, rkt, Mesos) and microservices/RESTful architectures.
- Proficient in Continuous Delivery and automation practices and tools such as GitOps, Ansible, Rundeck, or Argo CD, enabling efficient deployment pipelines.
- Skilled in middleware and application support, including Apache, Tomcat, Spring, and Java-based frameworks such as Struts or Spark.
- Experienced with databases and storage, both RDBMS (Oracle, Postgres, MariaDB) and NoSQL (Cassandra), ensuring reliable data performance.
- Knowledgeable in monitoring and observability, using tools like New Relic, Dynatrace, AppDynamics, Zabbix, check_mk, and centralized logging systems like Graylog or Kibana.
- Familiar with messaging and search technologies such as Kafka, RabbitMQ, Solr, and Elasticsearch, supporting scalable distributed systems.
- Demonstrated ability to diagnose and troubleshoot complex issues in high-throughput applications, with a strong grasp of security best practices and ITIL principles.
- Proven leadership and collaboration skills—able to drive scalable solutions, manage multiple priorities, and work both independently and within cross-functional teams.
OpenText's efforts to build an inclusive work environment go beyond simply complying with applicable laws. Our Employment Equity and Diversity Policy provides direction on maintaining a working environment that is inclusive of everyone, regardless of culture, national origin, race, color, gender, gender identification, sexual orientation, family status, age, veteran status, disability, religion, or other basis protected by applicable laws.
If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please contact us.
Our proactive approach fosters collaboration, innovation, and personal growth, enriching OpenText's vibrant workplace.
-
Lead Site Reliability Administrator
4 weeks ago
Mississauga, Canada OpenText Full time
Lead Site Reliability Administrator at OpenText
OpenText is a global leader in information management, driving digital transformation through AI-powered solutions. The Site Reliability Administrator role focuses on building solutions that enhance the availability, performance, and stability of OpenText services within a cloud DevOps organization....
-
Lead Site Reliability Administrator
3 weeks ago
Mississauga, Canada OpenText Full time
Lead Site Reliability Administrator at OpenText
OpenText is a global leader in information management, driving digital transformation through AI-powered solutions. The Site Reliability Administrator role focuses on building solutions that enhance the availability, performance, and stability of OpenText services within a cloud DevOps organization....
-
Machinery Reliability Specialist
2 days ago
Etobicoke, ON MV C, Canada AVT Reliability Canada Full time $55,000 - $70,000 per year
Job Summary
AVT Reliability is a global leader in asset management, condition monitoring, and engineering support for industrial manufacturing and engineering clients. We specialize in predictive maintenance, condition monitoring, and asset reliability services across the petrochemical, manufacturing, and energy sectors. As we continue to grow, we are...
-
Lead Reliability Administrator
6 days ago
Mississauga, Canada Open Text Corporation Full time
Lead Reliability Administrator
Req id: 32655
Mississauga, ON, CA; Richmond Hill, ON, CA; Waterloo, ON, CA
OPENTEXT - THE INFORMATION COMPANY
As the Information Company, our mission at OpenText is to create software solutions and deliver services that redefine the future of digital. Be part of a winning team that leads the way in Enterprise...
-
Principal Site Reliability Administrator
7 days ago
Waterloo, ON NL A, Canada OpenText Full time $120,000 - $180,000 per year
OPENTEXT - THE INFORMATION COMPANY
OpenText is a global leader in information management, where innovation, creativity, and collaboration are the key components of our corporate culture. As a member of our team, you will have the opportunity to partner with the most highly regarded companies in the world, tackle complex issues, and contribute to projects that...
-
Lead Site Reliability Engineer
2 days ago
Toronto, ON MW A, Canada RBC Full time $900,000 - $1,250,000 per year
Job Description
What is the opportunity?
Join RBC as a Lead Site Reliability Engineer and take the lead in ensuring the reliability, scalability, and performance of our critical production systems and infrastructure. This is your chance to drive innovation through cutting-edge engineering practices, automation, and process optimization. Collaborate with...
-
Lead Reliability Administrator
6 days ago
Mississauga, Canada OpenText Full time
OPENTEXT - THE INFORMATION COMPANY
As the Information Company, our mission at OpenText is to create software solutions and deliver services that redefine the future of digital. Be part of a winning team that leads the way in Enterprise Information Management.
The Opportunity
As a Site Reliability Engineer (SRE) Senior, you will join a global team,...
-
Site Reliability Engineer
3 days ago
Mississauga, Canada Groupe Compass Quebec ltée. Full time
Join an award-winning culture. We have been recognized for being a Great Place to Work, in addition to being selected as a FORTUNE Global 500...
-
Site Reliability Engineer
6 days ago
Mississauga, Canada Groupe Compass Quebec ltée. Full time
Join an award-winning culture. We have been recognized for being a Great Place to Work, in addition to being selected as a FORTUNE Global 500...