Lead Site Reliability Administrator
6 days ago
OPENTEXT
OpenText is a global leader in information management, where innovation, creativity, and collaboration are the key components of our corporate culture. As a member of our team, you will have the opportunity to partner with the most highly regarded companies in the world, tackle complex issues, and contribute to projects that shape the future of digital transformation.
YOUR IMPACT
The role of the Site Reliability Administrator (1-year Contract) is to build solutions to enhance the availability, performance, and stability of OpenText services as well as automate away repetitive work as part of a cloud DevOps organization.
This role would be a great fit for someone with creative and innovative problem-solving skills. You will develop and implement solutions that operate at scale. Our teams are empowered and expected to improve our products to truly deliver a reliable experience to customers.
WHAT THE ROLE OFFERS
- Use technical knowledge, creativity, and company practices to drive down occurrences of incidents through the development of proactive monitoring and alerting.
- Respond to incidents according to Service Level Agreements.
- Provide continuous feedback to development teams on system stability, defect analysis, and system enhancements.
- Participate in technical discussions and drive transition to sustain activities with the development teams
- Work with IT business and development partners to gather input to develop new capabilities in displaying/monitoring/alerting on key performance indicators (KPIs) by tracking business transactions (BT) in real-time
- Take ownership and accountability for the incident resolution process, participating in RCA and SWAT investigations.
- Participate in day-to-day real-time advanced-level technical support and troubleshooting on issues reported from the user/customer base.
- Requires rotating shift work as needed.
- Participate in the on-call rotation, as 7x24x365 support is required.
WHAT YOU NEED TO SUCCEED
- Ability to understand and maintain scripts, with proficiency in PowerShell expected.
- Strong experience with IIS web servers, WCF services, and Microsoft Windows Server technologies (2016/2019/2025).
- Strong experience with AD, DNS, and F5 load-balancing technologies.
- Good understanding of OCR technology.
- Good understanding of performance and fine-tuning of Windows and web servers.
- Hands-on experience with cloud infrastructure (Google, AWS, or Azure) is a plus.
- Experience with PaaS technologies such as Cloud Foundry, Kubernetes, and BOSH.
- Good understanding of and operational experience with container technologies such as Docker, rkt, and Mesos.
- Good understanding of and working experience with microservices and RESTful architecture.
- Experience with CI/CD and IaC tools like Ansible, Rundeck, Terraform, and GitOps to set up pipelines and provision infrastructure as needed.
- Strong working knowledge of PaaS or application operations best practices.
- Operational understanding of or experience with message brokers such as Apache Kafka or RabbitMQ.
- Operational understanding of or experience with search technologies such as Solr or Elasticsearch.
- Experience with at least one scripting language such as shell, Perl, Python, or JavaScript.
- Experience with installing and configuring Apache, Tomcat, and IIS.
- Experience with and knowledge of RDBMS and NoSQL databases such as Oracle, Postgres, MariaDB, and Cassandra.
- Experience with APM tools such as New Relic, Dynatrace, or AppDynamics.
- Experience with monitoring tools such as Zabbix or check_mk.
- Knowledge and familiarity with centralized logging systems such as Graylog, Kibana, and cloud logging.
- Strong understanding of ITIL principles; certification is a plus.
- Passion for “getting under the hood” of systems and technologies to understand their inner workings and fix what needs fixing. This requires diagnosing and troubleshooting user-facing service incidents and outages.
- Knowledge of and familiarity with API gateways such as Apigee and the OAuth 2.0 standard.
OpenText's efforts to build an inclusive work environment go beyond simply complying with applicable laws. Our Employment Equity and Diversity Policy provides direction on maintaining a working environment that is inclusive of everyone, regardless of culture, national origin, race, color, gender, gender identification, sexual orientation, family status, age, veteran status, disability, religion, or other basis protected by applicable laws.