Site Reliability Administrator

3 months ago


Waterloo, Canada Open Text Corporation Full time

**Site Reliability Administrator**:

- Req id: 35055- Waterloo, ON, CA Richmond Hill, ON, CA Mississauga, ON, CA**OPENTEXT - THE INFORMATION COMPANY**

As the Information Company, our mission at OpenText is to create software solutions and deliver services that redefine the future of digital. Be part of a winning team that leads the way in Enterprise Information Management.

**The Opportunity**
The role Cloud Application Engineer/Site Reliability Engineer is to build solutions to enhance the availability, performance, and stability of OpenText services as well as automate away repetitive work as part of a cloud dev ops organization.
This role would be a great fit for someone with creative and innovative problem-solving skills. You will develop and implement solutions that operate at scale. Our teams are empowered and expected to improve our products to deliver a reliable experience to customers truly.

**You Are Great At**
Collaborates with Agile squads/developers, sustain and business partners and provides significant contributions to develop specifications to resolve problems, and to address enhancement needs focusing in areas of logging, monitoring, and metrics for operational readiness- Uses technical knowledge, creativity, and company practices to drive down occurrences of incidents through the development of proactive monitoring and alerting.
- Provide attention to incidents according to Service Level Agreements.
- Provide continuous feedback to development teams on system stability, defect analysis, and system enhancements
- Participate in technical discussions and drive a transition to sustain activities with the development teams
- Work with IT business and development partners to gather input to develop new capabilities in displaying/monitoring/alerting on key performance indicators (KPIs) by tracking business transactions (BT) in real-time
- Take ownership and accountability for the incident resolution process, participating in RCA and SWAT investigations.
- Plan for validation and verification of changes deployed by infrastructure teams, and development teams.
- Participate in day-to-day real-time advanced-level technical support and troubleshooting on issues reported by the user/customer base.
- Establish and maintain a good relationship with team members, Product Development, Product Management, Customer Service, Client management, and other cross-functional teams.
- Participate in training and information-sharing activities.
- Act as backup for other team members when necessary.
- Requires rotating shift work as needed.
- On-call rotation is required, as 7x24x365 support is required.

**What It Takes**
- The ability to understand and maintain Scripting Software
- Deep understanding of Linux systems
- Hands-on experience with cloud infrastructure; Google, AWS, or Azure a plus
- Experience with PaaS technologies such as Cloud Foundry, Kubernetes, and Bosh.
- Good understanding and operational experience with container technologies like Docker, rkt, Mesos.
- Good understanding and working experience with microservices and RESTful architecture.
- Experience with Continuous delivery tools like Ansible, Rundeck, or Argo CD to set up automated pipelines as needed.
- Strong working knowledge of aPaaS or Application operations best practices.
- Operational understanding or experience with message brokers such as Apache Kafka or RabbitMQ.
- Operational understanding or experience with search technologies such as Solr search or Elasticsearch.
- Experience in supporting middleware technologies such as Apache, Tomcat, and Spring.
- Experience with at least one scripting language such as shell, Perl, python, javascript, etc
- Experience with installing and configuring Apache and Tomcat.
- Experience and knowledge in RDBMS and No-Sql databases such as Oracle, Postgres, MariaDB, and Cassandra.
- Experience with APM tools such as Newrelic, Dynatrace, or AppDyanmics.
- Experience with monitoring tools such as Zabbix or check_mk.
- Knowledge and familiarity with centralized logging systems such as Graylog or Kibana.
- Strong understanding of ITIL principles, certification is a plus.
- Is passionate about “getting under the hood” of systems and technologies to understand their inner workings, and fix what needs fixing. This requires diagnosing & troubleshooting user facing service incidents & outages
- Knowledge and familiarity with API gateway such as APIGEE and Oauth 2.0 standard.
- Proven problem-solving and analytical ability.
- Excellent organizational/time management skills.
- Ability to handle multiple tasks concurrently.
- Ability to lead, drive and implement highly scalable and complex solutions
- A strong understanding of Security best practices.
- A proven record of being able to work independently and collaboratively.



  • Waterloo, Canada Open Text Corporation Full time

    **Lead Site Reliability Administrator**: - Req id: 38426- Waterloo, ON, CA Mississauga, ON, CA Richmond Hill, ON, CA**OPENTEXT - THE INFORMATION COMPANY** As the Information Company, our mission at OpenText is to create software solutions and deliver services that redefine the future of digital. Be part of a winning team that leads the way in Enterprise...


  • Waterloo, Canada opentext Full time

    **OPENTEXT - THE INFORMATION COMPANY** Together Carbonite and Webroot form the SMB and Consumer Division of OpenText. The mission of our joint offering is to make cyber resilience simple, reliable and accessible in the connected world. We foster a thriving, dynamic environment rich with inventive minds and entrepreneurial spirit and our employees are...


  • Waterloo, Canada opentext Full time

    **OPENTEXT - THE INFORMATION COMPANY** As the Information Company, our mission at OpenText is to create software solutions and deliver services that redefine the future of digital. Be part of a winning team that leads the way in Enterprise Information Management. **The Opportunity** The role Cloud Application Engineer/Site Reliability Engineer is to build...


  • Waterloo, Canada opentext Full time

    **OPENTEXT - THE INFORMATION COMPANY** As the Information Company, our mission at OpenText is to create software solutions and deliver services that redefine the future of digital. Be part of a winning team that leads the way in Enterprise Information Management. **The Opportunity** As a Site Reliability Engineer (SRE) Senior, you will join a global team,...


  • Waterloo, Canada Procom Full time

    ```html Site Reliability Engineer Procom is seeking a Site Reliability Engineer for a contract role with one of our clients in the financial sector. Site Reliability Engineer Job Details: As an Automation Developer, you will be responsible for delivering automated solutions to complex problems. Site Reliability Engineer Responsibilities: Design, develop and...


  • Waterloo, Canada Procom Full time

    ```html Site Reliability Engineer Procom is seeking a Site Reliability Engineer for a contract role with one of our clients in the financial sector. Site Reliability Engineer Job Details: As an Automation Developer, you will be responsible for delivering automated solutions to complex problems. Site Reliability Engineer Responsibilities: Design, develop and...


  • Waterloo, Canada opentext Full time

    **OPENTEXT - THE INFORMATION COMPANY** As the Information Company, our mission at OpenText is to create software solutions and deliver services that redefine the future of digital. Be part of a winning team that leads the way in Enterprise Information Management. **The Opportunity**: **You Are Great AT**: - Develop and maintain automation tools and...


  • Waterloo, Canada Procom Full time

    Site Reliability Engineer Procom is seeking a Site Reliability Engineer for a contract role with one of our clients in the financial sector. Job Details: As a Site Reliability Engineer, you will be responsible for delivering automated solutions to complex problems. Responsibilities: Design, develop and support technical solutions that automate agent...


  • Waterloo, Canada Procom Full time

    Site Reliability Engineer Procom is seeking a Site Reliability Engineer for a contract role with one of our clients in the financial sector. Job Details: As a Site Reliability Engineer, you will be responsible for delivering automated solutions to complex problems. Responsibilities: Design, develop and support technical solutions that automate agent...


  • Waterloo, Canada Carta, Inc. Full time

    Carta is a platform that helps people manage equity, build businesses, and invest in the companies of tomorrow. Our mission is to unlock the power of equity ownership for more people in more places.Carta is trusted by more than 40,000 companies and over two million people in nearly 160 countries to manage cap tables, compensation, and valuations. Carta also...


  • Waterloo, Canada Carta, Inc. Full time

    Carta is a platform that helps people manage equity, build businesses, and invest in the companies of tomorrow. Our mission is to unlock the power of equity ownership for more people in more places.Carta is trusted by more than 40,000 companies and over two million people in nearly 160 countries to manage cap tables, compensation, and valuations. Carta also...


  • Waterloo, Ontario, M2L, City of Toronto, Canada Procom Full time

    ```html Site Reliability Engineer Procom is seeking a Site Reliability Engineer for a contract role with one of our clients in the financial sector. Site Reliability Engineer Job Details: As an Automation Developer, you will be responsible for delivering automated solutions to complex problems. Site Reliability Engineer Responsibilities: Design, develop and...


  • Waterloo, Ontario, M2L, City of Toronto, Canada Procom Full time

    Site Reliability Engineer Procom is seeking a Site Reliability Engineer for a contract role with one of our clients in the financial sector. Job Details: As a Site Reliability Engineer, you will be responsible for delivering automated solutions to complex problems. Responsibilities: Design, develop and support technical solutions that automate agent...


  • Waterloo, Canada Airbus Full time

    **Job Summary**: NAVBLUE, an Airbus Company, is currently seeking a Head of SRM to join our growing team. This position is responsible for providing leadership to the Platform Reliability Team. The Platform Reliability Team consists of Platform Administrators and Platform Architects who collectively provide tools and services which enable Customer Experience...


  • Waterloo, Canada Google Inc. Full time

    Software Developer Manager II, Site Reliability EngineeringLocation: Waterloo, ON, CanadaMinimum Requirements:Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.8 years of experience with data structures or algorithms.5 years of experience with software development in one or more programming languages.3 years of...


  • Waterloo, Canada Google Inc. Full time

    Software Developer Manager II, Site Reliability EngineeringLocation: Waterloo, ON, CanadaMinimum Requirements:Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.8 years of experience with data structures or algorithms.5 years of experience with software development in one or more programming languages.3 years of...


  • Waterloo, Canada Google Inc. Full time

    Software Developer Manager II, Site Reliability EngineeringLocation: Waterloo, ON, CanadaMinimum Requirements:Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.8 years of experience with data structures or algorithms.5 years of experience with software development in one or more programming languages.3 years of...


  • Waterloo, Canada Google Inc. Full time

    Software Developer Manager II, Site Reliability Engineering link Copy link corporate_fare Google place Waterloo, ON, Canada Advanced Experience owning outcomes and decision making, solving ambiguous problems and influencing stakeholders;deep expertise in domain. Apply link Copy link Bachelor’s degree in Computer Science, a related...


  • Waterloo, Canada Google Inc. Full time

    Software Developer Manager II, Site Reliability Engineering link Copy link corporate_fare Google place Waterloo, ON, Canada Advanced Experience owning outcomes and decision making, solving ambiguous problems and influencing stakeholders;deep expertise in domain. Apply link Copy link Bachelor’s degree in Computer Science, a related...


  • Waterloo, Canada Google Inc. Full time

    Software Developer Manager II, Site Reliability Engineering link Copy link corporate_fare Google place Waterloo, ON, Canada Advanced Experience owning outcomes and decision making, solving ambiguous problems and influencing stakeholders;deep expertise in domain. Apply link Copy link Bachelor’s degree in Computer Science, a related...