AWS Site Reliability Engineer
1 month ago
At SAP, we enable you to bring out your best. Our company culture is focused on collaboration and a shared passion to help the world run better. p>
The Reliability Engineering organization provides a multitude of products and services related to operations and continuity of business delivery.
The Site Reliability Engineering teams make the SAP Business Technology Platform run better by providing 24x7 deep technical coverage for Incident Management (Outages and other incidents with major customer impact) applying SRE principles. We share a Live Site First culture and care for the business continuity of our customers running mission-critical applications in the Cloud.
We are looking for an engineer to join an already established SRE team for the SAP Business Technology Platform.
EXPECTATIONS AND TASKS
As a Site Reliability Engineer, you will operate and support business-critical Cloud services. You will participate in the development of tools for monitoring and troubleshooting cloud services built on the latest open source and SAP technologies, following SRE principles.
Responsibilities
- Act as technical expert during Live site incidents, investigating and solving incidents on a deep technical level.
- Drive root cause analysis and follow-up improvements to prevent issues from reoccurring.
- Perform in-depth troubleshooting and log analysis to identify and solve complex issues in accordance with internal and external SLAs.
- Build software-based solutions to address improvements in service reliability and stability.
- Enhance infrastructure and platform monitoring by gathering system metrics (4 Golden Signals) and implementing tools for recovery.
- Create and maintain technical documentation.
- Participate in the on-call rotation (follow the sun approach) to react to major incidents.
If you are interested in software engineering based on cutting-edge technology, you will find an inspiring and professional environment for your learning and growth. li>
Fluency in English, basic French.Preferred Additional Skills and Competencies:
- Coding experience with Python, Bash, GO.
- Experience with Unix/Linux operating system.
- Experience with modern monitoring, logging, and alerting tools (Grafana, Prometheus, Kibana, Loki, Splunk On-Call, Dynatrace).
- Contribution to open-source projects.
WORK EXPERIENCE
If you are interested in this position and would like to join our team, please apply even if you don’t meet all the qualifications listed in the job posting.
SAP'S DIVERSITY COMMITMENT
To harness the power of innovation, SAP invests in the development of its diverse employees. We aspire to leverage the qualities and appreciate the unique competencies that each person brings to the company.
EOE AA M/F/Vet/Disability:
Qualified applicants will receive consideration for employment without regard to their age, race, religion, national origin, ethnicity, age, gender, sexual orientation, gender identity or expression, protected veteran status, or disability.
Requisition ID: 410197 | Work Area: Software-Development Operations | Expected Travel: 0 - 10% | Career Status: Professional | Employment Type: Regular Full Time | Additional Locations: #LI-Hybrid
#-
AWS Site Reliability Engineer
3 months ago
Montreal, Canada Banque Nationale du Canada Full timep>Area of Interest: Information technologyAs a Specialist in site reliability engineering on the National Bank Data Protection team, you will ensure the operational reliability of data protection assets. With your experience and knowledge in the operational management of high-availability assets (HA), you will have a positive impact on the Bank's stability...
-
AWS Site Reliability Engineer
1 month ago
Montreal, Canada SAP SE Full timep>We help the world run betterAt SAP, we enable you to bring out your best. Our company culture is focused on collaboration and a shared passion to help the world run better. We focus every day on building the foundation for tomorrow and creating a workplace that embraces differences, values flexibility, and is aligned to our purpose-driven and...
-
AWS Site Reliability Engineer
4 days ago
Montreal, Canada SAP SE Full timep>We help the world run betterAt SAP, we enable you to bring out your best. Our company culture is focused on collaboration and a shared passion to help the world run better. We focus every day on building the foundation for tomorrow and creating a workplace that embraces differences, values flexibility, and is aligned to our purpose-driven and...
-
Site Reliability Engineer 3
3 days ago
Montreal, Canada Behavox Full timeAbout the Role The Behavox Platform is a scalable, fault-tolerant, and highly performant storage and processing system that allows us to manage and analyze massive volumes of data. We have an extensive and flexible set of APIs to develop products that allow our clients to work through millions of data items, by searching, filtering, and visualizing...
-
Site Reliability Engineer
3 days ago
Montreal, Canada SAP SE Full timeWe help the world run betterAt SAP, we enable you to bring out your best. Our company culture is focused on collaboration and a shared passion to help the world run better. We offer a highly collaborative, caring team environment with a strong focus on learning and development, recognition for your individual contributions, and a variety of benefit options...
-
Site Reliability Engineer
4 days ago
Montreal, Canada National Bank Full timeAs a Specialist in site reliability engineering on the National Bank Data Protection team, you will ensure the operational reliability of data protection assets. With your experience and knowledge in the operational management of high-availability assets (HA), you will have a positive impact on the Bank's stability and reputation with its internal and...
-
Site Reliability Engineer
2 months ago
Montreal, Canada National Bank Full timeAs a Specialist in site reliability engineering on the National Bank Data Protection team, you will ensure the operational reliability of data protection assets. With your experience and knowledge in the operational management of high-availability assets (HA), you will have a positive impact on the Bank's stability and reputation with its internal and...
-
Site Reliability Engineer
4 days ago
Montreal, Canada National Bank Full timeAs a Specialist in site reliability engineering on the National Bank Data Protection team, you will ensure the operational reliability of data protection assets. With your experience and knowledge in the operational management of high-availability assets (HA), you will have a positive impact on the Bank's stability and reputation with its internal and...
-
Site Reliability Engineer
3 months ago
Montreal, Canada National Bank Full timeAs a Specialist in site reliability engineering on the National Bank Data Protection team, you will ensure the operational reliability of data protection assets. With your experience and knowledge in the operational management of high-availability assets (HA), you will have a positive impact on the Bank's stability and reputation with its internal and...
-
Site Reliability Engineer
4 days ago
Montreal, Canada National Bank Full timeAs a Specialist in site reliability engineering on the National Bank Data Protection team, you will ensure the operational reliability of data protection assets. With your experience and knowledge in the operational management of high-availability assets (HA), you will have a positive impact on the Bank's stability and reputation with its internal and...
-
Site Reliability Engineer
1 week ago
Montreal, Canada Domtar Full timeSoftware-Development Operations Site Reliability Engineer We help the world run better At SAP, we enable you to bring out your best. Our company culture is focused on collaboration and a shared passion to help the world run better. How? We focus every day on building the foundation for tomorrow and creating a...
-
Site Reliability Engineer
3 months ago
Montreal, Canada National Bank Full timeAs a Specialist in site reliability engineering on the National Bank Data Protection team, you will ensure the operational reliability of data protection assets. With your experience and knowledge in the operational management of high-availability assets (HA), you will have a positive impact on the Bank's stability and reputation with its internal and...
-
Site Reliability Engineer
1 week ago
Montreal, Canada Domtar Full timeSoftware-Development OperationsSite Reliability EngineerWe help the world run betterAt SAP, we enable you to bring out your best. Our company culture is focused on collaboration and a shared passion to help the world run better. How? We focus every day on building the foundation for tomorrow and creating a workplace that embraces differences, values...
-
Site Reliability Engineer
4 weeks ago
Montreal, Canada National Bank of Canada Full timeAs a Specialist in site reliability engineering on the National Bank Data Protection team, you will ensure the operational reliability of data protection assets. With your experience and knowledge in the operational management of high-availability assets (HA), you will have a positive impact on the Bank's stability and reputation with its internal and...
-
Site Reliability Engineer
1 week ago
Montreal, Canada Domtar Full timeSoftware-Development Operations Site Reliability Engineer We help the world run better At SAP, we enable you to bring out your best. Our company culture is focused on collaboration and a shared passion to help the world run better. How? We focus every day on building the foundation for tomorrow and creating a workplace that embraces...
-
Site Reliability Engineer
1 week ago
Montreal, Canada Domtar Full timeSoftware-Development Operations Site Reliability Engineer We help the world run better At SAP, we enable you to bring out your best. Our company culture is focused on collaboration and a shared passion to help the world run better. How? We focus every day on building the foundation for tomorrow and creating a workplace that embraces...
-
Site Reliability Engineer
4 days ago
Montreal, Canada Banque Nationale du Canada Full timeArea of Interest: Information technology As a Specialist in site reliability engineering on the National Bank Data Protection team, you will ensure the operational reliability of data protection assets. With your experience and knowledge in the operational management of high-availability assets (HA), you will have a positive impact on the Bank's stability...
-
Site Reliability Engineer
3 days ago
Montreal, Canada Banque Nationale du Canada Full timeArea of Interest: Information technologyAs a Specialist in site reliability engineering on the National Bank Data Protection team, you will ensure the operational reliability of data protection assets. With your experience and knowledge in the operational management of high-availability assets (HA), you will have a positive impact on the Bank's stability and...
-
Site Reliability Engineer
2 months ago
Montreal, Quebec, Québec, Canada Soho Square Solutions Full timeSite Reliability Engineer (SRE) - ServiceNow, Application InfrastructureThe Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to drive reliability engineering, operations, and customer support services for a ServiceNow SaaS implementation. Reporting to a Site Reliability Engineering & Operations Lead, this role involves...
-
Site Reliability Engineer
3 months ago
Montreal, Quebec, G4F, CA National Bank Full timeAs a Specialist in site reliability engineering on the National Bank Data Protection team, you will ensure the operational reliability of data protection assets. With your experience and knowledge in the operational management of high-availability assets (HA), you will have a positive impact on the Bank's stability and reputation with its internal and...