Site Reliability Engineer
1 week ago
We are looking for an experienced Site Reliability Engineer or Platform Operations Engineer for our client. This is a permanent position that is remote to start with later relocation to Calgary or Winnipeg. Our client is a global enterprise company with a product that you've likely used.
You Will:
- Own development projects, providing technical guidance and delivering against the Platform & Service Operations Engineering roadmap.
- Designing and Implementing Wargames to test our operational response and identify areas of weakness in our platforms.
- Technical and Management Escalation point for Service Operations Centre (SOC) engineers and during major incidents.
- Troubleshooting, reproducing and mitigating issues in our production environments
- Mentoring other team members.
- Operate global AWS Platforms at scale
- Evidence of Strong Troubleshooting, problem-solving and investigative skills
- Experience of AWS or Other cloud providers
- Experience developing in Java
- Major incident management on experience operating production platforms at scale
- Experience working with distributed web applications
- Experience Automating operational tasks / Processes using other languages
- Understanding of relational and/or NoSQL data structures
- Experience mentoring/influencing peers
- Identifying improvements, highlighting risks vs benefits, and translating them into technical requirements
- Worked with Ansible, Terraform, Python
- Experience working with Serverless / Containers
- Experience of ELK &/Or Graphite/Prometheus / Grafana
- Used Tracing Tools in production before
- Experience in Chaos Engineering / Failure Injection Testing
- Experience of working in an Agile Environment
- Experience working in a similar site reliability role
-
Site Reliability Engineer
6 days ago
Ontario, Canada Orion Innovation Full timeJob Description: Senior Site Reliability Engineer (SRE) with Kubernetes & Rancher Location: Canada - Remote [Working EST hours] Job Type: Full-time About the Role Are you an exceptional Site Reliability Engineer with a passion for building and maintaining highly resilient and secure systems? We are seeking a Senior SRE to join our team and play a critical...
-
Senior Site Reliability Engineer
3 weeks ago
Fredericton, Canada Cvent Full timeCvent is a leading meetings, events, and hospitality technology provider with more than 5,000+ employees and 24,000+ customers worldwide, including 60% of the Fortune 500. Founded in 1999, Cvent delivers a comprehensive event marketing and management platform for marketers and event professionals and offers software solutions to hotels, special event venues...
-
Senior Site Reliability Engineer
3 weeks ago
Fredericton, Canada Cvent Full timeCvent is a leading meetings, events, and hospitality technology provider with more than 5,000+ employees and 24,000+ customers worldwide, including 60% of the Fortune 500. Founded in 1999, Cvent delivers a comprehensive event marketing and management platform for marketers and event professionals and offers software solutions to hotels, special event venues...
-
Senior Site Reliability Engineer
3 weeks ago
Fredericton, Canada Cvent Full timeCvent is a leading meetings, events, and hospitality technology provider with more than 5,000+ employees and 24,000+ customers worldwide, including 60% of the Fortune 500. Founded in 1999, Cvent delivers a comprehensive event marketing and management platform for marketers and event professionals and offers software solutions to hotels, special event venues...
-
Site Reliability Engineer
3 days ago
Vancouver, Canada LayerZero Labs Full timeJoin to apply for the Site Reliability Engineer role at LayerZero Labs Join to apply for the Site Reliability Engineer role at LayerZero Labs Get AI-powered advice on this job and more exclusive features. The Future is Omnichain. LayerZeroThe Future is Omnichain. Founded in 2021, LayerZero’s vision is to create a community of cross-chain developers,...
-
Site Reliability Engineer
3 days ago
Montreal, Canada ApTask Full timeDirect message the job poster from ApTask Looking for an intermediate between 2 to 5 years' experience. The Application Infrastructure (Al) department is seeking a Site Reliability Engineer (SRE) to help drive the reliability engineering, operations and customer support services clients ServiceNow SaaS implementation. Reporting to a Site Reliability...
-
Site Reliability Engineer
1 week ago
Vancouver, Canada LayerZero Labs Full timeJoin to apply for the Site Reliability Engineer role at LayerZero LabsJoin to apply for the Site Reliability Engineer role at LayerZero LabsGet AI-powered advice on this job and more exclusive features.The Future is Omnichain.LayerZeroThe Future is Omnichain.Founded in 2021, LayerZero’s vision is to create a community of cross-chain developers, building...
-
Site Reliability Engineer
1 week ago
Vancouver, Canada LayerZero Labs Full timeJoin to apply for the Site Reliability Engineer role at LayerZero LabsJoin to apply for the Site Reliability Engineer role at LayerZero LabsGet AI-powered advice on this job and more exclusive features.The Future is Omnichain.LayerZeroThe Future is Omnichain.Founded in 2021, LayerZero’s vision is to create a community of cross-chain developers, building...
-
Site Reliability Engineer
2 days ago
Ontario, Canada Apptoza Inc. Full timeHI, Hope you are doing Great, If you are fine with below JD please share me your Updated resume ASAP. Site Reliability Engineer Location: TORONTO (ONSITE) Duration: 6 months Exp Required: 10 Years Job Description: Job Title : SRE Technical/Functional Skills • 8+ years of overall IT experience. • Advanced Linux / Unix support experience required. •...
-
Site Reliability Engineer
3 days ago
Ontario, Canada Apptoza Inc. Full timeHI, Hope you are doing Great, If you are fine with below JD please share me your Updated resume ASAP. Site Reliability Engineer Location: TORONTO (ONSITE) Duration: 6 months Exp Required: 10 Years Job Description: Job Title : SRE Technical/Functional Skills • 8+ years of overall IT experience. • Advanced Linux / Unix support experience required. •...