System Reliability Engineer
4 weeks ago
About the Role:
We are seeking a highly skilled System Reliability Engineer to join our team at Scotiabank. As a key member of our Systems Reliability Office, you will play a critical role in ensuring the stability and reliability of our technology portfolio.
Key Responsibilities:
- Champion a customer-focused culture to deepen client relationships and leverage broader Bank relationships, systems, and knowledge.
- Accountable for creating, maintaining, and distributing Service Level Objectives (SLOs) data and reports/dashboards of our technology portfolio to various stakeholders across the organization.
- Champion stability and reliability across a portfolio of applications and services by working closely with service owner teams to continuously improve Mean Time To Recovery (MTTR) metrics and reduce downtime, leading troubleshooting of our most severe incidents, and participating in incident root cause analysis to prevent recurrence.
- Contribute to prioritizing reliability features with service owners and engineering teams.
- Contribute to the design, development, and delivery of effective tooling, alerts, and automated responses to identify and address reliability risks, and to the automation of SLOs.
- Participate in incident calls and, when required, lead communications on impact and recovery status.
- Ensure information on incidents and problems is complete and accurate, and that action items are being worked on by assigned individuals.
- Produce weekly/monthly/quarterly status reporting or dashboards on incidents and problems for distribution to business and technology stakeholders.
- Participate in and drive post-incident activities as per organizational governance and requirements.
- Interface with teams across technology and business partners on stability and reliability concerns and system disruptions, providing incident details and root causes.
Requirements:
- Degree in Computer Science, Engineering, Business Management/Commerce, or equivalent experience.
- 8+ years of experience in the industry (Software development, DevOps, Service Management) with at least 3 years in a leadership capacity.
- Experience with creating and maintaining system performance dashboards.
- Experience with analyzing and troubleshooting systems.
- Excellent communication (both verbal and written) skills, with the ability to communicate confidently and clearly on conference calls, in meetings, via email, etc. at all levels of the organization.
- Ability to quickly and clearly communicate incident status via email in business-friendly language.
- Experience with IT Service Management (ITSM) tools (ServiceNow a plus), with a strong understanding of SRE and service management principles.
Nice to Haves:
- Experience or familiarity with the Financial industry.
- Systematic problem-solving approach, coupled with effective communication skills and a sense of drive.
- Experience with Performance and Capacity Management (PCM) tools (e.g., Dynatrace, Splunk).
- Well-rounded broad knowledge of OS platforms (Linux/UNIX), Networking, Web Systems, and IT Ops.
- ITIL Foundation Certification.
What We Offer:
- Diversity, Equity, Inclusion & Allyship - We strive to create an inclusive culture where every employee is empowered to reach their fullest potential, respected for who they are, and embraced through bias-free practices and inclusive values across Scotiabank.
- Accessibility and Workplace Accommodations - We value the unique skills and experiences each individual brings to the Bank, and are committed to creating and maintaining an inclusive and accessible environment for everyone.
- Upskilling through online courses, cross-functional development opportunities, and tuition assistance.
- Competitive Rewards program including bonus, flexible vacation, personal and sick days, and benefits that start on day one.
- Community Engagement - no matter where you choose to work from, we offer opportunities for community engagement and belonging with our various programs such as hackathons, contests, cooking with friends, Humans of Digital, and much more.
Work arrangements: Hybrid.
Location(s): Canada : Ontario : Toronto
-
System Reliability Engineer
2 weeks ago
Old Toronto, Canada Interac Corp. Full time
Job Summary: We are seeking a highly skilled System Reliability Engineer to join our team at Interac Corp. as we continue to shape the future of digital payments in Canada.
-
Reliability Engineer
2 months ago
Old Toronto, Canada Thomson Reuters Full time
About the Role: We are seeking a skilled Reliability Engineer - Cloud Systems to join our team at Thomson Reuters. As a Reliability Engineer - Cloud Systems, you will be responsible for analyzing and resolving chronic and major issues affecting our cloud-based services. Key responsibilities include: Designing and implementing scalable systems and...
-
System Reliability Engineer
3 months ago
Old Toronto, Canada Scotiabank Full time
Requisition ID: 207317. Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture. The Team: You will work cross-functionally amongst a variety of teams and be a contributor in all significant deliverables to the Systems Reliability Office stakeholders. You will also have an understanding of ‘what could go wrong’,...
-
AWS Engineer
4 weeks ago
Old Toronto, Canada Street Context Full time
We're seeking a seasoned Site Reliability Engineer with a passion for designing and implementing robust, scalable systems on AWS. About Street Context: We provide a premium Email, Analytics, and Broker Relationship platform for capital markets and institutional investors. Scale our system to meet increasing global demand by collaborating with development...
-
Reliability Systems Engineer
4 weeks ago
Toronto, Ontario, Canada Teranet Inc. Full time
About Teranet: Teranet is a leading innovator in electronic services and solutions, operating one of the most advanced and secure registration systems worldwide. Job Summary: We are seeking a highly skilled Site Reliability Engineer to join our DevOps team. The ideal candidate will possess strong software engineering principles and infrastructure expertise to...
-
System Reliability Expert
4 weeks ago
Old Toronto, Canada Chelsea Avondale Full time
We are Chelsea Avondale, the world's most innovative home insurance group. Our team comprises highly skilled professionals in software development, finance, operations, and insurance. Our organization is transforming the Canadian and global insurance landscape by leveraging cutting-edge risk modeling and insurance pricing technologies. We are seeking a...
-
Asset Reliability Engineer
3 months ago
Old Toronto, Canada Chelsea Avondale Full time
Chelsea Avondale is the world’s most cutting-edge home insurance group. We have developed sophisticated risk modeling and insurance pricing technologies for home insurance and deploy that technology through our own insurance company. Our team consists of some of the brightest minds in insurance, software development, finance, and operations. Our group...
-
Reliable System Professional
2 weeks ago
Old Toronto, Canada Thomson Reuters Full time
About the Role: As a key member of our Service Management Organization, you will be responsible for designing and operating scalable systems and services that meet the needs of our customers. We are seeking an experienced Site Reliability Engineer to lead the work in driving efficiencies and reducing service operations risks. This is a contract position (9...
-
Hardware Design Reliability Engineer
2 months ago
Old Toronto, Canada Aversan Inc Full time
Hardware Design Reliability Engineer, North York, Ontario. Position Summary: Responsible for the hardware reliability activities regarding the hardware products within Engineering perimeter. Essential Functions / Key Areas of Responsibility: Monitor the hardware reliability of the hardware systems in the field. Maintain a table with all the hardware returns...
-
Hardware Reliability Engineer Lead
4 weeks ago
Old Toronto, Canada Aversan Inc Full time
Join Aversan Inc as a Hardware Reliability Engineer Lead. North York, Ontario. About the Role: We are seeking an experienced Hardware Reliability Engineer to lead our hardware reliability activities. Duties and Responsibilities: Maintain a comprehensive understanding of hardware systems in the field. Manage a table with all hardware returns from the field, including...
-
Systems Architectural Specialist
4 weeks ago
Old Toronto, Canada System One Full time
Are you a seasoned engineering professional with experience in systems integration and architectural design? Do you have a strong background in leading technical teams and collaborating with cross-functional groups? We are seeking a Senior Systems Integration Engineer to join our team at System One. As a key member of our engineering group, you will be...
-
AWS Site Reliability Engineer
2 months ago
Old Toronto, Canada Street Context Full time
Are you a Site Reliability Engineer who has a passion for building reliable, resilient and performant systems that scale? We are on a mission to build and strengthen our engineering teams to match the accelerating success of Street Context. We provide a premium Email, Analytics and Broker Relationship platform, purpose-built for capital markets and...
-
Enterprise Reliability Engineer
4 weeks ago
Old Toronto, Canada Interac Full time
At Interac, we design and deliver innovative products and solutions that empower Canadians to take control of their finances and achieve their goals. With a strong focus on software development and maintenance, you will play a key role in ensuring the reliability and performance of our high-traffic payment system. As an Enterprise Reliability Engineer, you...
-
Cloud Reliability Engineer
4 weeks ago
Old Toronto, Canada TD Bank Full time
About TD Bank: TD Bank is a leading financial institution committed to delivering exceptional customer experiences and innovative banking solutions. Job Summary: We are seeking an experienced Cloud Reliability Engineer to join our team in Toronto, Ontario. This role will be responsible for ensuring the stability, scalability, and reliability of our cloud-based...
-
Asset Reliability Engineer
4 weeks ago
Old Toronto, Canada JLL Full time
JLL empowers you to shape a brighter way. Our people at JLL and JLL Technologies are shaping the future of real estate for a better world by combining world class services, advisory and technology for our clients. We are committed to hiring the best, most talented people and empowering them to thrive, grow meaningful careers and to find a place where they...
-
Hardware Reliability Solutions Engineer
2 weeks ago
Old Toronto, Canada Aversan Inc Full time
Job Summary: Aversan Inc is seeking a skilled Hardware Reliability Solutions Engineer to join their team in North York, Ontario. As a key member of the Engineering group, this role will focus on ensuring the reliability of hardware products within the company's perimeter. Responsibilities: Monitor and analyze hardware reliability trends across various systems in...
-
Cloud Reliability Engineer
3 weeks ago
Old Toronto, Canada Royal Bank of Canada Full time
We are seeking a skilled Cloud Reliability Engineer to join our Digital team at RBC in Toronto, Canada. As a Cloud Reliability Engineer, you will be responsible for running the production environment by monitoring availability and taking a holistic view of system health. This includes debugging production issues across services and levels of the stack,...
-
AWS Site Reliability Engineer
2 months ago
Old Toronto, Canada Soda Full time
Job Description. Job Title: Site Reliability Engineer. Location: Poland - Fully Remote. Salary: 324K PLN or 27.3K monthly. Start: ASAP. Stack: AWS, Docker, Kubernetes, Terraform, Jenkins, Ansible, Linux, JavaScript, and Lambda. Are you a seasoned DevOps/SRE professional passionate about building high-performance, scalable systems? I am working with a Media/IT...
-
AWS Site Reliability Engineer
4 weeks ago
Old Toronto, Canada Tecsys Full time
Tecsys is a fast-growing innovator offering supply chain solutions to industry-leading healthcare systems, hospitals, and pharmacy businesses to distributors, retailers, and 3PLs. As a Cloud Infrastructure Specialist, you will be responsible for ensuring the reliability and uptime of our platform and applications in a data-driven way to support internal and...
-
Senior Reliability and Safety Engineer
2 weeks ago
Old Toronto, Canada oilandgas Full time
About AtkinsRéalis: A world leader in engineering and design consultancy, we deliver innovative solutions that improve people's lives. Our Vision for the Future: We aim to engineer a better future for our planet and its people. Our Safety and Systems Assurance team is dedicated to ensuring the safety and reliability of complex projects across various...