Systems Reliability Engineer

3 weeks ago


Toronto, Canada Scotiabank Full time

Overview Requisition ID: . Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture. The Role: As a member of the Systems Reliability Engineering team, the System Reliability Engineer will collaborate closely with Engineering and development teams, peers, and business partners to continuously improve the stability, reliability and efficiency of our Global Systems through Site Reliability Engineering (SRE) based principles and practices in support of our rapidly changing technology product portfolio. You will advance the tools and techniques to increase the reliability of operational environments while fostering a culture of resiliency within the Development and Engineering teams. You will be part of key initiatives and help to design and enhance the monitoring and alerting of the state of our production systems. Building end-to-end Service Level Objectives and successfully implementing them is a key focus of the Systems Reliability team and your contribution will be key to plan, build, refine and deploy practices and solutions that will improve stability, reliability and efficiency of our systems and services. The team has a strong focus on being people first and promotes support and training on both an individual and team level. Responsibilities Work in collaboration with Director, System Reliability Engineering as well as with Software Development, Product and Data Engineering teams to Champion SRE/ DevOps culture and practices Builds end to end service level objectives by creating user journeys and mapping systems and services to capture the required data to support those objectives Participate and deliver in initiatives to continuously refine, plan and deploy practices for improved stability, reliability, efficiency, repeatability and security. You’ll help to create plans, collaborate with other SROs and DevOps team members to increase service levels in support of resiliency objectives Create and manage stability dashboards, utilized by technology and business for the purpose of tracking stability and reliability trends; participate in incident calls and contribute to recovery and impact communication Contribute to incident post-mortem activities as per organizational governance and requirements Review and evaluate major projects/changes before production deployment to ensure adequate monitoring, support and deployment plans are in place Participate in the SRE on-call stability rotation Champions a high-performance culture and contributes to an inclusive work environment Qualifications Top notch engineer with experience in Software Engineering, Site Reliability Engineering and technology operations Expertise with monitoring/observability tools such as Dynatrace, Splunk, AQA, DataDog, Prometheus, Grafana Experience with cloud technologies such as GCP/Azure/AWS Experience using CI tools and techniques such as Jenkins, Bitbucket, GitHub, Docker, Kubernetes Experience with ITSM tools (ServiceNow) and a strong understanding of SRE principles 5+ years of IT experience with at least 3 years in an SRE/DevOps role Exposure to artificial intelligence and machine learning models is a strong asset Experience in building medium and complex Power BI Knowledge of Unix/Linux and coding languages (Java, Python) is a strong asset Degree in Computer Science, Engineering, or equivalent experience. ITIL Foundation certification is an asset Excellent verbal and written communication skills What's in it for you? Inclusive and collaborative environment that encourages creativity and curiosity and celebrates success Tools and technology to create meaningful customer experiences Opportunities to learn from diverse industry leaders from top technology companies Talent-based hiring with opportunities for growth and career development Casual dress code; comfortable environment Access to thousands of online and in-person courses Competitive rewards package including base salary, performance bonus, pension and profit sharing, paid vacation, personal & sick days, medical, vision, and dental benefits Location(s): Canada : Ontario : Toronto Scotiabank is a leading bank in the Americas. Guided by our purpose: "for every future", we help our customers, their families and their communities achieve success through a broad range of advice, products and services, including personal and commercial banking, wealth management and private banking, corporate and investment banking, and capital markets. At Scotiabank, we value the unique skills and experiences each individual brings to the Bank, and are committed to creating and maintaining an inclusive and accessible environment for everyone. If you require accommodation during the recruitment and selection process, please let our Recruitment team know. Candidates must apply directly online to be considered for this role. Only those candidates selected for an interview will be contacted. Seniority level: Mid-Senior level Employment type: Full-time Job function: Engineering and Information Technology Industries: Banking Referrals increase your chances of interviewing at Scotiabank by 2x #J-18808-Ljbffr



  • Toronto, Canada Scotiabank Full time

    OverviewRequisition ID: 239640. Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture. The Role: As a member of the Systems Reliability Engineering team, the System Reliability Engineer will collaborate closely with Engineering and development teams, peers, and business partners to continuously improve the...


  • Toronto, Canada Scotiabank Full time

    OverviewRequisition ID: 239640. Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture. The Role: As a member of the Systems Reliability Engineering team, the System Reliability Engineer will collaborate closely with Engineering and development teams, peers, and business partners to continuously improve the...


  • Toronto, Canada Scotiabank Full time

    Press Tab to Move to Skip to Content Link Select how often (in days) to receive an alert: Requisition ID: 239640Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture. The Role As a member of the Systems Reliability Engineering team, the System Reliability Engineer will collaborate closely with Engineering and...


  • Toronto, Canada Scotiabank Full time

    Press Tab to Move to Skip to Content Link Select how often (in days) to receive an alert: Requisition ID: 239640Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture. The Role As a member of the Systems Reliability Engineering team, the System Reliability Engineer will collaborate closely with Engineering and...


  • Toronto, Canada Scotiabank Full time

    Press Tab to Move to Skip to Content Link Select how often (in days) to receive an alert: Requisition ID: 239640 Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture. The Role As a member of the Systems Reliability Engineering team, the System Reliability Engineer will collaborate closely with Engineering and...


  • Toronto, Canada Scotiabank Full time

    Press Tab to Move to Skip to Content Link Select how often (in days) to receive an alert: Requisition ID: Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture. The Role As a member of the Systems Reliability Engineering team, the System Reliability Engineer will collaborate closely with Engineering and...


  • Toronto, Ontario, Canada Cerebras Systems Full time $120,000 - $180,000 per year

    Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to...


  • (s): Canada : Ontario : Toronto Scotiabank Global Site Full time $120,000 - $180,000 per year

    Requisition ID: 239640Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture.The RoleAs a member of the Systems Reliability Engineering team, the System Reliability Engineer will collaborate closely with Engineering and development teams, peers, and business partners to continuously improve the stability,...


  • Toronto, Ontario, Canada Apex Systems Full time $120,000 - $180,000 per year

    Senior Site Reliability EngineerApex Systems is a global IT services provider, and our staffing practice has an opening for an SRE with extensive OpenShift Clusters experience, strong GitOps and ArgoCD knowledge, and solid F5 LTM load balancer configuration capabilities to place at our client, an industry leading technology company.Client:A Fortune 100...

  • Reliability Engineer

    2 weeks ago


    Toronto, Canada Chelsea Avondale Full time

    Chelsea Avondale is the world’s most cutting-edge home insurance group. We have developed sophisticated risk modeling and insurance pricing technologies for home insurance and deploy that technology through our own insurance company. Our team consists of some of the brightest minds in insurance, software development, finance, and operations. Our group...