Systems Reliability Engineer
3 weeks ago
OverviewRequisition ID: 239640. Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture. The Role: As a member of the Systems Reliability Engineering team, the System Reliability Engineer will collaborate closely with Engineering and development teams, peers, and business partners to continuously improve the stability, reliability and efficiency of our Global Systems through Site Reliability Engineering (SRE) based principles and practices in support of our rapidly changing technology product portfolio. You will advance the tools and techniques to increase the reliability of operational environments while fostering a culture of resiliency within the Development and Engineering teams. You will be part of key initiatives and help to design and enhance the monitoring and alerting of the state of our production systems. Building end-to-end Service Level Objectives and successfully implementing them is a key focus of the Systems Reliability team and your contribution will be key to plan, build, refine and deploy practices and solutions that will improve stability, reliability and efficiency of our systems and services. The team has a strong focus on being people first and promotes support and training on both an individual and team level.ResponsibilitiesWork in collaboration with Director, System Reliability Engineering as well as with Software Development, Product and Data Engineering teams to Champion SRE/ DevOps culture and practicesBuilds end to end service level objectives by creating user journeys and mapping systems and services to capture the required data to support those objectivesParticipate and deliver in initiatives to continuously refine, plan and deploy practices for improved stability, reliability, efficiency, repeatability and security. You’ll help to create plans, collaborate with other SROs and DevOps team members to increase service levels in support of resiliency objectivesCreate and manage stability dashboards, utilized by technology and business for the purpose of tracking stability and reliability trends; participate in incident calls and contribute to recovery and impact communicationContribute to incident post-mortem activities as per organizational governance and requirementsReview and evaluate major projects/changes before production deployment to ensure adequate monitoring, support and deployment plans are in placeParticipate in the SRE on-call stability rotationChampions a high-performance culture and contributes to an inclusive work environmentQualificationsTop notch engineer with experience in Software Engineering, Site Reliability Engineering and technology operationsExpertise with monitoring/observability tools such as Dynatrace, Splunk, AQA, DataDog, Prometheus, GrafanaExperience with cloud technologies such as GCP/Azure/AWSExperience using CI tools and techniques such as Jenkins, Bitbucket, GitHub, Docker, KubernetesExperience with ITSM tools (ServiceNow) and a strong understanding of SRE principles5+ years of IT experience with at least 3 years in an SRE/DevOps roleExposure to artificial intelligence and machine learning models is a strong assetExperience in building medium and complex Power BIKnowledge of Unix/Linux and coding languages (Java, Python) is a strong assetDegree in Computer Science, Engineering, or equivalent experience. ITIL Foundation certification is an assetExcellent verbal and written communication skillsWhat's in it for you?Inclusive and collaborative environment that encourages creativity and curiosity and celebrates successTools and technology to create meaningful customer experiencesOpportunities to learn from diverse industry leaders from top technology companiesTalent-based hiring with opportunities for growth and career developmentCasual dress code; comfortable environmentAccess to thousands of online and in-person coursesCompetitive rewards package including base salary, performance bonus, pension and profit sharing, paid vacation, personal & sick days, medical, vision, and dental benefitsLocation(s): Canada : Ontario : TorontoScotiabank is a leading bank in the Americas. Guided by our purpose: "for every future", we help our customers, their families and their communities achieve success through a broad range of advice, products and services, including personal and commercial banking, wealth management and private banking, corporate and investment banking, and capital markets.At Scotiabank, we value the unique skills and experiences each individual brings to the Bank, and are committed to creating and maintaining an inclusive and accessible environment for everyone. If you require accommodation during the recruitment and selection process, please let our Recruitment team know. Candidates must apply directly online to be considered for this role. Only those candidates selected for an interview will be contacted.Seniority level: Mid-Senior levelEmployment type: Full-timeJob function: Engineering and Information TechnologyIndustries: BankingReferrals increase your chances of interviewing at Scotiabank by 2x #J-18808-Ljbffr
-
Systems Reliability Engineer
4 weeks ago
Toronto, Canada Scotiabank Full timeOverviewRequisition ID: 239640. Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture. The Role: As a member of the Systems Reliability Engineering team, the System Reliability Engineer will collaborate closely with Engineering and development teams, peers, and business partners to continuously improve the...
-
Systems Reliability Engineer
3 weeks ago
Toronto, Canada Scotiabank Full timeOverview Requisition ID: . Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture. The Role: As a member of the Systems Reliability Engineering team, the System Reliability Engineer will collaborate closely with Engineering and development teams, peers, and business partners to continuously improve the...
-
Systems Reliability Engineer
4 weeks ago
Toronto, Canada Scotiabank Full timePress Tab to Move to Skip to Content Link Select how often (in days) to receive an alert: Requisition ID: 239640Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture. The Role As a member of the Systems Reliability Engineering team, the System Reliability Engineer will collaborate closely with Engineering and...
-
Systems Reliability Engineer
3 weeks ago
Toronto, Canada Scotiabank Full timePress Tab to Move to Skip to Content Link Select how often (in days) to receive an alert: Requisition ID: 239640 Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture. The Role As a member of the Systems Reliability Engineering team, the System Reliability Engineer will collaborate closely with Engineering and...
-
Systems Reliability Engineer
3 weeks ago
Toronto, Canada Scotiabank Full timePress Tab to Move to Skip to Content Link Select how often (in days) to receive an alert: Requisition ID: 239640Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture. The Role As a member of the Systems Reliability Engineering team, the System Reliability Engineer will collaborate closely with Engineering and...
-
Systems Reliability Engineer
3 weeks ago
Toronto, Canada Scotiabank Full timePress Tab to Move to Skip to Content Link Select how often (in days) to receive an alert: Requisition ID: Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture. The Role As a member of the Systems Reliability Engineering team, the System Reliability Engineer will collaborate closely with Engineering and...
-
Performance Reliability Engineer
2 weeks ago
Toronto, Ontario, Canada Cerebras Systems Full time $120,000 - $180,000 per yearCerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to...
-
Systems Reliability Engineer
1 week ago
(s): Canada : Ontario : Toronto Scotiabank Global Site Full time $120,000 - $180,000 per yearRequisition ID: 239640Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture.The RoleAs a member of the Systems Reliability Engineering team, the System Reliability Engineer will collaborate closely with Engineering and development teams, peers, and business partners to continuously improve the stability,...
-
Senior Site Reliability Engineer
2 weeks ago
Toronto, Ontario, Canada Apex Systems Full time $120,000 - $180,000 per yearSenior Site Reliability EngineerApex Systems is a global IT services provider, and our staffing practice has an opening for an SRE with extensive OpenShift Clusters experience, strong GitOps and ArgoCD knowledge, and solid F5 LTM load balancer configuration capabilities to place at our client, an industry leading technology company.Client:A Fortune 100...
-
Reliability Engineer
2 weeks ago
Toronto, Canada Chelsea Avondale Full timeChelsea Avondale is the world’s most cutting-edge home insurance group. We have developed sophisticated risk modeling and insurance pricing technologies for home insurance and deploy that technology through our own insurance company. Our team consists of some of the brightest minds in insurance, software development, finance, and operations. Our group...