Site-Reliability Engineer
2 weeks ago
Aarorn Technologies Inc provided pay range This range is provided by Aarorn Technologies Inc. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Base pay range CA$40.00/hr - CA$50.00/hr Job Title: Site-Reliability Engineer (SRE) Location: Toronto, ON (3x onsite a week) Employment Type: Contract Pay Rate: CAD$40 - $50/HR INC Job Description We are seeking a highly skilled Site Reliability Engineer (SRE) to enhance the reliability, performance, and efficiency of mission‑critical batch workloads within Capital Markets Technology. In this role, you will serve as the technical lead for automation, application development, systems engineering, and observability using Dynatrace, with a primary focus on optimizing batch runtimes. If you are passionate about reducing latency, eliminating manual toil through automation, and building resilient systems that never fail, this is the perfect opportunity for you. This position is integral to our Operational Excellence Strategy and will drive the maturity of reliability engineering practices across the Capital Markets domain. Key Responsibilities Reliability & Performance: Optimize batch processing pipelines to ensure stability, reduce runtime, and minimize failure rates while engineering for resiliency. Observability: Implement and maintain monitoring solutions using Dynatrace; develop dashboards, alerts, and runbooks. Systems Engineering: Configure and tune Linux and Windows systems for optimal performance and resilience. Automation & Orchestration: Design and optimize Airflow DAGs; build and maintain CI/CD pipelines for automation. Incident Management: Lead incident response, root cause analysis, and postmortems; enforce SLOs and reliability best practices. Security & Compliance: Apply security best practices and ensure adherence to regulatory compliance in systems and automation. Qualifications Python Expertise: Advanced coding skills including performance tuning, concurrency (async/multiprocessing), testing, and packaging. Linux Systems: Deep knowledge of kernel/OS tuning, networking, filesystem optimization, process management, and troubleshooting. Dynatrace Proficiency: Experience with custom dashboards, KPIs, anomaly detection, tagging strategies, and alert configurations. Airflow Expertise: Strong understanding of DAG design, SLA management, scheduler/executor tuning, and scaling strategies. Proven experience optimizing batch workloads for performance, reliability, and cost efficiency. Solid understanding of distributed systems concepts such as retries, idempotency, backpressure, and data integrity. Strong backend systems and database optimization skills. Proficiency with CI/CD tools (GitHub Actions, Azure DevOps, Jenkins) and Infrastructure as Code (Terraform, Ansible). Hands‑on experience with containers and orchestration (Docker, Kubernetes). Excellent incident management and root cause analysis capabilities. Strong communication and collaboration skills. Disclaimer: As part of our hiring process, we may use automated or AI‑based tools to support candidate screening and application review. These tools are used to assist decision‑making and do not replace human judgment. #J-18808-Ljbffr
-
Site Reliability Engineer
7 days ago
(s): Canada : Ontario : Toronto Scotiabank Global Site Full timeRequisition ID: 245210Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture.The TeamGlobal Banking and Markets Engineering (GBME) is the fast-moving, award-winning technology engine that powers Scotiabank's Corporate, Investment Banking and Capital Markets businesses.The RoleGBME is searching for a Site...
-
Site Reliability Engineer
1 week ago
(s): Canada : Ontario : Toronto Scotiabank Global Site Full timeRequisition ID: 244026Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture.Overview: As a Site Reliability Engineer (SRE), you will join the Digital Engineering Operations team, responsible for ensuring the operations and reliability of Scotiabank digital applications. You will have the opportunity to drive...
-
Site Reliability Engineer
5 days ago
Toronto, Canada Kyndryl Full timeJoin to apply for the Site Reliability Engineer role at Kyndryl. Direct message the job poster from Kyndryl. Recruitment & Strategic Staffing @Kyndryl | Partnering with IT Consultants in Financial Services & Technology Position: Site Reliability Engineer Client: Financial Services - Capital Markets Technology Duration: 12-month contract with potential...
-
Site Reliability Engineer
3 days ago
Toronto, Canada Kyndryl Full timeJoin to apply for the Site Reliability Engineer role at Kyndryl. Direct message the job poster from Kyndryl. Recruitment & Strategic Staffing @Kyndryl | Partnering with IT Consultants in Financial Services & Technology Position: Site Reliability Engineer Client: Financial Services - Capital Markets Technology Duration: 12-month contract with potential...
-
Site Reliability Engineer
1 week ago
(s): Canada : Ontario : Toronto Scotiabank Global Site Full timeRequisition ID: 247129Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture.As a SRE, you will implement, measure and gather insights from Operational Level Indicators identifying areas for service improvements covering availability, performance, resilience, incidents and chronic problems. You will implement...
-
Site Reliability Engineer
1 week ago
Toronto, Canada Global Technical Talent Full timePrimary Job Title Site Reliability Engineer IV Alternate / Related Job Titles Site Reliability Engineer Senior SRE IT Reliability Engineer Systems Integration Engineer Location & Onsite Flexibility Toronto, ON — Hybrid (4 days onsite) Office Address: 66 Wellington Street West, 19th Floor, Toronto, ON Contract Details Position Type: Contract Contract...
-
Site Reliability Engineer
2 weeks ago
Toronto, Canada Moneris Full timeYour Moneris Career - The OpportunityAs the Site Reliability Engineer, you will help ensure the reliability, performance, and scalability of our systems. You will work with development and operations teams to build and maintain robust infrastructure, automate processes, and improve overall system health.Location: You will be based in our Toronto office,...
-
Site Reliability Engineer
4 weeks ago
Toronto, Canada Denvr Full timeSite Reliability Engineer - Platform Infrastructure Team (100% Remote - Canada) Denvr is a vertically integrated AI Platform Services company headquartered in Calgary, Canada. We provide foundational compute infrastructure and services to support the broader AI ecosystem and its end users. The platform includes cloud‑native solutions for training,...
-
Site Reliability Engineer
1 week ago
Toronto, Ontario, Canada Global Technical Talent, an Inc. 5000 Company Full timePrimary Job Title:Site Reliability Engineer IVAlternate / Related Job Titles:Site Reliability EngineerSenior SREIT Reliability EngineerSystems Integration EngineerLocation & Onsite Flexibility:Toronto, ON —Hybrid (4 days onsite)Office Address:66 Wellington Street West, 19th Floor, Toronto, ONContract DetailsPosition Type:ContractContract...
-
Site Reliability Engineer
4 weeks ago
Toronto, Canada Tecsys Inc. Full timeGet AI-powered advice on this job and more exclusive features. Having recognized the advantages of remote work, including employee morale, productivity, reduced commuting on employee wellbeing and the environment, we are proud to be a digital-first company. The technologies and programs in which we invested have provided a fantastic foundation to this end....