Site Reliability Engineer
4 weeks ago
Overview
Job Title: Site Reliability Engineer
Location: Toronto, ON (Hybrid - 4x Onsite a Week)
Employment Type: Contract Opportunity
Interview Type: Face to Face (Onsite Interview Only)
Base pay range: CA$45.00/hr - CA$55.00/hr
This opportunity is with Aarorn Technologies Inc. Your actual pay will be based on your skills and experience; talk with your recruiter to learn more.

What is the Opportunity?
Seeking to hire a Senior Site Reliability Engineer for its Application Maintenance and Transformation, Data Services and Integration team. As a Senior Site Reliability Engineer, you will bring an engineering mindset of bold ambition, curiosity and outcome focus to ensuring the performance and reliability of our systems. This role requires a dynamic individual who excels in a collaborative environment, working with cross-functional teams to establish best practices for observability, monitoring, logging, alerting, and automation.

Responsibilities
Set vision for SRE product base (monitoring, alerting, self-healing, reliability testing).
Lead cross-functional collaborations to define and implement best practices for monitoring, logging, and incident response, driving a proactive stance on system health.
Function as portfolio SME: document common components, core functionalities, and infrastructure of supported applications.
Participate in deploying software applications, automation tools, and IT infrastructure.
Work with development teams to understand code changes and their impact on production, ensuring releases meet reliability standards.
Drive automation of SRE processes to increase operational efficiency.
Guide technical direction for future deployments, advocating for reliability and performance improvements.
Lead incident management and problem management for applications in scope, including RCA action items.
Debug production issues across services and provide primary operational support.
Perform occasional off-hours support.

Must-have
Bachelor’s degree in Computer Science, Electrical or Electronics Engineering or related field, or equivalent experience.
3+ years IT experience in software development and/or maintenance or SRE or DevOps Engineering.
1+ years experience building Java Spring Boot applications and REST API development.
Experience with relational databases (MS-SQL Server, MySQL, MariaDB, SingleStore, or in-memory distributed databases).
Experience with containerization (Docker) and orchestration (Kubernetes; Azure Kubernetes or OpenShift Kubernetes Service preferred).
Solid Git skills with experience with CI tools (Jenkins or UCD).
Experience on Windows and Linux infrastructure.
1+ years developing cloud-native applications using Java or Python.
Experience writing SQL queries and optimization skills.
Experience using centralized logging solutions (Splunk, ELK, etc.) and active monitoring systems (Dynatrace, etc.).
Experience deploying and operating cloud-native applications in a Private (OpenShift) or public cloud (Azure/AWS preferred).
Strong communication skills and the ability to work with cross-functional teams in large enterprises.
Financial Services domain knowledge preferably Capital Markets and Wealth Management.

Nice-to-have
Experience implementing dashboards to visualize logs and instrumentation (Grafana preferred).
Exposure to Datawarehouses like Informatica, Snowflake or Databricks and BI tools like SAP BO or similar.
Experience creating runbooks, processes, and test plans around reliability and performance of infrastructure and applications.
Exposure to PagerDuty, Postman, ServiceNow, SonarQube, NexusIQ and vault tools.
Exposure to event brokers like Kafka or IBM-MQ, mainframe tools and environment.
Exposure to Industry Disaster recovery test exercises.

Seniority level: Mid-Senior level
Job function: Engineering and Information Technology
Industries: IT Services and IT Consulting
-
Site Reliability Engineer
2 days ago
Toronto, Ontario, Canada Procom Full time $80,000 - $120,000 per year
Site Reliability Engineer (SRE) / Ingénieur Fiabilité des Sites
On behalf of our banking client, Procom is seeking a Site Reliability Engineer (SRE) for a 12-month contract role. This position is a hybrid role, 3 days a week onsite at our client's Montréal, Quebec office.
Site Reliability Engineer - Job Description: The Site Reliability Engineer is...
-
Site Reliability Engineer
4 weeks ago
Toronto, Canada Full time
Job Title: Site Reliability Engineer (Python & Cloud)
Location: Toronto, ON
Duration: 6 months with high possibility of extension
Skills Required: Digital: Python; Digital: Google Cloud; Digital: Site Reliability Engineering (SRE)
Job Description: Gather and analyze metrics from operating systems as well as applications to assist in performance tuning and fault...
-
Site Reliability Engineer
2 days ago
Toronto, Canada Maneva Full time
About Maneva
Maneva builds and deploys edge AI solutions powering real-time intelligence for industrial environments. Our systems run on distributed edge compute devices (NVIDIA Jetson platforms), integrate with local network cameras, PLCs, sensors, and other on-premise equipment, and securely communicate with cloud services via client- or site-based VPNs....
-
Site Reliability Engineer
3 weeks ago
Toronto, Canada Aarorn Technologies Inc Full time
Overview
Job Title: Site Reliability Engineer
Location: Toronto, ON (Hybrid - 4x Onsite a Week)
Employment Type: Contract Opportunity
Interview Type: Face to Face (Onsite Interview Only)
Base pay range: CA$45.00/hr - CA$55.00/hr
This opportunity is with Aarorn Technologies Inc. Your actual pay will be based on your skills and experience; talk with your...
-
Systems Reliability Engineer
1 week ago
(s): Canada : Ontario : Toronto Scotiabank Global Site Full time $120,000 - $180,000 per year
Requisition ID: 239640
Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture.
The Role
As a member of the Systems Reliability Engineering team, the System Reliability Engineer will collaborate closely with Engineering and development teams, peers, and business partners to continuously improve the stability,...