Site Reliability Engineer

4 weeks ago


Toronto, Canada Aarorn Technologies Inc Full time

OverviewJob Title: Site Reliability EngineerLocation: Toronto, ON (Hybrid - 4x Onsite a Week)Employment Type: Contract OpportunityInterview Type: Face to Face (Onsite Interview Only)Base pay range: CA$45.00/hr - CA$55.00/hrThis opportunity is with Aarorn Technologies Inc. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.What is the Opportunity?Seeking to hire a Senior Site Reliability Engineer for its Application Maintenance and Transformation, Data Services and Integration team. As a Senior Site Reliability Engineer, you will bring an engineering mindset of bold ambition, curiosity and outcome focus to ensuring the performance and reliability of our systems. This role requires a dynamic individual who excels in a collaborative environment, working with cross-functional teams to establish best practices for observability, monitoring, logging, alerting, and automation.ResponsibilitiesSet vision for SRE product base (monitoring, alerting, self-healing, reliability testing).Lead cross-functional collaborations to define and implement best practices for monitoring, logging, and incident response, driving a proactive stance on system health.Function as portfolio SME – document common components, core functionalities, and infrastructure of supported applications.Participate in deploying software applications, automation tools, and IT infrastructure.Work with development teams to understand code changes and their impact on production, ensuring releases meet reliability standards.Drive automation of SRE processes to increase operational efficiency.Guide technical direction for future deployments, advocating for reliability and performance improvements.Lead incident management and problem management for applications in scope, including RCA action items.Debug production issues across services and provide primary operational support.Perform occasional off-hours support.Must-haveBachelor’s degree in Computer Science, Electrical or Electronics Engineering or related field, or equivalent experience.3+ years IT experience in software development and/or maintenance or SRE or DevOps Engineering.1+ years experience building Java Spring Boot applications and REST API development.Experience with relational databases (MS-SQL Server, MySQL, MariaDB, SingleStore, or in-memory distributed databases).Experience with containerization (Docker) and orchestration (Kubernetes; Azure Kubernetes or OpenShift Kubernetes Service preferred).Solid Git skills with experience with CI tools (Jenkins or UCD).Experience on Windows and Linux infrastructure.1+ years developing cloud-native applications using Java or Python.Experience writing SQL queries and optimization skills.Experience using centralized logging solutions (Splunk, ELK, etc.) and active monitoring systems (Dynatrace, etc.).Experience deploying and operating cloud-native applications in a Private (OpenShift) or public cloud (Azure/AWS preferred).Strong communication skills and the ability to work with cross-functional teams in large enterprises.Financial Services domain knowledge preferably Capital Markets and Wealth Management.Nice-to-haveExperience implementing dashboards to visualize logs and instrumentation (Grafana preferred).Exposure to Datawarehouses like Informatica, Snowflake or Databricks and BI tools like SAP BO or similar.Experience creating runbooks, processes, and test plans around reliability and performance of infrastructure and applications.Exposure to PagerDuty, Postman, ServiceNow, SonarQube, NexusIQ and vault tools.Exposure to event brokers like Kafka or IBM-MQ, mainframe tools and environment.Exposure to Industry Disaster recovery test exercises.Seniority levelMid-Senior levelJob functionEngineering and Information TechnologyIndustriesIT Services and IT Consulting #J-18808-Ljbffr



  • Toronto, Ontario, Canada Procom Full time $80,000 - $120,000 per year

    Site Reliability Engineer (SRE)/ Ingénieur Fiabilité des SitesOn behalf of our banking client, Procom is seeking a Site Reliability Engineer (SRE) for a 12-month contract role. This position is a hybrid role, 3 days a week onsite at our client's Montréal, Quebec office.Site Reliability Engineer - Job Description:The Site Reliability Engineer is...


  • Toronto, Canada None Full time

    Job Title: Site Reliability Engineer (Python & Cloud)Location: Toronto, ONDuration: 6 months with high possibility of extensionSkills Required:Digital: PythonDigital: Google CloudDigital: Site Reliability Engineering (SRE)Job Description:Gather and analyze metrics from operating systems as well as applications to assist in performance tuning and fault...


  • Toronto, Canada Maneva Full time

    About ManevaManeva builds and deploys edge AI solutions powering real-time intelligence for industrial environments. Our systems run on distributed edge compute devices (NVIDIA Jetson platforms), integrate with local network cameras, PLCs, sensors, and other on-premise equipment, and securely communicate with cloud services via client- or site-based VPNs....


  • Toronto, Canada Maneva Full time

    About Maneva Maneva builds and deploys edge AI solutions powering real-time intelligence for industrial environments. Our systems run on distributed edge compute devices (NVIDIA Jetson platforms), integrate with local network cameras, PLCs, sensors, and other on-premise equipment, and securely communicate with cloud services via client- or site-based VPNs....


  • Toronto, Canada Maneva Full time

    About Maneva Maneva builds and deploys edge AI solutions powering real-time intelligence for industrial environments. Our systems run on distributed edge compute devices (NVIDIA Jetson platforms), integrate with local network cameras, PLCs, sensors, and other on-premise equipment, and securely communicate with cloud services via client- or site-based VPNs....


  • Toronto, Canada Maneva Full time

    About Maneva Maneva builds and deploys edge AI solutions powering real-time intelligence for industrial environments. Our systems run on distributed edge compute devices (NVIDIA Jetson platforms), integrate with local network cameras, PLCs, sensors, and other on-premise equipment, and securely communicate with cloud services via client- or site-based VPNs....


  • Toronto, Canada Maneva Full time

    About Maneva Maneva builds and deploys edge AI solutions powering real-time intelligence for industrial environments. Our systems run on distributed edge compute devices (NVIDIA Jetson platforms), integrate with local network cameras, PLCs, sensors, and other on-premise equipment, and securely communicate with cloud services via client- or site-based VPNs....


  • Toronto, Canada Aarorn Technologies Inc Full time

    Overview Job Title: Site Reliability Engineer Location: Toronto, ON (Hybrid - 4x Onsite a Week) Employment Type: Contract Opportunity Interview Type: Face to Face (Onsite Interview Only) Base pay range: CA$45.00/hr - CA$55.00/hr This opportunity is with Aarorn Technologies Inc. Your actual pay will be based on your skills and experience — talk with your...


  • Toronto, Canada Aarorn Technologies Inc Full time

    OverviewJob Title: Site Reliability EngineerLocation: Toronto, ON (Hybrid - 4x Onsite a Week)Employment Type: Contract OpportunityInterview Type: Face to Face (Onsite Interview Only)Base pay range: CA$45.00/hr - CA$55.00/hrThis opportunity is with Aarorn Technologies Inc. Your actual pay will be based on your skills and experience — talk with your...


  • Toronto, Ontario, Canada Maneva Full time US$80,000 - US$120,000 per year

    About ManevaManeva builds and deploys edge AI solutions powering real-time intelligence for industrial environments. Our systems run on distributed edge compute devices (NVIDIA Jetson platforms), integrate with local network cameras, PLCs, sensors, and other on-premise equipment, and securely communicate with cloud services via client- or site-based VPNs....