Site Reliability Engineer

6 hours ago


Toronto, Ontario, Canada Tecsys Inc. Full time
About Tecsys Inc.

Tecsys Inc. is a fast-growing innovator offering supply chain solutions to organizations ranging from industry-leading healthcare systems, hospitals, and pharmacy businesses to distributors, retailers, and 3PLs. We work with industry leaders to transform their supply chains through technology.

About the Role

We are seeking a highly skilled Site Reliability Engineer to join our Network and Security Operations Center department. Our NOC team is focused on improving the reliability and uptime of our platform and applications in a data-driven way to support internal and external customers' needs.

Key Responsibilities
  • Collaborate with Engineering teams to support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews.
  • Maintain services once they are live by measuring and monitoring availability, latency and overall system health.
  • Develop tools & automation on top of Azure & AWS to continuously reduce the need for manual intervention.
  • Scale systems sustainably through automation and evolve systems by pushing for changes that improve reliability and velocity.
  • Be on-call.
  • Practice sustainable incident response and blameless postmortems.
  • Implement automated solutions for continuous integration and delivery (CI / CD).
  • Implement monitoring, logging, alerting, and SLA reporting.
  • Implement service monitoring dashboards displaying key metrics.
  • Create and maintain technical documentation.
  • Apply SRE best practices.
  • Take command of high-severity incidents and facilitate their resolution.
  • Provide support for our planning and deployment teams to enable stability, predictability, and scale in our continued growth.
  • Collaborate with members of the Platform Engineering team to implement and support far-reaching strategic efforts, provide constructive feedback, and foster a collaborative environment.
  • Work cross-functionally with internal teams and vendors to manage our growth around the globe, with a strong focus on maintaining the high level of performance, availability, and reliability for our users.
Requirements
  • Bachelor's degree in computer science or related technical discipline.
  • At least 5 years of systems engineering experience; demonstrable technical experience in new platform development, orchestration, product ownership, and iterative design and deployment.
  • Experience designing and deploying large-scale systems, multi-vendor platforms and globally distributed infrastructure.
  • Strong knowledge of system design; high performance computing; file, block, and storage technologies; integration of compute, storage, and network technologies to deliver cohesive infrastructure solutions.
  • High level of understanding of, and examples of executing, projects with full-stack automation. Our scale is going to require a lot of it: as we grow, we aim to rely less on manual intervention and to work with both internal and open-source tools to automate day-to-day activities.
  • Self-organize, collaborate, and manage efforts with peers and teams across responsibility areas, languages, geography, and time zones.
  • Be a self-starter, curious, and not afraid to ask questions and challenge the way things are done today.
  • See a problem or opportunity, take ownership and act on it independently.
  • Knowledge of Datadog preferred (or at least a similar or equivalent product).
  • Knowledge of Rapid7 Insight preferred (or at least a similar or equivalent product).
  • Knowledge and experience of AWS or Azure required.
  • Basic knowledge of Java- or .NET-based development required.
  • Knowledge of GitLab (enterprise license) preferred (or at minimum, Jenkins required)
  • Experience with a SaaS company is a strong asset.
  • Strong English communication skills, both written and spoken, are essential for effective correspondence with customers, business partners and colleagues beyond the province of Quebec.
Additional Requirements
  • Escalation on-call rotation
  • Occasional travel (quarterly offsites, conferences – less than 10%)

We are committed to fostering a diverse and inclusive workplace where all employees feel valued, respected, and empowered. We believe that diversity drives innovation and strengthens our ability to deliver exceptional solutions. We welcome and encourage applicants from all backgrounds, experiences, and perspectives to join our team.

Tecsys Inc. is an equal opportunity employer. Accommodation is available for applicants selected for an interview.

NB: To apply for this position, you must be a Canadian citizen or a permanent resident of Canada, or hold a valid Canadian work permit.



  • Toronto, Ontario, Canada Bourse de Montreal Inc. Full time

    Job Title: Site Reliability Engineer. At Bourse de Montreal Inc., we're seeking a highly skilled Site Reliability Engineer to join our Global Technology Services (GTS) team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our technology infrastructure. Key Responsibilities: Evaluate new...


  • Toronto, Ontario, Canada ISG Search Inc Full time

    We are seeking a highly skilled Site Reliability Engineer to support our various platforms on a 6-month contract basis. This role involves database administration, automation, and troubleshooting of various investment applications in a fast-paced, global environment. Key Responsibilities: Support and manage investment application systems, databases (MS SQL,...


  • Toronto, Ontario, Canada Rogers Full time

    About the Role: Rogers Sports & Media is seeking a skilled Site Reliability Engineer to build and maintain a robust monitoring system for its cloud and on-prem systems. As a key player in delivering Canadian audiences a diverse content portfolio, you'll be at the forefront of technology innovation in the media space. Key Responsibilities: Design, develop, and...


  • Toronto, Ontario, Canada Northbridge Financial Corporation Full time

    Overview of the Senior Site Reliability Engineer Role at Northbridge Financial Corporation: The Senior Site Reliability Engineer is responsible for the development and execution of Service Level Objectives (SLOs). This role involves managing complex service reliability solutions and processes, as well as mentoring and guiding junior SREs. Key...


  • Toronto, Ontario, Canada Northbridge Financial Corporation Full time

    Overview of the Senior Site Reliability Engineer Role at Northbridge Financial Corporation: The Senior Site Reliability Engineer is responsible for the establishment and execution of Service Level Objectives (SLOs). This role involves managing complex service reliability solutions and processes, while also providing mentorship and guidance to junior...


  • Toronto, Ontario, Canada Lightspeed Restaurant Full time

    Lead Site Reliability Engineer at Lightspeed Restaurant. We are seeking a skilled Lead Site Reliability Engineer to become a vital part of our Lightspeed Restaurant team. Our mission is to create innovative software solutions that empower restaurants to enhance their operational efficiency and profitability. In the role of Lead Site Reliability Engineer, you...


  • Toronto, Ontario, Canada Bold Commerce Full time

    Who is Bold Commerce? Bold Commerce powers personalized checkout experiences for leading omnichannel retailers and direct-to-consumer brands. As a leader in the composable commerce space, Bold makes checkout better, boosting profitability by enabling personalized, customer-specific checkout flows designed to increase the Checkout Power Trio of...


  • Toronto, Ontario, Canada Northbridge Financial Corporation Full time

    Overview of the Senior Site Reliability Engineer Role at Northbridge Financial Corporation: The Senior Site Reliability Engineer is responsible for the establishment and execution of Service Level Objectives (SLOs). This role involves managing service reliability solutions and processes of increasing intricacy, along with mentoring and guiding junior...


  • Toronto, Ontario, Canada CIRCLE Full time

    About Circle: Circle is a pioneering financial technology firm positioned at the forefront of the evolving digital economy, where value can traverse globally, almost instantaneously, and at a lower cost compared to traditional settlement systems. This innovative layer of the internet unveils extraordinary opportunities for transactions, commerce, and...


  • Toronto, Ontario, Canada Thomson Reuters Full time

    About the Role: This is an exciting opportunity to join our team as a Lead Site Reliability Engineer at Thomson Reuters. As a key member of our engineering team, you will be responsible for leading and mentoring a team of SREs, providing technical guidance, coaching, and support to foster a culture of collaboration, innovation, and continuous improvement. Key...


  • Toronto, Ontario, Canada ISG Search Inc Full time

    We are seeking a skilled Site Reliability Engineer to support our various platforms on a 6-month contract basis. This role involves database administration, automation, and troubleshooting of various investment applications in a fast-paced, global environment. Key Responsibilities: Support and manage investment application systems, databases (MS SQL, Oracle),...


  • Toronto, Ontario, Canada CIRCLE Full time

    About Circle: Circle operates at the forefront of financial technology, revolutionizing the way value is exchanged globally. Our innovative platform enables transactions to occur swiftly and cost-effectively, paving the way for a new era in commerce and finance. We are dedicated to enhancing economic prosperity and promoting inclusivity through our...


  • Toronto, Ontario, Canada Lightspeed Full time

    Welcome to Lightspeed! Are you exploring new career avenues? You may find an exciting opportunity here. We are seeking a Senior Site Reliability Engineer to enhance our operations at Lightspeed. Our team is dedicated to developing software solutions that empower merchants to expand their business effectively. In this role, you will be instrumental in...


  • Toronto, Ontario, Canada Lightspeed Full time

    Welcome to Lightspeed! Are you exploring new career paths or simply assessing the job market? You may find the opportunity you're looking for here. We are in search of a Senior Site Reliability Engineer to enhance our NuORDER by Lightspeed team in North America. NuORDER by Lightspeed develops innovative software solutions that empower merchants to...


  • Toronto, Ontario, Canada Lightspeed Full time

    Welcome to Lightspeed! Are you exploring new career paths or simply surveying the job market? You may find an exciting opportunity here. We are in search of a Senior Site Reliability Engineer to enhance our NuORDER by Lightspeed division in North America. NuORDER by Lightspeed develops innovative software solutions aimed at empowering merchants to...