Staff Site Reliability Expert

3 weeks ago


Toronto, Canada Lightspeed Commerce Full time

**Hi there Thanks for stopping by**

Are you actively looking for a new opportunity? Or just checking the market? Wellyou might just be in the right place

Lightspeed Hospitality makes point of sale solutions for restaurants and cafes around the world. As part of this global team of about 150 people, you will work on our flagship K-Series product, while keeping the lights on for one of our smaller regional offerings: U/L/G/O-Series.

Our SRE team is responsible for the design, build and operation of Lightspeed's product infrastructure. We collaborate with teams across the company to make this happen: dev's, QA, PM's, architects, IT, Data team,

**What you'll be responsible for**:

- Initiate and contribute to continuous improvement of our software delivery processes and practices in a multi-location, multidisciplinary team to empower and accelerate product development
- Use automation extensively to design, configure, manage, and monitor systems in support of our product development teams
- Design and architect operational solutions with the specific goal of increasing the standardization, automation, repeatability, cost-efficiency and consistency of operational tasks
- Working with developers and other SRE to design and build scalable, reliable and cost-efficient Cloud infrastructure
- Adhere to and advocate for best practices, including Infrastructure as Code, monitoring, high availability, disaster recovery, security, and DevOps methodologies
- Provide timely assistance and remediation solutions during critical situations and production incidents to help resolve service problems (You will be on call for periods of time)

**What you'll be bringing to the team**:

- Expert knowledge of Amazon Web Services
- Expert experience with Docker, Kubernetes & Linux Systems
- Strong experience with configuration management tools such as Chef, Puppet, Ansible, Salt
- Strong experience with Infrastructure as code practices: we use Terraform
- Strong experience with datastores: MySQL, ElasticSearch, Kafka, DynamoDB,
- Ability to read & write complex scripts using Shell
- Ability to read & write code in at least one programming language: Python, Ruby, Go,
- Good understanding of Agile development and continuous delivery best practices, software engineering tools, processes, methods and testing
- Ability to partner effectively with other teams
- Ability to plan, organize, prioritize and stay focused
- Strong experience provisioning and managing infrastructures with high availability constraints
- Strong experience with cloud cost optimization

**Who you are**:

- You are a problem solvers who does not shy away from tackling complexity and critical thinking
- You have a strong will to learn, grow and get out of your comfort zone
- You have great energy and passion for technology
- You are able to express yourself flawlessly in English
- You have strong interpersonal skills

**And what about the rest?**:

- Lots of autonomy, flexible work culture and possibility of remote work
- Development of high traffic products, used at the global scale
- Exposure to modern and proven technology
- Opportunity to learn and expand your skill set
- Tons of growth opportunities into technical or people management roles
- Amazing benefits & perks, including equity for all Lightspeeders
- Opportunity to join a fast-paced, high-growth company
- Become a valued part of the diverse and inclusive Lightspeed family

**Where to from here?**
Obviously, this has to be mutually beneficial: we want you to step into a role you love, and we want to offer you a place you're proud to come to every day. For a glimpse into our world check out our career page here.

Lightspeed is building communities through commerce, and we need people from all backgrounds and lived experiences to do that. We were founded in 2005, in Montreal's gay village and our original members were all part of the LGBTQ+ community. The ethos of our business has been about inclusion from the very beginning, and we strive to provide a workplace where everyone belongs.

**Who we are**:
Powering the businesses that are the backbone of the global economy, Lightspeed's one-stop commerce platform helps merchants innovate to simplify, scale, and provide exceptional customer experiences. Our cloud commerce solution transforms and unifies online and physical operations, multichannel sales, expansion to new locations, global payments, financial solutions, and connection to supplier networks.
- Founded in Montréal, Canada in 2005, Lightspeed is dual-listed on the New York Stock Exchange (NYSE: LSPD) and Toronto Stock Exchange (TSX: LSPD). With teams across North America, Europe, and Asia Pacific, the company serves retail, hospitality, and golf businesses in over 100 countries.

Lightspeed handles your information in accordance with our Applicant Privacy Statement.



  • Toronto, Canada Pinterest Full time

    About Pinterest: Millions of people across the world come to Pinterest to find new ideas every day. It’s where they get inspiration, dream about new possibilities and plan for what matters most. Our mission is to help those people find their inspiration and create a life they love. In your role, you’ll be challenged to take on work that upholds this...


  • Toronto, ON, Canada Hour Consulting Full time

    Our client, a fast growing Fintech Startup is on a mission to redefine how to protect user identity, providing users secure control over personal information through a privacy compliant network. This approach creates higher customer interaction and sales conversions, while improving overall security for both customers and businesses. They are a...


  • Toronto, Canada CB Canada Full time

    Site Reliability Engineer On behalf of our client in the Banking Sector, PROCOM is looking for a Site Reliability Engineer. Site Reliability Engineer – Job Description Azure cloud Jira and confluence CICD Experience with automating (provisioning, configuration management, deployment) and integrating Azure PaaS solutions (Azure App services, Azure...

  • Site Superintendent

    4 days ago


    Toronto, Canada O.N. Site Construction Full time

    Hiring the best people is our main focus, as a result we are always looking for great Site Superintendents and Skilled Labourers to add to our team. **Description**: Since 2005, O.N. Site Construction Inc., a medium sized, service oriented company, has worked hard to build a reputation on quality workmanship and unprecedented customer service in the general...


  • Old Toronto, Canada CB Canada Full time

    Site Reliability Engineer On behalf of our client in the Banking Sector, PROCOM is looking for a Site Reliability Engineer. Site Reliability Engineer – Job Description Azure cloud Jira and confluence CICD Experience with automating (provisioning, configuration management, deployment) and integrating Azure PaaS solutions (Azure App services, Azure...


  • Toronto, ON, Canada EQ Bank | Equitable Bank Full time

    Being a traditional bank just isn’t our thing. We are big believers in innovating the banking experience because we believe Canadians deserve better options, and we challenge ourselves and our teams to creatively transform what’s possible in banking. Our team is made up of inquisitive and agile minds that find smarter ways of doing things. Overall we...


  • Old Toronto, Canada Lightspeed Restaurant Full time

    Data is the new Gold!!! We are here to help our data teams build and maintain the data and AI infrastructure platform, and the governance framework needed for having data flowing everywhere at Lightspeed. Data security, reliability, and high availability are our mojo. Role: Collaborate seamlessly with cross-functional data teams to craft and deploy cloud...


  • Old Toronto, Canada Lightspeed Restaurant Full time

    Data is the new Gold!!! We are here to help our data teams build and maintain the data and AI infrastructure platform, and the governance framework needed for having data flowing everywhere at Lightspeed. Data security, reliability, and high availability are our mojo. Role: Collaborate seamlessly with cross-functional data teams to craft and deploy cloud...


  • Toronto, Canada Equitable Bank Full time

    The WorkDesign, develop, and implement Java-based solutions using microservices architecture.Deploy and maintain digital platforms on the cloud, ensuring high availability and scalability.Collaborate with cross-functional teams to integrate numerous services and ensure seamless delivery.Be a functional leader in the team, guiding the team with the best...


  • Toronto, ON, Canada EQ Bank | Equitable Bank Full time

    Join a Challenger Being a traditional bank just isn’t our thing. We are big believers in innovating the banking experience because we believe Canadians deserve better options, and we challenge ourselves and our teams to creatively transform what’s possible in banking. Our team is made up of inquisitive and agile minds that find smarter ways of doing...


  • Old Toronto, Canada EQ Bank | Equitable Bank Full time

    Join a ChallengerBeing a traditional bank just isn’t our thing. We are big believers in innovating the banking experience because we believe Canadians deserve better options, and we challenge ourselves and our teams to creatively transform what’s possible in banking. Our team is made up of inquisitive and agile minds that find smarter ways of doing...


  • Toronto, Canada BMO Full time

    Application Deadline: 04/29/2024Address:33 Dundas Street WestThis role is Hybrid (1-2 days per week in the office)The Director - Site Reliability Engineering will lead a team that will work with application teams, infrastructure teams, and business partners to continuously improve the stability, reliability and efficiency of Finance and Enterprise Risk...


  • Toronto, Ontario, Canada Zortech Solutions Full time

    Hi,Hope you are doing GreatThis side Priya Rajput from Zortech Solutions trying to reach you for an exciting job opening, kindly have a look to job description and revert me with your positive feedback. My mail ID is or call me on .Role: Site Reliability EngineerLocation: Toronto, ON-OnsiteDuration: Fulltime PermanentSkills and Responsibilities:...


  • Toronto, Canada Equitable Bank Full time

    The WorkDesign, develop, and implement Java-based solutions using microservices architecture.Deploy and maintain digital platforms on the cloud, ensuring high availability and scalability.Collaborate with cross-functional teams to integrate numerous services and ensure seamless delivery.Be a functional leader in the team, guiding the team with the best...


  • Toronto, Canada eTeam Full time

    Remote work Duration - 4 months - Preference is to find candidates who are willing to be converted to full time employee . The conversion decision will be made based on performance. Job description - ::: Role Desc : Defining and measuring reliability goals—SLIs, SLOs, and error budgets for user journey Designing for and implementing...


  • Old Toronto, Canada eTeam Full time

    Remote Work Duration 4 months - Preference is to find candidates who are willing to be converted to full-time employees. The conversion decision will be made based on performance. Job Description Role Description: Defining and measuring reliability goals—SLIs, SLOs, and error budgets for user journey. Designing for and implementing observability (ELK,...


  • Toronto, Canada OnX Canada Full time

    OnX is looking for a Site Reliability Engineer for one our clients in Toronto. Client: Financial Services Location: Toronto, mostly remote Duration: 6 months with potential extension JBoss in middleware experience is super important Responsibilities: Following the senior technicians plans to build out lower environments with functioning software stacks...


  • Toronto, Canada OnX Canada Full time

    OnX is looking for a Site Reliability Engineer for one our clients in Toronto. Client: Financial Services Location: Toronto, mostly remote Duration: 6 months with potential extension JBoss in middleware experience is super important Responsibilities: Following the senior technicians plans to build out lower environments with functioning software stacks...


  • Toronto, Canada OnX Canada Full time

    OnX is looking for a Site Reliability Engineer for one our clients in Toronto.Client: Financial Services Location: Toronto, mostly remote Duration: 6 months with potential extensionJBoss in middleware experience is super importantResponsibilities: Following the senior technicians plans to build out lower environments with functioning software stacks...


  • Toronto, Canada Autodesk Full time

    Position Overview Autodesk, the leading Design and Make Software Company, is looking for a Principal Site Reliability Engineer to join the Autodesk Platform Services Engineering team in Toronto, Canada. On this position, you will help build trusted services of APS (Autodesk Platform Services) as measured by Service Level Objectives (SLOs) and Mean...