Current jobs related to Principal Site Reliability Engineer - Montreal, Quebec - Lightspeed


  • Montreal, Quebec, Canada Axelon Services Corporation Full time

    Job Title:Site Reliability EngineerAbout the Role:We are seeking a skilled Site Reliability Engineer to join our Application Infrastructure team. As a Site Reliability Engineer, you will play a critical role in driving the reliability engineering, operations, and customer support services for our ServiceNow SaaS implementation.Key Responsibilities:Deliver...


  • Montreal, Quebec, Canada Axelon Services Corporation Full time

    Job Title:Site Reliability EngineerAbout the Role:We are seeking a skilled Site Reliability Engineer to join our Application Infrastructure team. As a Site Reliability Engineer, you will play a critical role in driving the reliability engineering, operations, and customer support services for our ServiceNow SaaS implementation.Key Responsibilities:Deliver...


  • Montreal, Quebec, Canada Lightspeed Full time

    Overview:Thank you for your interest. Are you exploring new career opportunities? You may find what you're looking for here.We are seeking a Principal Site Reliability Engineer to be part of our team at Lightspeed. Our company develops innovative software solutions that assist merchants in enhancing their business growth and profitability. In this role, you...


  • Montreal, Quebec, Canada Axelon Services Corporation Full time

    {"title": "Site Reliability Engineer", "description": "Job SummaryWe are seeking a skilled Site Reliability Engineer to join our Application Infrastructure team. As a Site Reliability Engineer, you will be responsible for delivering reliable and resilient systems without wasteful operational effort.Key ResponsibilitiesDelivery of improvements that will...


  • Montreal, Quebec, Canada Axelon Services Corporation Full time

    {"title": "Site Reliability Engineer", "description": "Job SummaryWe are seeking a skilled Site Reliability Engineer to join our Application Infrastructure team. As a Site Reliability Engineer, you will be responsible for delivering reliable and resilient systems without wasteful operational effort.Key ResponsibilitiesDelivery of improvements that will...


  • Montreal, Quebec, Canada Lightspeed Full time

    About the Role:We are seeking a Principal Site Reliability Engineer to become an integral part of our innovative team at Lightspeed. Our organization develops advanced software solutions designed to enhance the growth and profitability of businesses. In this role, you will focus on critical aspects such as cloud infrastructure, operational reliability,...


  • Montreal, Quebec, Canada Lightspeed Full time

    Welcome to Lightspeed!Are you exploring new career paths or simply assessing the job market? You may have found the perfect opportunity.We are in search of a Principal Site Reliability Engineer to become a vital part of our NuOrder by Lightspeed team in North America. Our company develops innovative software solutions designed to enhance merchants' business...


  • Montreal, Quebec, Canada Lightspeed Full time

    Welcome to Lightspeed!We are on the lookout for a Principal Site Reliability Engineer to enhance our NuOrder by Lightspeed division in North America. Our team is dedicated to developing software solutions that empower merchants to expand their business profitability and reach. In this role, you will be pivotal in addressing essential aspects such as cloud...


  • Montreal, Quebec, Canada Axelon Services Corporation Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Axelon Services Corporation. As a key member of our HashiVault squad, you will play a critical role in ensuring the reliability and scalability of our cloud-based services.Key ResponsibilitiesDesign and implement automation scripts to streamline service delivery and...


  • Montreal, Quebec, Canada Axelon Services Corporation Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Axelon Services Corporation. As a key member of our HashiVault squad, you will play a critical role in ensuring the reliability and scalability of our cloud-based services.Key ResponsibilitiesDesign and implement automation scripts to streamline service delivery and...


  • Montreal, Quebec, Canada Lyft Full time

    Lyft is a leading micromobility company dedicated to improving urban transportation systems worldwide. They are seeking a Site Reliability Engineer to join their expanding team. This individual will be responsible for designing, implementing, and maintaining the infrastructure systems to ensure reliability and scalability.Responsibilities:Assist in defining...


  • Montreal, Quebec, Canada Axelon Services Corporation Full time

    {"Job Title:Site Reliability Engineer (SRE)Montreal QC12 MonthsThe ideal candidate would have at least one of:ServiceNow administration or development experience, orSoftware development skills in one or more programming language, e.g. PythonThe Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to help drive the...


  • Montreal, Quebec, Canada Axelon Services Corporation Full time

    {"Job Title:Site Reliability Engineer (SRE)Montreal QC12 MonthsThe ideal candidate would have at least one of:ServiceNow administration or development experience, orSoftware development skills in one or more programming language, e.g. PythonThe Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to help drive the...


  • Montreal, Quebec, Canada Alltech Consulting Services Full time

    Job DescriptionWe are seeking a highly skilled Site Reliability Engineer to join our team at Alltech Consulting Services. As a key member of our HashiVault squad, you will play a critical role in ensuring the smooth operation of our services and delivering a superior user experience to our clients.About the RoleAs a Site Reliability Engineer, you will be...


  • Montreal, Quebec, Canada Alltech Consulting Services Full time

    Job DescriptionWe are seeking a highly skilled Site Reliability Engineer to join our team at Alltech Consulting Services. As a key member of our HashiVault squad, you will play a critical role in ensuring the smooth operation of our services and delivering a superior user experience to our clients.About the RoleAs a Site Reliability Engineer, you will be...


  • Montreal, Quebec, Canada Axelon Services Corporation Full time

    {"title": "Site Reliability Engineer", "description": "About the RoleWe are seeking a skilled Site Reliability Engineer to join our HashiVault squad. As an SRE, you will be responsible for implementing new features, dealing with user requests, and reducing repeatable tasks to allow more time for strategic initiatives.About the Company*** is a leading global...


  • Montreal, Quebec, Canada Axelon Services Corporation Full time

    {"title": "Site Reliability Engineer", "description": "About the RoleWe are seeking a skilled Site Reliability Engineer to join our HashiVault squad. As an SRE, you will be responsible for implementing new features, dealing with user requests, and reducing repeatable tasks to allow more time for strategic initiatives.About the Company*** is a leading global...


  • Montreal, Quebec, Canada Lyft Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Lyft in Montreal. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our production systems, platforms, and tools.Key ResponsibilitiesDefine the team's roadmap and architecture based on technological and...


  • Montreal, Quebec, Canada Lyft Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Lyft in Montreal. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our production systems, platforms, and tools.Key ResponsibilitiesDefine the team's roadmap and architecture based on technological and...


  • Montreal, Quebec, Canada Lyft Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Lyft in Montreal. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our production systems, platforms, and tools.Key ResponsibilitiesDefine the team's roadmap and architecture based on technological and...

Principal Site Reliability Engineer

3 months ago


Montreal, Quebec, Canada Lightspeed Full time

Welcome to NuOrder by Lightspeed

Are you actively looking for a new opportunity? Or just checking the market? Well... you might just be in the right place We're looking for a Principal Site Reliability Engineer to join our NuOrder by Lightspeed team in North America.

NuORDER by Lightspeed builds software solutions that help merchants grow the size and the profitability of their business. You'll join a team responsible for supporting the group in cross-cutting concerns, such as:

  1. Cloud infrastructure
  2. Reliability and incident management
  3. Data warehousing and analytics
  4. Cost transparency and efficiency

You will also be supporting our growing Dev teams with the infrastructure and tools needed to continue scaling. You will build and support multi-region infrastructures and networks, and help run our products in a reliable, efficient and secure manner by implementing, advising and advocating the well-known DevOps principles.

What you'll be doing:
  1. Work closely with development teams to empower them with the necessary tools and practices for monitoring software health in production, defining and measuring reliability metrics (SLI, SLO), and managing error budgets.
  2. Design, build and maintain robust infrastructure built upon GCP, leveraging cloud native technologies such as GKE, Cloud SQL, BigQuery, etc.
  3. Develop and manage CI/CD pipelines for efficient deployment and release using a number of technologies (GitLab, Github, Helm, Terraform, etc.).
  4. Drive incident management process and conduct post-mortem analysis to prevent future outages.
  5. Mentor junior SREs and developers, providing guidance on best practices in cloud architecture, data management, and software development.
  6. Conduct system performance benchmarks and implement enhancements to improve system reliability and throughput.
  7. Collaborate with cross-functional teams to identify, design, and implement internal process improvements in a cost-efficient manner.
  8. Design and build robust, scalable, and highly available systems.
  9. Build platform solutions and apply software engineering principles to improve the reliability of our software and accelerate software delivery.
  10. Manage infrastructure change through infrastructure as code (IaC).
  11. Be part of our on-call rotation.
  12. Stay current with industry trends and emerging technologies, advocating for the adoption of new technologies and practices that improve product quality and team efficiency.
What you need to bring:
  1. Bachelor's degree in Computer Science, Engineering, or possess a related level of real-world experience.
  2. 9-10+ years of experience across site reliability engineering, systems administration, and/or software engineering.
  3. Strong expertise in container orchestration platforms, specifically Kubernetes.
  4. Strong understanding of both relational (e.g., PostgreSQL, MySQL) and NoSQL databases (e.g., MongoDB, Cassandra, Redis).
  5. Deep understanding of network protocols and IP networking, as well as experience with network troubleshooting.
  6. Proficiency in programming languages such as Java, Python, Go, etc.
  7. Proven track record of managing large-scale infrastructure in cloud environments, such as Google Cloud, AWS or Azure.
  8. Experience with monitoring tools (e.g., Prometheus, Grafana, Datadog) and logging solutions (e.g., ELK stack).
  9. Strong understanding of security best practices.
  10. Exceptional problem-solving skills and the ability to work under pressure to troubleshoot and resolve complex issues.
  11. Excellent communication skills to effectively collaborate with cross-functional teams.
  12. Strong leadership skills, capable of leading projects and influencing engineering decisions across the organization.

We know that people are more than what's on their CV. If you're unsure that you have the right profile for the role... hit the 'Apply' button and give it a try

What's in it for you?

Come live the Lightspeed experience...

Ability to do your job in a truly flexible environment;

Genuine career opportunities in a company that's creating new jobs everyday;

Work in a team big enough for growth but lean enough to make a real impact.

... and enjoy a range of benefits that'll keep you happy, healthy and (not) hungry:

  • Lightspeed share scheme (we are all owners)
  • Lightspeed RSU program (we are all owners)
  • Unlimited paid time off policy
  • Flexible working policy
  • Health insurance
  • Health and wellness benefits
  • Paid leave assistance for new parents
  • Linkedin learning
  • Volunteer day

#J-18808-Ljbffr