Staff Site Reliability Engineer

1 month ago


Montreal, Canada Lightspeed Full time

We’re looking for a Staff Site Reliability Engineer to join our NuOrder by Lightspeed team. NuORDER by Lightspeed builds software solutions that help merchants grow the size and profitability of their business. You'll join a team responsible for supporting the group in cross-cutting concerns, such as cloud infrastructure, reliability and incident management, data warehousing and analytics, cost transparency and efficiency, and much more. You will also be supporting our growing Dev teams with the infrastructure and tools needed to continue scaling. You will build and support multi-region infrastructures and networks, and help run our products in a reliable, efficient, and secure manner by implementing, advising, and advocating well-known DevOps principles.

Role:

  • Design, build, and maintain robust infrastructure on GCP, leveraging cloud-native technologies such as GKE, Cloud SQL, BigQuery, etc.
  • Develop and manage CI/CD pipelines for efficient deployment and release using various technologies (GitLab, GitHub, Helm, Terraform, etc.).
  • Work closely with development teams to provide tools and practices for monitoring software health in production, defining and measuring reliability metrics (SLI, SLO), and managing error budgets.
  • Build platform solutions and apply software engineering principles to improve software reliability and accelerate delivery.
  • Support the incident management process and conduct post-mortem analysis to prevent future outages.
  • Mentor junior SREs and developers, offering guidance on best practices in cloud architecture, data management, and software development.
  • Manage infrastructure changes through infrastructure as code (IaC) using Terraform.
  • Participate in the on-call rotation.
  • Stay current with industry trends and emerging technologies, advocating for the adoption of new technologies and practices to improve product quality and team efficiency.

What you need to bring:

  • Bachelor’s degree in Computer Science, Engineering, or equivalent real-world experience.
  • 6+ years of experience in site reliability engineering, systems administration, and/or software engineering.
  • Expertise in container orchestration platforms, specifically Kubernetes.
  • Strong understanding of both relational (e.g., PostgreSQL, MySQL) and NoSQL databases (e.g., MongoDB, Cassandra, Redis).
  • Familiarity with network protocols and IP networking, along with experience in network troubleshooting.
  • Proficiency in at least one programming language such as Bash, Python, Go, etc.
  • Proven track record of managing large-scale infrastructure in cloud environments like Google Cloud, AWS, or Azure.
  • Experience with monitoring tools (e.g., Prometheus, Grafana, Datadog) and logging solutions (e.g., ELK stack).
  • Strong understanding of security best practices.
  • Excellent problem-solving skills and the ability to work under pressure to troubleshoot and resolve complex issues.
  • Excellent communication skills for effective collaboration with cross-functional teams.
  • A keen eagerness to learn and embrace challenges.
#J-18808-Ljbffr

  • Montreal, Canada Lightspeed Commerce Full time

    Hi there! Thanks for stopping by We're looking for a Staff Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and profitability of their business. You'll join a team responsible for supporting the group in cross-cutting concerns, such as cloud...


  • Montreal, Canada Lightspeed Commerce Full time

    Hi there! Thanks for stopping by We're looking for a Staff Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and profitability of their business. You'll join a team responsible for supporting the group in cross-cutting concerns, such as cloud...


  • Montreal, Canada Lightspeed Full time

    Hi there! Thanks for stopping by Are you actively looking for a new opportunity? Or just checking the market? Well… you might just be in the right place! We’re looking for a Staff Site Reliability Engineer to join our NuOrder by Lightspeed team in North America NuORDER by Lightspeed builds software solutions that help merchants grow the size and...


  • Montreal, Canada Lightspeed Full time

    Hi there! Thanks for stopping by We’re looking for a Staff Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and profitability of their business. You'll join a team responsible for supporting the group in cross-cutting concerns, such as cloud...


  • Montreal, Canada Lightspeed Full time

    Hi there! Thanks for stopping by We’re looking for a Staff Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and profitability of their business. You'll join a team responsible for supporting the group in cross-cutting concerns, such as cloud...


  • Montreal, Canada Lightspeed Full time

    Hi there! Thanks for stopping by. Are you actively looking for a new opportunity? Or just checking the market? Well… you might just be in the right place! We’re looking for a Staff Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and the...


  • Montreal, Canada Lightspeed Full time

    Hi there! Thanks for stopping by. Are you actively looking for a new opportunity? Or just checking the market? Well… you might just be in the right place! We’re looking for a Staff Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and the...


  • Montreal, Canada Lightspeed Full time

    Hi there! Thanks for stopping by. Are you actively looking for a new opportunity? Or just checking the market? Well… you might just be in the right place! We’re looking for a Staff Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and the...


  • Montreal, Canada Lightspeed Full time

    Hi there! Thanks for stopping by We’re looking for a Staff Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and the profitability of their business. You'll join a team responsible for supporting the group in cross-cutting concerns, such as...


  • Montreal, Canada Lightspeed Full time

    Hi there! Thanks for stopping by We’re looking for a Staff Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and the profitability of their business. You'll join a team responsible for supporting the group in cross-cutting concerns, such as...


  • Montreal, Canada Lightspeed Full time

    We’re looking for a Staff Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and the profitability of their business. You'll join a team responsible for supporting the group in cross-cutting concerns, such as cloud infrastructure, reliability...


  • Montreal, Canada Lightspeed Full time

    We’re looking for a Staff Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and the profitability of their business. You'll join a team responsible for supporting the group in cross-cutting concerns, such as cloud infrastructure, reliability...


  • Montreal, Quebec, G4F, CA Lightspeed Commerce Full time

    Hi there! Thanks for stopping by We're looking for a Staff Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and profitability of their business. You'll join a team responsible for supporting the group in cross-cutting concerns, such as...


  • Montreal, Quebec, G4F, CA Lightspeed Full time

    Hi there! Thanks for stopping by We’re looking for a Staff Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and profitability of their business. You'll join a team responsible for supporting the group in cross-cutting concerns, such as...


  • Montreal, Quebec, G4F, CA Lightspeed Full time

    Hi there! Thanks for stopping by. Are you actively looking for a new opportunity? Or just checking the market? Well… you might just be in the right place! We’re looking for a Staff Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and the...


  • Montreal, Quebec, G4F, CA Lightspeed Full time

    Hi there! Thanks for stopping by We’re looking for a Staff Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and the profitability of their business. You'll join a team responsible for supporting the group in cross-cutting concerns, such...


  • Montreal, Quebec, G4F, CA Lightspeed Full time

    We’re looking for a Staff Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and the profitability of their business. You'll join a team responsible for supporting the group in cross-cutting concerns, such as cloud infrastructure,...


  • Montreal, Quebec, Canada Lyft Full time

    Lyft is a leading micromobility company dedicated to improving urban transportation systems worldwide. They are seeking a Site Reliability Engineer to join their expanding team. This individual will be responsible for designing, implementing, and maintaining the infrastructure systems to ensure reliability and scalability.Responsibilities:Assist in defining...


  • Montreal, Canada Caspian One Full time

    Join a dynamic team in Montreal as a Site Reliability Engineer (SRE) , where you will work in a hybrid environment alongside a passionate and innovative team. Key Responsibilities: System Design & Maintenance: Collaborate closely with engineering and development teams to design, build, and mainta


  • Montreal, Quebec, G4F, CA Lightspeed Full time

    We’re looking for a Staff Site Reliability Engineer to join our NuOrder by Lightspeed team. NuORDER by Lightspeed builds software solutions that help merchants grow the size and profitability of their business. You'll join a team responsible for supporting the group in cross-cutting concerns, such as cloud infrastructure, reliability and incident...