Principal Site Reliability Engineer

1 month ago


Montreal, Quebec, G4F, CA Lightspeed Full time

```html

Hi there! Thanks for stopping by. Are you actively looking for a new opportunity? Or just checking the market? Well… you might just be in the right place!

We’re looking for a Principal Site Reliability Engineer to join our NuORDER by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and profitability of their business. You'll join a team responsible for supporting the group in cross-cutting concerns such as cloud infrastructure, reliability and incident management, data warehousing and analytics, and cost transparency and efficiency. You will also support our growing Dev teams with the infrastructure and tools they need to continue scaling. You will build and support multi-region infrastructure and networks, and help run our products in a reliable, efficient and secure manner by implementing, advising on and advocating for well-known DevOps principles.

What you’ll be doing:
  1. Work closely with development teams to empower them with the necessary tools and practices for monitoring software health in production, defining and measuring reliability metrics (SLI, SLO), and managing error budgets.
  2. Design, build and maintain robust infrastructure built upon GCP, leveraging cloud native technologies such as GKE, Cloud SQL, BigQuery, etc.
  3. Develop and manage CI/CD pipelines for efficient deployment and release using a number of technologies (GitLab, GitHub, Helm, Terraform, etc.).
  4. Drive the incident management process and conduct post-mortem analysis to prevent future outages.
  5. Mentor junior SREs and developers, providing guidance on best practices in cloud architecture, data management, and software development.
  6. Conduct system performance benchmarks and implement enhancements to improve system reliability and throughput.
  7. Collaborate with cross-functional teams to identify, design, and implement internal process improvements in a cost-efficient manner.
  8. Design and build robust, scalable, and highly available systems.
  9. Build platform solutions and apply software engineering principles to improve the reliability of our software and accelerate software delivery.
  10. Manage infrastructure change through infrastructure as code (IaC).
  11. Be part of our on-call rotation.
  12. Stay current with industry trends and emerging technologies, advocating for the adoption of new technologies and practices that improve product quality and team efficiency.
What you need to bring:
  1. Bachelor’s degree in Computer Science or Engineering, or an equivalent level of real-world experience.
  2. 9-10+ years of experience across site reliability engineering, systems administration, and/or software engineering.
  3. Strong expertise in container orchestration platforms, specifically Kubernetes.
  4. Strong understanding of both relational (e.g., PostgreSQL, MySQL) and NoSQL databases (e.g., MongoDB, Cassandra, Redis).
  5. Deep understanding of network protocols and IP networking, as well as experience with network troubleshooting.
  6. Proficiency in programming languages such as Java, Python, Go, etc.
  7. Proven track record of managing large-scale infrastructure in cloud environments, such as Google Cloud, AWS, or Azure.
  8. Experience with monitoring tools (e.g., Prometheus, Grafana, Datadog) and logging solutions (e.g., ELK stack).
  9. Strong understanding of security best practices.
  10. Exceptional problem-solving skills and the ability to work under pressure to troubleshoot and resolve complex issues.
  11. Excellent communication skills to effectively collaborate with cross-functional teams.
  12. Strong leadership skills, capable of leading projects and influencing engineering decisions across the organization.

We know that people are more than what’s on their CV. If you’re unsure whether you have the right profile for the role, hit the ‘Apply’ button and give it a try.

What’s in it for you?

Come live the Lightspeed experience

  • Ability to do your job in a truly flexible environment;
  • Genuine career opportunities in a company that’s creating new jobs every day;
  • Work in a team big enough for growth but lean enough to make a real impact.

… and enjoy a range of benefits that’ll keep you happy, healthy and (not) hungry:

  • Lightspeed share scheme (we are all owners)
  • Lightspeed RSU program (we are all owners)
  • Unlimited paid time off policy
  • Flexible working policy
  • Health insurance
  • Health and wellness benefits
  • Paid leave assistance for new parents
  • LinkedIn learning
  • Volunteer day
```

  • Montreal, Quebec, G4F, CA Lightspeed Full time

    Hi there! Thanks for stopping by. Are you actively looking for a new opportunity? Or just checking the market? Well… you might just be in the right place! We’re looking for a Principal Site Reliability Engineer to join our NuORDER by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and...


  • Montreal, Quebec, G4F, CA SAP Full time

    We help the world run better. At SAP, we enable you to bring out your best. Our company culture is focused on collaboration and a shared passion to help the world run better. We offer a highly collaborative, caring team environment with a strong focus on learning and development, recognition for your individual contributions, and a variety of benefit...


  • Montreal, Quebec, G4F, CA Socotra, Inc. Full time

    At Lyft, our mission is to improve people’s lives with the world’s best transportation. Imagine cities where streets are safe, communities thrive, and personal cars are a thing of the past. We envision a future where shared and active transportation modes are the norm, fostering vibrant, connected neighborhoods. As a leader in micromobility, Lyft powers...


  • Montreal, Quebec, G4F, CA SAP SE Full time

    We help the world run better. At SAP, we enable you to bring out your best. Our company culture is focused on collaboration and a shared passion to help the world run better. We focus every day on building the foundation for tomorrow and creating a workplace that embraces differences, values flexibility, and is aligned to our purpose-driven and future-focused...


  • Montreal, Quebec, G4F, CA Lightspeed Commerce Full time

    Hi there! Thanks for stopping by. We're looking for a Staff Site Reliability Engineer to join our NuORDER by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and profitability of their business. You'll join a team responsible for supporting the group in cross-cutting concerns, such as...


  • Montreal, Quebec, G4F, CA Lightspeed Full time

    Hi there! Thanks for stopping by. We’re looking for a Staff Site Reliability Engineer to join our NuORDER by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and profitability of their business. You'll join a team responsible for supporting the group in cross-cutting concerns, such as...


  • Montreal, Quebec, G4F, CA Lightspeed Full time

    Hi there! Thanks for stopping by. Are you actively looking for a new opportunity? Or just checking the market? Well… you might just be in the right place! We’re looking for a Staff Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and the...


  • Montreal, Quebec, G4F, CA Lightspeed Full time

    Hi there! Thanks for stopping by. We’re looking for a Staff Site Reliability Engineer to join our NuORDER by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and the profitability of their business. You'll join a team responsible for supporting the group in cross-cutting concerns, such...


  • Montreal, Quebec, G4F, CA Lightspeed Full time

    We’re looking for a Staff Site Reliability Engineer to join our NuOrder by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and the profitability of their business. You'll join a team responsible for supporting the group in cross-cutting concerns, such as cloud infrastructure,...


  • Montreal, Quebec, G4F, CA Lightspeed Full time

    We’re looking for a Staff Site Reliability Engineer to join our NuOrder by Lightspeed team. NuORDER by Lightspeed builds software solutions that help merchants grow the size and profitability of their business. You'll join a team responsible for supporting the group in cross-cutting concerns, such as cloud infrastructure, reliability and incident...


  • Montreal, Quebec, G4F, CA Unity Full time

    The opportunity: At Unity, we are the world's leading platform for creating and operating real-time 3D (RT3D) content. We deeply understand the critical importance of reliability in today's fast-paced digital world. Our infrastructure, systems, and applications play a pivotal role in delivering seamless experiences to our customers. Our team of Site...


  • Montreal, Quebec, G4F, CA Banque Nationale du Canada Full time

    Site Reliability Engineering Developer SRE. Hybrid. Job Number: 21829. Category: Senior Professional. Status: Permanent. Schedule: Full-Time. Area of Interest: Information technology. A career in technology at National Bank means participating in the transformation to have a direct impact on the client. As a System Reliability Specialist, you will be responsible for...


  • Montreal, Quebec, G4F, CA Banque Nationale du Canada Part time

    Site Reliability Engineering Developer SRE Hybrid Job Number 21241 Category Senior Professional Status: Permanent Type of Contract Permanent Schedule: Full-Time Full Time / Part Time? Full-Time 06-Jun-2024 City Montreal Province/State Area of Interest: Information technology A career in technology at National Bank means participating in...


  • Montreal, Quebec, G4F, CA Axelon Services Full time

    Job Title: Private Cloud Site Reliability Specialist 12 Months Contract Years of experience: 5+ years Location: Montreal (Office attendance from Day 1 - Hybrid mode) Position Description: The Private Cloud SRE L3 team is part of the Enterprise Computing organization within Brokerage. The team has a presence in cities globally and is focused on supporting...


  • Montreal, Quebec, G4F, CA Pharmascience Inc. Full time

    Proud to be at the forefront of our industry since 1983, Pharmascience is a leader in generic medicines. We’re a Canadian company with global reach that has never lost sight of the human touch. Pharmascience, headquartered in Montreal, was named one of the top 300 employers in 2022. Our 1,400 employees are committed to the quality of our products, research...


  • Montreal, Quebec, G4F, CA Ennuviz Full time

    Job Information Industry: IT Services Work Experience: 3-5 years City: Downtown Montreal Northeast State/Province: Quebec Country: Canada Zip/Postal Code: H2Z About Us We are a client-focused digital transformation expert with 50+ years of technology & industry experience. Our customized solutions empower organizations to streamline, optimize operating...

  • Principal Engineer

    4 weeks ago


    Montreal, Quebec, G4F, CA FHLB Des Moines Full time

    Job Title: Principal Engineer - Software Location: Canada - Montreal Time Type: Full Time Posted On: Posted 3 Days Ago Job Requisition ID: R1493-24 Are you looking for a unique opportunity to be a part of something great? Want to join a 20,000-member team that works on the technology that powers the world around us? Looking for an atmosphere of trust,...


  • Montreal, Quebec, G4F, CA BBA inc. Full time

    Tailings and Mine Waste Principal Engineer. Type of position: Regular. Your future role on our team: BBA is looking for a principal engineer to support the growth of the Soil and Infrastructure group. We're looking for someone who can take on a technical role and participate in the civil design of mine tailings facilities, waste rock piles and various water...


  • Montreal, Quebec, G4F, CA National Bank Full time

    A career in technology at National Bank means participating in transformation to have a direct impact on clients. As a Systems Reliability Developer, you will help all IT teams put in place the necessary mechanisms to improve and maintain the highest standards of resilience and availability of IT services. Your job: Promote good practices for resilience...