Senior DevOps/Site Reliability Engineer

3 weeks ago


Toronto ON, Canada Focal Systems Full time

We are looking for smart, creative and passionate people who want to help build a great and enduring company and deploy Deep Learning to the world!

Mission of the role: To enable us to scale from 200k to 1 million cameras

Job Summary

As a Sr. DevOps/Site Reliability Engineer (SRE) at our company, you will play a pivotal role in ensuring the smooth operation and continuous improvement of our infrastructure, deployment processes, and overall system reliability.

Responsibilities

  • Set up and manage blue/green and canary deployments to ensure smooth launches without downtime.
  • Operate multiple large GCP Kubernetes clusters and fine-tune them for reliability vs. cost.
  • Manage the company's various distributed services, always ensuring graceful updates, comprehensive test coverage, log tracking, and 99.9% uptime.
  • Work with Backend, Frontend and Deep Learning teams and write infrastructure automation code for their needs.
  • Identify scalability bottlenecks through load testing and plan infrastructure architecture.
  • Create tools to provide transparency/ease of access into the company's rich datasets stored across varying geographic locations and data formats.
  • Design, build, and manage a robust Continuous Integration and Continuous Deployment (CI/CD) pipeline.

Requirements

  • 4+ years of experience in an infrastructure or Site Reliability Engineer (SRE) role.
  • 3+ years of experience with containerization (Docker) and orchestration platforms (Kubernetes) required.
  • Great understanding of SQL, networking, distributed systems, operating systems (Debian), data structures, algorithms, and software engineering practices.
  • Experience operating Kafka (or other Pub/Sub) clusters at terabyte scale.
  • Terraform or another Infrastructure as Code automation solution.
  • Operating relational SQL databases and Redis at terabyte scale.
  • Proven experience with setting up monitoring/alerting and reliability engineering.
  • Scripting skills in Python.

Nice to have experience:

  • GitOps
  • Setting up automation for complex load testing scenarios
  • Tuning Deep Learning pipelines with Python, PyTorch and Multiprocessing
  • Backend programming with Python

Why Focal Systems

  • Strong and - We are a tightly-knit team with an ambitious mission and a strong set of core values, which define our approach to business and have successfully guided us since inception.
  • Exceptional Team - We are a team of hard-working, fun-loving professionals from some of the most eminent universities, research labs, and tech companies of our time. We pride ourselves on recruiting exceptional individuals to help us redefine the state-of-the-art.
  • Outstanding Partners - We work with 10+ of the largest retailers in the world and have a world-class roster of investors, advisors and partners to support & advise us in our endeavors.

Benefits

We care deeply about the health, happiness, and wellbeing of all of our employees. We offer:

  • Competitive Salary & Attractive Stock
  • Health Insurance
  • Catered lunches
  • Paid Time Off
  • Quarterly Team Retreats
  • Education grants

Benefits

  • Competitive salary
  • Health Insurance
  • Catered lunches
  • Paid Time Off
  • Quarterly Team Retreats
  • Education grants
  • Meaningful equity grants in a very fast-growing startup that is aiming for an IPO


  • Mississauga, ON, Canada Mimecast Canada Limited Full time

    Senior Site Reliability Engineer. Locations: Canada - Mississauga - Remote. Time type: Full time. Posted 4 days ago. Job requisition ID: R4613. Help Build the Next Generation of Cloud-Scalable AI-Based Security Products. Have a passion for software security? Excel...


  • Toronto, ON, Canada Autodesk Full time

    Position Overview Virtual and augmented reality are transforming design and creation through new immersive and collaborative experiences to improve how major segments like entertainment, architecture, engineering, construction, and manufacturing converge. Many industries are being transformed by the growth of XR technology, creating new ways of working to...


  • Mississauga, ON, Canada Mimecast Full time

    Senior Site Reliability Engineer. Help Build the Next Generation of Cloud-Scalable AI-Based Security Products. Have a passion for software security? Excel at implementing public cloud at scale? Desire to apply Machine Learning to solve complex problems? This may well be the role for you. Our Communication and Collaboration Security products are cutting-edge...


  • Toronto, Canada BMO Full time

    Application Deadline: 04/29/2024. Address: 33 Dundas Street West. This role is Hybrid (1-2 days per week in the office). The Director - Site Reliability Engineering will lead a team that will work with application teams, infrastructure teams, and business partners to continuously improve the stability, reliability and efficiency of Finance and Enterprise Risk...


  • Toronto, ON, Canada Akamai Full time

    Are you passionate about cutting-edge technology? Does solving some of the Internet's most difficult content delivery challenges interest you? Join our Compute Site Reliability team! Our team is responsible for monitoring and measuring the reliability of our suite of Compute products and platform. In collaboration with Engineering and Product teams, we focus...


  • Ottawa, ON, Canada Lightspeed Commerce Full time

    Hi there! Thanks for stopping by. Are you actively looking for a new opportunity? Or just checking the market? Well… you might just be in the right place! We’re looking for a Principal Site Reliability Engineer to join our NuORDER by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size and...


  • Ottawa, ON, Canada Lightspeed Full time

    Hi there! Thanks for stopping by. Are you actively looking for a new opportunity? Or just checking the market? Well… you might just be in the right place! We’re looking for a Principal Site Reliability Engineer to join our NuORDER by Lightspeed team in North America. NuORDER by Lightspeed builds software solutions that help merchants grow the size...


  • Old Toronto, Canada Focal Systems Full time

    We are looking for smart, creative and passionate people who want to help build a great and enduring company and deploy Deep Learning to the world! Mission of the role: To enable us to scale from 200k to 1 million cameras. Job Summary: As a Sr. DevOps/Site Reliability Engineer (SRE) at our company, you will play a pivotal role in ensuring the smooth operation...


  • Toronto, ON, Canada Score Media and Gaming Inc. Full time

    About the Role As part of the theScore team, you will be working with a team of smart, friendly, and dedicated Engineers, Product Managers and Designers determined to deliver some of the best apps the market has to offer. We want you to be challenged and to get the full experience of what it’s like to work at theScore! We are looking for a Senior DevOps...


  • Toronto, ON, Canada Thomson Reuters Full time

    (Canada) Site Reliability Engineer (Contract). Contract (9 months 4 days). Published 3 days ago. New Relic, Datadog. Site Reliability Engineer in the Service Management Organization. Do you have experience in IT Service Management, working with cloud providers, software development, and technology infrastructure? The Site Reliability Engineer will analyze...


  • Ottawa, ON, Canada Axiad Ids, Inc. Full time

    Axiad is looking for an experienced Senior DevOps Engineer to join our Cloud team. Do you have a passion for Continuous Integration (CI) and Delivery (CD) and cloud-first applications leveraging cloud-agnostic technology that runs on cloud platforms (AWS, GCP, Azure)? Do...


  • Ottawa, ON, Canada Synopsys, Inc. Full time

    Synopsys, Software Integrity Group, is named a leader for 2023 in the Gartner Magic Quadrant for Application Security Testing (AST), in recognition of our vision and ability to execute. Security and risk management leaders will need to meet tighter deadlines and test more-complex applications by integrating and automating AST in the software life cycle...


  • Toronto, Canada Equitable Bank Full time

    The Work: Design, develop, and implement Java-based solutions using microservices architecture. Deploy and maintain digital platforms on the cloud, ensuring high availability and scalability. Collaborate with cross-functional teams to integrate numerous services and ensure seamless delivery. Be a functional leader in the team, guiding the team with the best...


  • Toronto, ON, Canada CB Canada Full time

    Site Reliability Engineer. On behalf of our client in the Banking Sector, PROCOM is looking for a Site Reliability Engineer. Site Reliability Engineer – Job Description: Azure cloud; Jira and Confluence; CI/CD; experience with automating (provisioning, configuration management, deployment) and integrating Azure PaaS solutions (Azure App services, Azure...

  • Sr. DevOps Engineer

    15 hours ago


    Toronto, Canada The Select Group Full time

    Sr. DevOps Engineer. The Select Group is seeking a Senior DevOps Engineer for our leading telecommunication client in Canada. We are currently seeking a Senior DevOps Engineer to join our client's growing team and play a key role in shaping the future of their technology landscape. Sr. DevOps Engineer requirements: Bachelor's degree in Computer Science,...

  • Sr. DevOps Engineer

    8 hours ago


    Toronto, Canada The Select Group Full time

    Sr. DevOps Engineer. The Select Group is seeking a Senior DevOps Engineer for our leading telecommunication client in Canada. We are currently seeking a Senior DevOps Engineer to join our client's growing team and play a key role in shaping the future of their technology landscape. Sr. DevOps Engineer requirements: Bachelor's degree in Computer Science,...