AWS SRE Engineer

3 weeks ago


Old Toronto, Canada Focal Systems Full time
p>Location: Toronto, Canada - Remote
Salary: $170-180k CAD + stock

Company Description

Focal Systems is the industry leader in retail AI solutions. We are a Silicon Valley based startup that has more than doubled in size every year since inception. Our mission is to automate and optimize brick and mortar retail using deep learning computer vision. We are looking for smart, creative and passionate people who want to help build a great and enduring company and deploy Deep Learning to the world

Work with Backend, Frontend, and Deep Learning teams and write infrastructure automation code for their needs.Identify scalability bottlenecks through load testing and plan infrastructure architecture.Create tools to provide transparency/ease of access into the company's rich datasets stored across varying geographic locations and data formats.Design, build, and manage a robust Continuous Integration and Continuous Deployment (CI/CD) pipeline.

Requirements

  • Solid experience in an infrastructure or Site Reliability Engineer (SRE) role.
  • Great understanding of SQL, networking, distributed systems, operating systems (debian), and software engineering practices.
  • Terraform or other Infrastructure as Code automation solution.
  • Operating Relational SQL databases and Redis at terabyte scale.
  • Proven experience with setting up monitoring/alerting and reliability engineering.
  • Scripting skills in Python.
  • Must be comfortable with 12-hour on-call rotations.
  • Setting up automation for complex load testing scenarios.
  • Tuning Deep Learning pipelines with Python, Pytorch, and Multiprocessing.
  • Backend programming with Python.

    Exceptional Team - We are a team of hard-working, fun-loving professionals from some of the most eminent universities, research labs, and tech companies of our time. We pride ourselves on recruiting exceptional individuals to help us redefine the state-of-the-art.

    Outstanding Partners - We work with 10+ of the largest retailers in the world and have a world-class roster of investors, advisors, and partners to support & advise us in our endeavors.


  • AWS SRE Engineer

    4 weeks ago


    Old Toronto, Canada Manulife Insurance Malaysia Full time

    div>Senior Site Reliability EngineerJob DescriptionDo you want to be part of a team that redefines how we get work done? We are seeking a self-motivated Senior Site Reliability Engineer in our Identity and Access Management space, who is obsessed with delivering value, is forward-thinking, and excited to see the successful implementation of the products...


  • Old Toronto, Canada The Nylas Api Full time

    At The Nylas API, we're committed to simplifying the integration of email, calendar, and contact management features into applications. Our technology enables developers to build more reliable and secure communication tools.The RoleWe're seeking an experienced Cloud Engineering Expert to join our SRE team. As a key member of this team, you'll be responsible...

  • AWS SRE Engineer

    4 weeks ago


    Old Toronto, Canada Northbridge Financial Corporation Full time

    What is it like to be a Senior Site Reliability Engineer at Northbridge Financial? The Senior Site Reliability Engineer oversees the creation and implementation of Service Level Objectives (SLOs). The Senior SRE handles service reliability solutions and processes of increasing complexity and is responsible for mentoring and leading less experienced SREs. We...

  • AWS SRE Engineer

    4 weeks ago


    Old Toronto, Canada Etraveli Group Full time

    h3>About TripstackWe are travel tech entrepreneurs, changing the way millions of people travel. Our proprietary virtual interlining technology provides access to billions of travel itineraries by combining flights from different airline carriers that don’t traditionally work together. We take our customers from point A to B via C, including land...

  • AWS SRE Engineer

    4 weeks ago


    Old Toronto, Canada The Nylas Api Full time

    The Company At Nylas, we specialize in making it easier for developers to add email, calendar, and contact management features into their applications. We provide tools called APIs, which streamline the integration of these functionalities, ensuring they are secure and effective. This enables better, safer, and more reliable communication within...

  • AWS SRE Engineer

    4 weeks ago


    Old Toronto, Canada Jobber Full time

    p>At Jobber, we don’t just build a product - we work on real problems that help people in small businesses to become successful. We release early and often while dedicating time to addressing technical debt. p>We help employees grow professionally; we have a ton of onboarding resources, tutorials, hackathons and buddies to support learnings and provide...


  • Old Toronto, Canada Soda Full time

    Job Description Job Title: Site Reliability Engineer Location: Poland - Fully Remote Salary: 324K PLN or 27.3K monthly Start: ASAP Stack: AWS, Docker, Kubernetes, Terraform, Jenkins, Ansible, Linux, JavaScript, and Lambda. Are you a seasoned DevOps/SRE professional passionate about building high-performance, scalable systems? I am working with a Media/IT...

  • AWS SRE Engineer

    4 weeks ago


    Old Toronto, Canada Criteo Full time

    What You'll Do: What’s a PRE Team? The concept of Product Reliability Engineering (PRE) was born from an industry-leading online SRE book (go ahead, “Google” it). At Criteo, we are the bridge between Product and Platform Engineering. The PRE group is composed of 7 teams of people with a wide variety of backgrounds, experiences, and perspectives. How...


  • Old Toronto, Canada Street Context Full time

    p>Are you a Site Reliability Engineer that has a passion for building reliable, resilient and performant systems that scale? p>We are on a mission to build and strengthen our engineering teams to match the accelerating success of Street Context. We provide a premium Email, Analytics and Broker Relationship platform, purpose-built for capital markets and...


  • Old Toronto, Canada Ascend Fundraising Solutions Full time

    We are seeking a highly skilled AWS Cloud Infrastructure Engineer to collaborate closely with our IT team. In this role, you will diagnose, troubleshoot, and resolve system reliability issues related to infrastructure management.Key Responsibilities:Take ownership of customer-reported issues, ensuring timely resolution and implementing preventive measures to...

  • Sre

    7 months ago


    Toronto, Canada Q1 Technologies Full time

    Skills and Responsibilities: - Owner of the Production Environment: Has independent veto power on changes. Is business aligned and understands business outcomes. - Experience owning change management, release management and Production support. - Experience in an Operational Role? DevOps, SRE, and Software Engineering - Understands code integrity Merges,...


  • Old Toronto, Canada Focal Systems Full time

    About the CompanyFocal Systems is a leading retail AI solutions provider based in Silicon Valley, with an impressive track record of doubling in size every year since inception. Our mission is to revolutionize brick and mortar retail using cutting-edge deep learning computer vision.Job OverviewWe are seeking a highly skilled AWS SRE Engineer to join our...

  • AWS Engineer

    4 weeks ago


    Old Toronto, Canada Street Context Full time

    We're seeking a seasoned Site Reliability Engineer with a passion for designing and implementing robust, scalable systems on AWS.About Street Context: We provide a premium Email, Analytics, and Broker Relationship platform for capital markets and institutional investors.Scale our system to meet increasing global demand by collaborating with development...


  • Toronto, Canada Lorven Technologies Full time

    Job Title : Senior SRE/Platform Support Engineer Location :  Toronto Duration :  Long term Primary-Skills: • 5+ years of Advanced knowledge, experience, and understanding of Unix/Linux, Windows • Hands on experience in working as SRE/Platform Support Engineer • Willingness to work in the Application Support space • Prior RBC...


  • Old Toronto, Canada PharmaLex Full time

    Your Job SRE at Pharmalex The SRE at Pharmalex is the software engineering approach to production operations. 50% of your time will be building software to automate the manual work you do during the other 50% of your time will be providing operational support to the products you cover. SRE operates critical products 24/7/365 operating within agreed SLOs....


  • Old Toronto, Canada Lyons Consulting Group Full time

    p>Provide hands-on SRE with 24x7 SRE support, including incident management, problem management, root cause analysis, monitoring, alerting, and maintenance of infrastructure compliance.Track, audit, monitor and implement on technical work streams.Act as portfolio SME (Domain Expert) – understand & document common components, core functionalities,...

  • App Support Team Lead

    2 months ago


    Toronto, Canada Resonaite Full time

    Our client in the financial services sector is looking for a Technical Team lead on a fulltime/permanent to own the their application operational efforts. Focus is on cloud platforms (Azure/AWS), automation, and Infrastructure-as-Code (IaC) within an Agile environment.Location: Hybrid 1d/week - TorontoResponsibilities:Lead the operational stability and...

  • App Support Team Lead

    4 weeks ago


    Toronto, Canada Resonaite Full time

    Our client in the financial services sector is looking for a Technical Team lead on a fulltime/permanent to own the their application operational efforts. Focus is on cloud platforms (Azure/AWS), automation, and Infrastructure-as-Code (IaC) within an Agile environment.Location: Hybrid 1d/week - TorontoResponsibilities:Lead the operational stability and...

  • App Support Team Lead

    4 weeks ago


    Toronto, Canada Resonaite Full time

    Our client in the financial services sector is looking for a Technical Team lead on a fulltime/permanent to own the their application operational efforts. Focus is on cloud platforms (Azure/AWS), automation, and Infrastructure-as-Code (IaC) within an Agile environment.Location: Hybrid 1d/week - TorontoResponsibilities:Lead the operational stability and...

  • DevOps Sre Manager

    7 months ago


    Toronto, Canada Actionstep Full time

    Actionstep is a pioneer in the development and sale of software-as-a-service (SaaS) products, specializing in the delivery of Legal Practice Management software. We are a fast growing, dynamic business with a global customer base and team. Headquartered in Auckland, New Zealand, with team members in the United Kingdom, United States, Canada and Australia, we...