Site Reliability Engineer

Found in: Talent CA C2 - 5 days ago


Mississauga, Canada OSL Retail Services Full time

Overview

It’s an exciting time to be at OSL Retail Services, working for a people-focused company that’s at the top of its game. The momentum we’ve generated in recent years with our commitments to client customers, innovation, business results, and an entrepreneurial spirit has created energy, enthusiasm, and engagement among our employees that is pushing us to new heights. And we’re on the lookout for talented people who share our vision and values and want to join us in this journey. At OSL, our culture is our foundation. Passionate employees, great customer service and long-term relationships are all built upon that foundation. We value people, passion, honesty, respect, and integrity.

About the role:

As the Site Reliability Engineer, you will be instrumental in managing and maintaining the infrastructure for multiple partner products, including brands such as Walmart, Samsung Canada, Ted Baker, Brooks Brothers, and Lucky Brand. Your expertise will ensure the highest levels of reliability and performance, supporting our commitment to delivering exceptional service to our clients and customers. This hybrid role will be based out of our Mississauga, Ontario location.

What you’ll do:

• Manage OSL Omni Channel Production environment: a hybrid cloud environment (public/private) interconnected to several partners and to OSL-owned and field equipment, supporting multiple brands across the OSL portfolio.
• Build out production monitoring: design and deploy infrastructure monitoring for all services, including API endpoints and external web applications and services.
• Create active reporting and alerting: create dashboards to identify trends and bottlenecks, and develop alerting/escalation strategies to manage incidents effectively.
• Implement security controls: ensure that the environment is secure and adheres to security best practices.
• Implement operational controls and processes: ensure the mechanisms, safety controls, fuse breakers, and due diligence are in place to responsibly introduce changes and react to planned or unplanned events.

What you’ve done:

• 5+ years demonstrated experience in the field.
• Strong knowledge of networking protocols and services (TCP/IP, DNS, DHCP, VPN).
• Experience with cloud platforms such as AWS, Azure, or GCP is a plus.
• Expert or near-expert skills with Linux, Windows, networking, storage, and virtualization.
• Experience with server provisioning and configuration management, utilizing tools such as Terraform, Ansible or Chef, delivering Infrastructure as Code.
• Experience with database administration of MSSQL and PostgreSQL.
• Observability, monitoring and alerting with tools like Prometheus and Datadog.
• Strong engineering background with experience in automation, configuration management, scripting, and security best practices.
• Familiarity with Infrastructure as Code (IaC) principles, automated builds, monitoring, and scaling.
• Experience with system and application monitoring tools such as Nagios, Graphite, Prometheus, Grafana, ELK, CollectD, StatsD, DataDog.
• Proficiency in Windows and Linux systems administration, including scripting, troubleshooting, and upgrading.
• Significant experience with automation and configuration management in a production environment (Puppet, Chef, Ansible).
• Ability to take ownership of technical delivery and collaborate effectively with business partners.
• The ability to assume leadership responsibilities for troubleshooting and managing incidents in the environment is crucial. It requires weighing desired outcomes against risk and urgency. This responsibility may also involve working with OSL's wide network of partner service providers.
• Strong scripting and coding capabilities, sufficient to integrate / instrument OSL’s technology stack.
• Strong mentoring and advocacy skills for good design and engineering values.
• Excellent communication and stakeholder management skills, with the ability to convey complex technical concepts to non-technical audiences.
• Experience maintaining a 24x7 SaaS environment covering multiple time zones.
• Operational support experience with after-hours on-call responsibilities.
• ITIL Knowledge (Incident, Change, and Problem Management) and tools.

Working Conditions:

Flexibility to work various schedules, including evenings and weekends as required.

What’s in it for you:

• Competitive base salary $80-120K plus bonuses and other perks
• Vacation plus additional flex days
• Comprehensive benefits
• Training and development opportunities to grow your career with one of Canada’s Best Managed Companies
• A supportive workplace culture and work environment

  • Director, Site Reliability Engineering

    Found in: beBee jobs CA - 5 days ago


    Mississauga, Ontario, Canada Abbott Laboratories Full time

    About Abbott: Abbott is a global healthcare leader, creating breakthrough science to improve people's health. We're always looking towards the future, anticipating changes in medical science and technology. Working at Abbott: At Abbott, you can do work that matters, grow, and learn, care for yourself and family, be your true self and live a full life. You will...


  • Senior Site Reliability Engineer

    Mississauga, Canada Mimecast Canada Limited Full time

    Senior Site Reliability Engineer. Locations: Canada - Mississauga - Remote. Time type: Full time. Posted 4 days ago. Job requisition ID: R4613. Help Build the Next Generation of Cloud-Scalable AI-Based Security Products. Have a passion...

  • Senior Site Reliability Engineer

    Found in: Talent CA C2 - 5 days ago


    Mississauga, Canada Mimecast Full time

    Senior Site Reliability Engineer: Help Build the Next Generation of Cloud-Scalable AI-Based Security Products. Have a passion for software security? Excel at implementing public cloud at scale? Desire to apply Machine Learning to solve complex problems? This may well be the role for you. Our Communication and Collaboration Security products are cutting edge...

  • Senior Site Reliability Engineer

    Found in: Jooble CA O C2 - 2 days ago


    Mississauga, ON, Canada Mimecast Canada Limited Full time

    Senior Site Reliability Engineer. Locations: Canada - Mississauga - Remote. Time type: Full time. Posted 4 days ago. Job requisition ID: R4613. Help Build the Next Generation of Cloud-Scalable AI-Based Security Products. Have a passion for software security? Excel...

  • Digital Site Reliability Engineer

    Found in: Jooble CA O C2 - 2 weeks ago


    Mississauga, ON, Canada Roche Full time

    The Position Senior Site Reliability Engineer (Kubernetes Platform) - Digital Products and Enablement The 21st century needs a 21st century healthcare system. To help build this, Roche is not only developing highly personalized medicine and advanced diagnostics, but also heavily investing into software and digital solutions. To speed up medical...

  • Reliability Engineer

    Found in: Talent CA C2 - 7 days ago


    Mississauga, Canada Thermo Fisher Scientific Full time

    Facilitate the development and implementation of a Site Asset Maintenance Strategy that achieves optimum asset safety and reliability. Actively promote and advance Reliability Centered Culture principles in the plant by continuing to shift maintenance efforts from corrective to preventative maintenance. Solve asset reliability and availability loss...

  • Director, Site Reliability Engineering

    Found in: beBee S CA - 2 weeks ago


    Mississauga, Canada Abbott Laboratories Full time

    About Abbott: Abbott is a global healthcare leader, creating breakthrough science to improve people’s health. We’re always looking towards the future, anticipating changes in medical science and technology. Working at Abbott: At Abbott, you can do work that matters, grow, and learn, care for yourself and family, be your true self and live a full...

  • Reliability Engineering

    Found in: beBee jobs CA - 5 days ago


    Mississauga, Ontario, Canada Thermo Fisher Scientific Full time

    Job Description: This Co-Op position is a minimum of 12 months and will run from May 2024 through April 2025. Summary: The main focus of this position is to provide support for the Engineering department. Essential Functions: Researches, develops and implements processes and procedures necessary to establish a Reliability Centered Maintenance (RCM) culture to...

  • Senior Site Reliability Engineer

    Found in: Talent CA C2 - 5 days ago


    Mississauga, Canada Roche Full time

    The Position Senior Site Reliability Engineer (Kubernetes Platform) - Digital Products and Enablement The 21st century needs a 21st century healthcare system. To help build this, Roche is not only developing highly personalized medicine and advanced diagnostics, but also heavily investing into software and digital solutions. To speed up medical...


  • Senior Site Reliability Engineer

    Mississauga, Canada Roche Full time

    The Position Senior Site Reliability Engineer (Kubernetes Platform) - Digital Products and Enablement The 21st century needs a 21st century healthcare system. To help build this, Roche is not only developing highly personalized medicine and advanced diagnostics, but also heavily investing into software and digital solutions. To speed up medical processes,...

  • Senior Site Reliability Engineer

    Found in: Jooble CA O C2 - 2 weeks ago


    Mississauga, ON, Canada Roche Full time

    The Position Senior Site Reliability Engineer (Kubernetes Platform) - Digital Products and Enablement The 21st century needs a 21st century healthcare system. To help build this, Roche is not only developing highly personalized medicine and advanced diagnostics, but also heavily investing into software and digital solutions. To speed up medical processes,...

  • Site Reliability Engineer

    Found in: Appcast CA A2 P - 5 days ago


    Mississauga, Canada TechMatrix Inc Full time

    8+ years of experience in DevOps, SRE and performing assessments with project plan. Solid understanding of SRE/DevOps principles and practices. Good understanding of application design and architecture along with various design patterns. Excellent in OpenShift (ECS)- Configure and manage application microservices, creating routes for Microservices, and also...


  • Site Reliability Engineer

    Found in: Appcast CA C2 Glassdoor - 5 days ago


    Mississauga, Canada TechMatrix Inc Full time

    8+ years of experience in DevOps, SRE and performing assessments with project plan. Solid understanding of SRE/DevOps principles and practices. Good understanding of application design and architecture along with various design patterns. Excellent in OpenShift (ECS)- Configure and manage application microservices, creating routes for Microservices, and also...

  • Site Reliability Engineer

    Found in: Whatjobs CA C2 - 7 days ago


    Mississauga, Canada TechMatrix Inc Full time

    8+ years of experience in DevOps, SRE and performing assessments with project plan. Solid understanding of SRE/DevOps principles and practices. Good understanding of application design and architecture along with various design patterns. Excellent in OpenShift (ECS)- Configure and manage application microservices, creating routes for Microservices, and also...

  • Reliability Engineering

    Found in: beBee S CA - 2 weeks ago


    Mississauga, Canada Thermo Fisher Scientific Full time

    Job Description: This Co-Op position is a minimum of 12 months and will run from May 2024 through April 2025. Summary: The main focus of this position is to provide support for the Engineering department. Essential Functions: Researches, develops and implements processes and procedures necessary to establish a Reliability Centered Maintenance (RCM) culture to...

  • Lead Site Reliability Administrator

    Found in: beBee S CA - 2 weeks ago


    Mississauga, Canada opentext Full time

    OpenText is a global leader in information management, where innovation, creativity, and collaboration are the key components of our corporate culture. As a member of our team, you will have the opportunity to partner with the most highly regarded companies in the world, tackle complex issues, and contribute to projects that shape the future...

  • Site Senior Project Engineer

    Found in: Jooble CA O C2 - 7 days ago


    Mississauga, ON, Canada Lycopodium Limited Full time

    With offices in Australia, Canada, Africa, Peru and the Philippines, Lycopodium proudly delivers high quality professional engineering and project delivery services globally, across the resources, infrastructure and industrial processes sectors. Lycopodium Canada is currently recruiting for an experienced FIFO Senior Project Engineer to enable an EPCM gold...

  • Site Coordinator

    7 days ago


    Mississauga, Canada Edenshaw Management Limited Full time

    **SCOPE** The Site Coordinator works alongside the Assistant Superintendent/Superintendent through all stages of a project, starting from site planning, through shoring, excavation, construction and interior finishing. **RESPONSIBILITIES** - Supervise, direct, coach, and train Junior Site Coordinators, Students, and other staff assigned to project - Create...