Director of Site Reliability Engineering

4 days ago


Mississauga, Canada CEI Fleet Collision and Safety Full time
Director, Site Reliability Engineering

Locations: Mississauga. Time type: Full time. Posted 2 days ago. Job requisition ID: R104373.

We are looking for a Director, Site Reliability Engineering to join Element Fleet Management. As the largest pure-play fleet manager in the world, we provide unmatched products, services, and solutions to our clients.

Someone with experience using data analytics to drive decision-making for system improvements and incident prevention?

As the Director, Site Reliability Engineering, you will lead and manage our SRE team, working closely with cross-functional teams to implement and refine SRE practices, minimize downtime, and drive automation for high efficiency. You will bring a mix of operational and engineering expertise to design robust systems, oversee incident management, monitor key metrics, and foster a culture of continuous improvement.

  • Provide ongoing training and development opportunities for team growth.
  • Incident Management and Response: Lead the team in incident response, coordinating with cross-functional stakeholders to ensure timely resolution.
  • Problem Management: Analyze and address underlying issues in applications and systems to prevent recurring incidents.
  • Change Management and Release Engineering: Implement and oversee change management practices, ensuring safe and reliable releases. Work closely with development and QA teams to standardize and optimize deployment pipelines for maximum reliability and scalability.
  • Monitoring, Alerting, and Reporting: Build and maintain robust monitoring, logging, and alerting solutions for system health and application performance.
  • Automation and Tooling: Drive the adoption of automation and self-healing systems to reduce manual intervention, improve efficiency, and minimize human error. Oversee the development of tools and frameworks to support automation in deployment, monitoring, and incident response.
  • Capacity Planning and Disaster Recovery: Conduct capacity planning and manage resources to ensure systems can handle current and future demands.
  • Audit and Compliance: Collaborate with internal and external audit teams to ensure that our production systems meet SOC1, SOX, and other regulatory requirements.
  • Vendor Management: Manage relationships with external vendors to ensure they meet performance and service level agreements.

  • Bachelor's degree in computer science, engineering, or a related field.
  • 10+ years of experience in IT operations, SRE, or a related field, with a strong record of managing high-availability systems in production environments.
  • Solid understanding of SRE principles and practices, including error budgets, service level objectives (SLOs), and service level indicators (SLIs).
  • Strong background in automation, CI/CD, and DevOps practices, with experience using tools such as Jenkins, GitLab CI/CD, or similar.
  • Experience with observability tools such as Prometheus, Grafana, ELK Stack, Splunk, or DataDog, and the ability to design, implement, and interpret monitoring and alerting systems.
  • Proven ability to lead and manage incident response and post-incident analysis, with a focus on improving response times and reducing incident frequency.
  • Proficiency in scripting and programming languages such as Python, Go, or Bash, with an ability to build automation scripts and tooling.
  • Familiarity with SOC1, SOX, and other regulatory compliance frameworks, and experience in maintaining audit and compliance documentation.
  • Strong project management skills with a focus on prioritization, resource planning, and risk assessment.

  • Google Cloud Professional DevOps Engineer, AWS Certified DevOps Engineer, or Certified Kubernetes Administrator (CKA).
  • Familiarity with advanced SRE tools and practices such as chaos engineering, load testing, and synthetic monitoring.
  • Experience managing third-party relationships to ensure vendors meet performance and service level expectations.

Element Fleet Management is the global leader in the fleet management industry, providing a full suite of customized services and consulting for our clients with commercial vehicle fleets. We simplify fleet management so that our clients are free to achieve their business goals, knowing that their vehicles are in good hands.

  • Mississauga, Canada CEI Fleet Collision and Safety Full time

    Director, Site Reliability Engineering. Locations: Mississauga. Time type: Full time. Posted 3 days ago. Job requisition ID: R104373. Get started on an exciting career at Element! What We Need: We are looking for a Director, Site Reliability Engineering to join Element Fleet Management. As the largest pure-play fleet manager in the world, we provide...


  • Mississauga, Ontario, Canada KUBRA Full time

    KUBRA is a fast-growing company that delivers customer communications solutions to some of the largest utility, insurance, and government entities across North America. As Site Reliability Engineering Director, you will play a key role in ensuring the stability, reliability, and efficiency of our platforms. This is an exciting opportunity to lead a team of...


  • Mississauga, Ontario, Canada Interesting Engineering, Inc. Full time

    About Interesting Engineering, Inc.: Interesting Engineering, Inc. is a dynamic organization that offers cutting-edge solutions for customers. With a strong focus on innovation and stability, we are looking for a skilled Site Reliability Engineer Team Lead to join our team.


  • Mississauga, Ontario, Canada Interesting Engineering, Inc. Full time

    Job Title: Technical Engineering Director. We are seeking a highly experienced and skilled Senior Site Reliability Engineer to lead our team in optimizing our customer experience management platforms. About the Role: Develop and implement strategic plans to achieve low and continuously improving mean time to recovery (MTTR) for service-impacting...


  • Mississauga, Ontario, Canada CEI Fleet Collision and Safety Full time

    We are seeking an experienced Site Reliability Engineering Team Lead to join our team at CEI Fleet Collision and Safety. Job Description: As a Director of Site Reliability Engineering, you will lead and manage our SRE team, working closely with cross-functional teams to implement and refine SRE practices, minimize downtime, and drive automation for high...


  • Mississauga, Ontario, Canada KUBRA Full time

    About KUBRA: KUBRA is a leading provider of billing and payments, mapping, mobile apps, proactive communications, and artificial intelligence solutions for customers. Job Title: Site Reliability Engineering Team Lead. We are seeking an experienced Site Reliability Engineer to lead our DevOps team in optimizing our customer experience management...


  • Mississauga, Ontario, Canada KUBRA Full time

    We are seeking an experienced Site Reliability Engineer to lead our DevOps team in optimizing customer experience management platforms. The ideal candidate will have a passion for enhancing platform stability, reliability, and efficiency. About the Role: As a Team Lead, Site Reliability Engineer, you will work collaboratively with cross-functional teams to...


  • Mississauga, Ontario, Canada KUBRA Full time

    We are seeking a seasoned Site Reliability Engineer to lead our DevOps team in optimizing customer experience management platforms. About the Role: The ideal candidate will have 5+ years of experience in site reliability engineering or a related field, with a strong background in systems programming languages, such as Go or Python, and shell scripting. They...


  • Mississauga, Ontario, Canada KUBRA Full time

    We are growing at KUBRA, and we're seeking a skilled Site Reliability Engineer to lead our DevOps team in optimizing our customer experience management platforms. The ideal candidate will have a passion for enhancing platform stability, reliability, and efficiency. Job Summary: The Site Reliability Engineering Team Lead will play a pivotal role in identifying...


  • Mississauga, Ontario, Canada KUBRA Full time

    KUBRA: A Leader in Customer Experience Management. Are you a seasoned Site Reliability Engineer looking to take on a leadership role? Do you have a passion for enhancing platform stability, reliability, and efficiency? We are growing at KUBRA, a company that specializes in billing and payments, mapping, mobile apps, proactive communications, and artificial...


  • Mississauga, Ontario, Canada KUBRA Full time

    About Us: KUBRA is a leading provider of innovative solutions for customers, offering billing and payments, mapping, mobile apps, proactive communications, and artificial intelligence. Our company culture fosters growth, diversity, and inclusion, making us an attractive employer. Job Description: We are seeking an experienced Site Reliability Engineering Team...


  • Mississauga, Ontario, Canada KUBRA Data Transfer Ltd Full time

    At KUBRA Data Transfer Ltd, we're seeking a highly skilled Team Lead, Site Reliability Engineer to join our DevOps team. We're growing rapidly, and this role will play a crucial part in optimizing our customer experience management platforms. About the Role: We're looking for someone with experience in implementing automation and observability to achieve low and...


  • Mississauga, Ontario, Canada KUBRA Full time

    About KUBRA: KUBRA is a fast-growing company that delivers customer communications solutions to some of the largest utility, insurance, and government entities across North America. We offer billing and payments, mapping, mobile apps, proactive communications, and artificial intelligence solutions for customers. With more than 1.5 billion customer interactions...


  • Mississauga, Ontario, Canada Thermo Fisher Scientific Full time

    About the Role: We are seeking a highly skilled Reliability Engineering Specialist to join our team at Thermo Fisher Scientific. This is a full-time position that offers a competitive salary and benefits package. Job Description: The main focus of this position is to provide support for the Engineering department, with a primary emphasis on developing and...


  • Mississauga, Canada KUBRA Full time

    Are you an experienced Site Reliability Engineer with a passion for enhancing platform stability, reliability, and efficiency? We are growing at KUBRA, and we're looking for a skilled Team Lead, Site Reliability Engineer, where you will guide our DevOps team in optimizing our customer experience management platforms. This is a hybrid opportunity in...


  • Mississauga, Ontario, Canada KUBRA Full time

    Job Description: We are seeking an experienced Site Reliability Engineer to lead our DevOps team in optimizing our customer experience management platforms. You will be responsible for guiding the team in identifying potential issues, resolving complex problems, and leading technical and business discussions. The ideal candidate will have a passion for enhancing...


  • Mississauga, Ontario, Canada Thermo Fisher Scientific Inc. Full time

    Job Summary: We are seeking a highly motivated Reliability Engineering Specialist to join our team at Thermo Fisher Scientific Inc. The successful candidate will work closely with the Engineering department to develop and implement processes and procedures necessary to establish a Reliability Centered Maintenance (RCM) culture in the site and improve overall...


  • Mississauga, Ontario, Canada RocMar Engineering Inc. Full time

    Company Overview: RocMar Engineering Inc. is a leading engineering firm committed to delivering high-quality services. Salary: The estimated annual salary for this position is $85,000. Job Description: We are seeking a skilled Civil Draftsperson to join our team. As a Civil Draftsperson, you will be responsible for creating detailed drawings and designs for various...


  • Mississauga, Ontario, Canada KUBRA Data Transfer Ltd Full time

    We are seeking a highly skilled Reliability Engineering Manager to join our team at KUBRA Data Transfer Ltd. in Mississauga, ON. This is a hybrid opportunity that offers a unique blend of technical and leadership challenges. As a Reliability Engineering Manager, you will play a critical role in ensuring the high availability and security of our customer...


  • Mississauga, Ontario, Canada KUBRA Full time

    About the Role: We are seeking a seasoned Technical Infrastructure Manager to lead our DevOps team in optimizing customer experience management platforms. As a Site Reliability Engineer, you will drive continuous improvement and ensure high standards of availability and security. Key Responsibilities: Platform Stability and Efficiency: Guide our infrastructure...