Senior Site Reliability Engineer

6 months ago


Toronto, Canada Criteo Full time

What You'll Do:

What’s a PRE Team?
The concept of Product Reliability Engineering (PRE) was born from an industry-leading online SRE book (go ahead, “Google” it). At Criteo, we are the bridge between Product and Platform Engineering. The PRE group is composed of 7 teams of people with a wide variety of backgrounds, experiences and perspectives.

How You’ll Make an Impact
As a Site Reliability Engineer, you’ll work closely with product engineering to improve the reliability of our apps, systems and pipelines and assess where optimization is needed most. You’ll tell stories with meaningful monitoring and hopefully never be paged on your on-call rotation because we’ve worked hard with dev teams to make our platform the most reliable in AdTech. Speaking of on-call, rotations are shared with your local and global team, and your time is compensated in addition to your salary. You’ll continuously learn skills directly from the other team members along the way and have opportunities to teach us too. It’s perfect for an engineer who wants to be involved in system design, infrastructure capacity and performance, troubleshooting and optimizing code, and preventing incidents, and who loves scaling tech with operational excellence.

  • Engage in and improve the whole lifecycle of services, from inception and design through to deployment, operation and optimization.
  • Improve services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews.
  • Maintain services once they are live by measuring and monitoring availability, latency and overall system health.
  • Scale systems through automation and tooling, and leverage continuous deployment pipelines to ensure changes to production are reliably smooth.
  • Practice sustainable incident response and blameless postmortems.
  • Communicate often within your team and with internal stakeholders.

Stack: .NET Core, C#, K8s, Mesos, Java/Scala, Python and more.

Who You Are:

  • Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
  • Extensive experience with software development in one or more programming languages, data structures or algorithms.
  • Proficient in designing, analyzing, and troubleshooting large-scale distributed systems and codebases.
  • Experience working in computing, distributed systems, storage, or networking.
  • Ability to debug and optimize code and to automate routine tasks.
  • Systematic problem-solving approach, coupled with effective verbal and written communication skills.

  • Toronto, Canada Thomson Reuters Full time

    Are you passionate about the chance to bring your technical experience to a digital organization? The Onesource team is looking to add a Senior Site Reliability Engineer to a well-established global digital team. This position requires someone who is a passionate learner, an independent thinker, wor


  • Toronto, Canada Northbridge Financial Corporation Full time

    What is it like to be a Senior Site Reliability Engineer at Northbridge Financial? The Senior Site Reliability Engineer oversees the creation and implementation of Service Level Objectives (SLOs). The Senior SRE handles service reliability solutions and processes of increasing complexity, and is responsible for mentoring and leading less experienced...


  • Greater Toronto Area, Canada GlossGenius Full time

    About GlossGenius GlossGenius is building an ecosystem enabling entrepreneurs to succeed. We empower small business owners to focus on being creators, not admins, by offering a range of business management tools including booking and scheduling, marketing, analytics, payment processing and much more.  Over 75,000 small business owners have chosen to...


  • Toronto, Canada CB Canada Full time

    Site Reliability Engineer. On behalf of our client in the Banking Sector, PROCOM is looking for a Site Reliability Engineer. Site Reliability Engineer – Job Description: Azure cloud; Jira and Confluence; CI/CD; experience with automating (provisioning, configuration management, deployment) and integrating Azure PaaS solutions (Azure App services, Azure...


  • Toronto, Canada Thomson Reuters Full time

    Description Thomson Reuters is seeking a Senior Site Reliability Engineer to join our Service Management, Technology team. This role calls for an individual who is capable of analyzing customer problems of high complexity and assessing the scope of impact, while mitigating customer impact of issues and executing work arounds. Willingness to learn is...


  • Toronto, Canada Vantage Full time

    Senior Site Reliability Engineer / DevOps Engineer Are you passionate about ensuring the seamless operation of large-scale, distributed, and robust systems? Do you thrive on optimizing performance, increasing reliability, and automating tasks to create more efficient processes? Are you hungry for learning? If so, we would want to chat to you! As a...


  • Old Toronto, Canada Lorien Full time

    Hybrid - Manchester We are currently working with a leading gambling company dedicated to providing exceptional gaming experiences. They are looking for an experienced Site Reliability Engineer with a strong skill set in system reliability to join its world-class technology team. This role is ideal for someone who has 4+ years of experience within the...


  • Old Toronto, Canada Soda Full time

    Job Description Job Title: Site Reliability Engineer Location: Poland - Fully Remote Salary: 324K PLN or 27.3K monthly Start: ASAP Stack: AWS, Docker, Kubernetes, Terraform, Jenkins, Ansible, Linux, JavaScript, and Lambda. Are you a seasoned DevOps/SRE professional passionate about building high-performance, scalable systems? I am working with a Media/IT...


  • Old Toronto, Canada TD Bank Full time

    Site Reliability Engineer. Work Location: Canada. Hours: 37.5. Line of Business: Technology Solutions. Pay Details: We’re committed to providing fair and equitable compensation to all our colleagues. As a candidate, we encourage you to have an open dialogue with a member of


  • Toronto, Canada KPMG Canada Full time

    Overview: At KPMG, you'll join a team of diverse and dedicated problem solvers, connected by a common cause: turning insight into opportunity for clients and communities around the world. The OPS Site Reliability Engineer will be a focal role owning and ensuring the fluent operations of Managed Service


  • Toronto, Ontario, Canada Thomson Reuters Full time

    About the Role: Thomson Reuters is seeking a skilled Senior Site Reliability Engineer to join our Service Management, Technology team. This role requires an individual who can analyze complex customer problems, assess the scope of impact, and mitigate customer impact of issues while executing workarounds. A willingness to learn is essential for this...


  • Toronto, Canada Thomson Reuters Full time

    Do you have a passion for DevOps culture and Site reliability engineering? That is, building and operating scalable, reliable, and secured services that underpin all Thomson Reuters’ products. Then we want you on our team! As we expand our Service Reliability team in Toronto, we are currently seeking an experienced Senior SRE to join our Shared...


  • Toronto, Ontario, Canada Index Exchange Full time

    About Index Exchange: We are shaping the future of ad tech and seeking an experienced Senior Site Reliability Engineering Manager to lead our SRE team. As a key member of our technical leadership, you will be responsible for building and managing a high-performing SRE team, fostering a culture of innovation, collaboration, and accountability. You will provide...


  • Old Toronto, Canada Sentry Full time

    About the role The Site Reliability Engineering team is responsible for the deployment, configuration, maintenance, and monitoring of Sentry's hosted platform. We do this by leveraging automation tools to automatically spin up and scale services to meet the traffic demands of 1,000,000+ developers.


  • Toronto, Canada Broadridge Full time

    At Broadridge, we've built a culture where the highest goal is to empower others to accomplish more. If you're passionate about developing your career, while helping others along the way, come join the Broadridge team. Broadridge is growing! We are seeking a Site Reliability Engineer Lead to join our


  • Old Toronto, Canada Street Context Full time

    Are you a Site Reliability Engineer who has a passion for building reliable, resilient and performant systems that scale? Do you command with a steady hand when incidents unfold? Are you motivated by team success? If so, continue reading… We are on a mission to build and strengthen our engineering teams to match the accelerating success of Street...


  • Old Toronto, Canada Street Context Full time

    Are you a Site Reliability Engineer who has a passion for building reliable, resilient and performant systems that scale? We are on a mission to build and strengthen our engineering teams to match the accelerating success of Street Context. We provide a premium Email, Analytics and Broker Relationship platform, purpose-built for capital markets and...


  • Toronto, Canada SGS Full time

    Job Description The Site Reliability Engineer will play a critical part in ensuring the reliability, supportability, scalability, and performance of our .NET stack applications built with MVC, Angular, and Web API. Partner with developers and product operations teams to understand application requirements and translate them into operational practices....


  • Toronto, Ontario, Canada Thomson Reuters Full time

    About the Role: In this opportunity as a Senior Site Reliability Engineer, you will: Identify options for problem resolution and initiate action. Engage others as appropriate and escalate as required. Liaise with various application development and content teams, customer service teams, and other software and hardware support teams. Proactively monitor production...