AWS Site Reliability Engineer
2 months ago
Tecsys is a fast-growing innovator offering supply chain solutions to industry leading healthcare systems, hospitals, and pharmacy businesses to distributors, retailers, and 3PLs. We work with industry leaders to transform their supply chains through technology. p>About the Role
We are looking for a Site Reliability Engineer to work within our “Network and Security Operations Center” department. Our NOC team is aimed at improving the reliability and uptime of our platform and applications in a data-driven way to support internal and external customers' needs.
Your responsibilities
- Collaborate with other Engineering teams to support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews.
- Maintain services once they are live by measuring and monitoring availability, latency and overall system health.
- Develop tools & automation on top of Azure & AWS to continuously reduce the need for manual intervention.
- Scale systems sustainably through automation and evolve systems by pushing for changes that improve reliability and velocity.
- Implement monitoring, logging, alerting, and SLA reporting.
- Implement service monitoring dashboards displaying key metrics.
- Create and maintain technical documentation.
- Provide support for our planning and deployment teams to enable stability, predictability, and scale in our continued growth.
- Collaborate with members of the Platform Engineering team to implement and support far-reaching strategic efforts, provide constructive feedback, and foster a collaborative environment.
- Work cross-functionally with internal teams and vendors to manage our growth around the globe, with a strong focus on maintaining the high level of performance, availability, and reliability for our users.
Requirements:
- Bachelor's degree in computer science or related technical discipline.
- At least 5 years’ experience in systems engineering experience; demonstrable technical experience in new platform development, orchestration, product ownership, and iterative design and deployment.
- Self-organize, collaborate, and manage efforts with peers and teams across responsibility areas, languages, geography, and time zones.
- Basic knowledge of Java- or .Net-based development required.
- Knowledge of GitLab (enterprise license) preferred (or at minimum, Jenkins required).
- Experience with SaaS company is a strong asset.
- Experience with Fedramp (The Federal Risk and Authorization Management Program) compliance is a strong asset.
- Strong English communication skills, both written and spoken, are essential for effective correspondence with customers, business partners and colleagues beyond the province of Quebec.
- Escalation on-call rotation.
-
AWS Site Reliability Engineer
2 months ago
Old Toronto, Canada Street Context Full timep>Are you a Site Reliability Engineer that has a passion for building reliable, resilient and performant systems that scale? p>We are on a mission to build and strengthen our engineering teams to match the accelerating success of Street Context. We provide a premium Email, Analytics and Broker Relationship platform, purpose-built for capital markets and...
-
AWS Site Reliability Engineer
2 months ago
Old Toronto, Canada Soda Full timeJob Description Job Title: Site Reliability Engineer Location: Poland - Fully Remote Salary: 324K PLN or 27.3K monthly Start: ASAP Stack: AWS, Docker, Kubernetes, Terraform, Jenkins, Ansible, Linux, JavaScript, and Lambda. Are you a seasoned DevOps/SRE professional passionate about building high-performance, scalable systems? I am working with a Media/IT...
-
AWS Site Reliability Engineer
1 month ago
Old Toronto, Canada Tecsys Inc. Full timep>Having recognized the advantages of remote work, including employee morale, productivity, reduced commuting on employee wellbeing and the environment, we are proud to be a digital-first company. The technologies and programs in which we invested have provided a fantastic foundation to this end. Our digital-first work environment, together with our...
-
AWS Site Reliability Engineer
1 month ago
Old Toronto, Canada Tecsys Full timeTecsys is a fast-growing innovator offering supply chain solutions to industry-leading healthcare systems, hospitals, and pharmacy businesses to distributors, retailers, and 3PLs. As a Cloud Infrastructure Specialist, you will be responsible for ensuring the reliability and uptime of our platform and applications in a data-driven way to support internal and...
-
AWS Engineer
1 month ago
Old Toronto, Canada Street Context Full timeWe're seeking a seasoned Site Reliability Engineer with a passion for designing and implementing robust, scalable systems on AWS.About Street Context: We provide a premium Email, Analytics, and Broker Relationship platform for capital markets and institutional investors.Scale our system to meet increasing global demand by collaborating with development...
-
AWS Site Reliability Engineer
3 months ago
Old Toronto, Canada Sentry Full timep>The Site Reliability Engineering team is responsible for the deployment, configuration, maintenance, and monitoring of Sentry's hosted platform. We do this by leveraging automation tools to automatically spin up and scale services to meet the traffic demands of 1,000,000+ developers. Sentry receives over a billion events a day and processes terabytes of...
-
AWS Site Reliability Engineer
2 months ago
Old Toronto, Canada Olx Full timep>Site Reliability EngineerRemote Poland, PolandOLX – Engineering / Full-time / Remote At OLX, we work together to build a more sustainable world through trade. We make it safe, smart, and convenient to buy and sell cars, find housing, get jobs, buy and sell household goods, and more. Our colleagues around the world help to serve millions of people around...
-
Site Reliability Engineer- Automation
3 months ago
Old Toronto, Canada Ascend Fundraising Solutions Full timeWe are currently seeking a full-time Site Reliability Engineer to join our IT team. In this role, you will collaborate closely with the client services team to diagnose, troubleshoot, and resolve issues related to system reliability.RESPONSIBILITIES:Take ownership of customer-reported issues and see problems through to resolution.Develop preventive measures...
-
AWS Site Reliability Engineer
3 months ago
Old Toronto, Canada Sentry Full timeBad software is everywhere, and we’re tired of it. Sentry is on a mission to help developers write better software faster, so we can get back to enjoying technology.With more than $217 million in funding and 100,000+ organizations that believe we’re on to something, we're building performance and error monitoring tools that help companies like Disney,...
-
AWS Site Reliability Engineer
3 months ago
Old Toronto, Canada Sentry Full timeBad software is everywhere, and we’re tired of it. Sentry is on a mission to help developers write better software faster, so we can get back to enjoying technology.With more than $217 million in funding and 100,000+ organizations that believe we’re on to something, we're building performance and error monitoring tools that help companies like Disney,...
-
AWS Site Reliability Engineer
2 months ago
Old Toronto, Canada Sentry Full timeSentry is on a mission to simplify software development and improve application performance. We need a skilled AWS Site Reliability Engineer to join our team and help us achieve our goals. This role involves ensuring the uptime and reliability of our hosted platform, architecting and automating services and systems to meet scaling demands, and collaborating...
-
AWS Site Reliability Engineer
4 weeks ago
Old Toronto, Canada Royal Bank of Canada> Full timeb>We are looking to expand our Digital team at RBC. p>We are currently building out our SRE team, with the goal being to provide expertise and tooling to manage the health, security, and availability of their applications in production. We work with other teams to provide guidance throughout the lifecycle of building, deploying, and operating the...
-
AWS SRE Engineer
1 month ago
Old Toronto, Canada Jobber Full timep>At Jobber, we don’t just build a product - we work on real problems that help people in small businesses to become successful. We release early and often while dedicating time to addressing technical debt. p>We help employees grow professionally; we have a ton of onboarding resources, tutorials, hackathons and buddies to support learnings and provide...
-
AWS Site Reliability Engineer
2 months ago
Old Toronto, Canada PharmaLex Full timeYour Job SRE at Pharmalex The SRE at Pharmalex is the software engineering approach to production operations. 50% of your time will be building software to automate the manual work you do during the other 50% of your time will be providing operational support to the products you cover. SRE operates critical products 24/7/365 operating within agreed SLOs....
-
AWS Cloud Infrastructure Engineer
1 month ago
Old Toronto, Canada Ascend Fundraising Solutions Full timeWe are seeking a highly skilled AWS Cloud Infrastructure Engineer to collaborate closely with our IT team. In this role, you will diagnose, troubleshoot, and resolve system reliability issues related to infrastructure management.Key Responsibilities:Take ownership of customer-reported issues, ensuring timely resolution and implementing preventive measures to...
-
Old Toronto, Canada Sentry Full timeSentry is on a mission to empower developers to write better software, faster. As the Engineering Manager of the Site Reliability team, you will lead a talented group of Site Reliability Engineers (SREs) in ensuring the resilience of Sentry's products as they scale.About the RoleThis role involves influencing the engineering culture by promoting operational...
-
Site Reliability Engineering Lead
4 weeks ago
Old Toronto, Canada TD Full timeJob OverviewWe are seeking a highly skilled Site Reliability Engineering Lead to join our team at TD. As a key member of our technology group, you will be responsible for ensuring the stability, scalability, and reliability of our platforms.About the RoleThe ideal candidate will have a minimum of 8 years of experience in site reliability engineering, with a...
-
Old Toronto, Canada Criteo Full timeAbout the Role:We are seeking a skilled and experienced Site Reliability Engineer to join our team at Criteo. As a key member of our Product Reliability Engineering (PRE) group, you will play a critical role in ensuring the reliability and scalability of our cloud infrastructure.Key Responsibilities:Collaborate with product engineering teams to design,...
-
AWS Cloud Infrastructure Specialist
3 weeks ago
Old Toronto, Canada Sentry Full timeThe Sentry platform is built to scale, handling over a billion events daily. We're seeking an experienced AWS Site Reliability Engineer to join our team and contribute to the evolution of our hosted platform.ResponsibilitiesWe collaborate with cross-functional teams to deploy and scale services, ensuring seamless user experiences.In this role, you'll be part...
-
Site Reliability Engineering Linux or Windows
3 months ago
Old Toronto, Canada Thomson Reuters Full timeh3>(Canada) Site Reliability Engineer (Contract)Contract (9 months 4 days)Published 3 days agoNew RelicData DogSite Reliability Engineer - in the Service Management OrganizationDo you have experience in IT Service Management, working with cloud providers, software development, and technology infrastructure?The Site Reliability Engineer will analyze chronic...