Senior Software Engineer, Site Reliability
2 days ago
Senior Software Engineer, Site Reliability Join to apply for the Senior Software Engineer, Site Reliability role at Babylist . Babylist is the leading registry, e-commerce, and content platform for growing families. More than 9 million people shop with Babylist every year, making it the go-to destination for seamless purchasing, trusted guidance, and expert product recommendations for new parents and the people who love them. What began as a universal registry has grown into a full ecosystem for new parents, including the Babylist Shop, Babylist Health, and a flagship showroom in Los Angeles. Hundreds of brands in baby and beyond partner with Babylist to engage meaningfully with families during one of life’s most important transitions. With over $1 billion in annual GMV, and more than $500 million in 2024 revenue, Babylist is reshaping the $320 billion baby product industry. We’re helping parents feel confident, connected, and cared for at every step. As we build the generational brand in baby, our mission remains simple: to connect growing families with everything they need to thrive. To learn more, visit Our Ways of Working Babylist thrives as a remote-first company, with HQ team members located across the U.S. and Canada. We meet in person twice a year—once as a company and once by department to strengthen the relationships that power our work. We show up consistently, stay purpose-driven, and achieve results —together, from anywhere. Ruby on Rails React AWS MySQL Redis Native iOS and Android What the Role Is Babylist is looking for a Senior Software Engineer, Site Reliability to join our Platform team. In this position, you will play a vital role in ensuring our systems and services’ stability, scalability, and reliability. You will work closely with all Babylist Engineering teams to support shared infrastructure and developer tools. Your expertise in site reliability engineering, AWS cloud infrastructure, and modern DevOps practices will be instrumental in optimizing our systems and driving continuous improvement. Who You Are 8+ years of experience as a Site Reliability Engineer or similar role, demonstrating a strong background in maintaining highly available and scalable systems Experience supporting high-traffic consumer-facing websites, understanding the unique challenges and considerations in maintaining such systems Proficiency with Terraform is a must, as you will be a member of the team responsible for managing and building our AWS infrastructure using Infrastructure as Code (IaC) practices You possess strong experience working with AWS cloud-based infrastructure and services, ensuring their reliability, performance, and security Proficiency with Docker and Kubernetes is essential, as you will contribute to the design, deployment, and management of containerized applications in our environment You have a solid understanding of cloud-native systems design, including CDNs, load balancers, cloud networking, DNS, caching, and distributed systems Troubleshooting and debugging are second nature to you, allowing you to quickly identify and resolve issues across various environments Experience designing and supporting CI systems such as CircleCI, Jenkins, or GitHub Actions You are familiar with monitoring and alerting best practices, utilizing tools like Datadog, Cronitor, Sentry, and PagerDuty to ensure proactive identification and resolution of issues Proven experience in on-call management best practices, including effective incident response, escalation procedures, and post-incident reviews to drive continuous improvement and ensure system reliability You have excellent verbal and written communication skills, and the ability to collaborate effectively with cross-functional teams You’re comfortable and enthusiastic about working in an AI-forward environment where AI tools are part of daily operations. You embrace using technology to enhance your work while keeping people at the center How You Will Make an Impact Manage and build our AWS infrastructure using Infrastructure as Code (IaC) tools like Terraform. You will ensure that our EKS clusters and databases are running up-to-date versions, optimizing performance and reliability Improve the speed and reliability of our Continuous Integration (CI) systems to support the entire Engineering Team, enabling faster and more efficient development and deployment processes Provide support to developers in troubleshooting issues across local development, staging, and production environments Establish, communicate, and support best practices for monitoring and alerting. This will involve setting up effective monitoring systems and defining actionable alerts for proactive incident management Why You Will Love Working At Babylist Our Culture We work with focus and intention, then step away to recharge We believe in exceptional management and invest in tools and opportunities to connect with colleagues We build products that positively impact millions of people's lives AI is intentionally embedded in how we work, create, and scale—supporting innovation and impact Growth & Development Competitive pay and meaningful opportunities for career advancement We believe technology and data can solve hard problems We’re committed to career progression and performance-based advancement Competitive salary with equity and bonus opportunities Company-paid medical, dental, and vision insurance Retirement savings plan with company matching and flexible spending accounts Generous paid parental leave and PTO Remote work stipend to set up your office Perks for physical, mental, and emotional health, parenting, childcare, and financial planning About Compensation We use a market-based approach to compensation. The starting salary range for this role is: US: $186,818 to $224,183 Canada: $185,600 to $232,000 CAD Your starting salary will be based on your location, experience, and qualifications, with increases over time tied to performance, role growth, and internal pay equity. Interview Process & Consent Babylist uses AI to record and transcribe all interviews for evaluation purposes in accordance with CCPA and GDPR. By participating in an interview, you consent to this recording and transcription. Interview Integrity During the interview process, we’re evaluating your individual problem-solving skills, creativity, and approach to challenges. While AI tools like ChatGPT, Claude, and Cursor are part of your daily toolkit once you join Babylist, all interviews, assessments, and take-home assignments must be completed independently. You may not use AI tools, third-party services, coaching platforms, or content-farming services during any part of the interview process unless we explicitly permit it. We will clearly communicate when AI tools are allowed for specific assessments. Any indication of third-party assistance or AI-generated responses will result in immediate disqualification. We may also verify educational credentials through third-party sources—providing false or misleading information will result in removal from consideration. All communication will come only from the Babylist Talent Team via an @babylist.com email address. We will never request payment, bank information, or personal financial details. Be cautious of fraudulent outreach via non-company email addresses, messaging platforms (e.g., WhatsApp, Telegram), or unsolicited phone calls. Verify legitimate opportunities on our careers page. SMS Consent You may opt in to receive text message updates about your application or interviews. Opting out will not affect your application status—communication will continue via email or phone. Message and data rates may apply. Reply STOP to unsubscribe or HELP for assistance. See our Privacy Policy for more information. Seniority level: Mid-Senior level Employment type: Part-time Industries: Consumer Services #J-18808-Ljbffr
-
Senior Site Reliability Engineer
2 weeks ago
, , Canada Akamai Technologies Full timeSenior Site Reliability Engineer Join Akamai Technologies as we build a reliable, secure, and scalable Internet. We are looking for a Senior Site Reliability Engineer to help us solve complex performance and reliability challenges. Job Description Are you passionate about cutting‑edge technology and ready to tackle some of the Internet’s most difficult...
-
Senior Site Reliability Engineer
2 weeks ago
, , Canada TekRek Full timeThis range is provided by TekRek. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Base pay range CA$90.00/hr - CA$120.00/hr Senior Site Reliability Engineer – Distributed Systems, Kubernetes, AWS/GCP The Company TekRek has partnered with a fast‑scaling AI infrastructure company building one of the...
-
Senior Site Reliability Engineer
2 weeks ago
, , Canada D-Wave Full timeJoin to apply for the Senior Site Reliability Engineer role at D‑Wave . D‑Wave (NYSE: QBTS) is a leader in the development and delivery of quantum computing systems, software, and services. We are the world’s first commercial supplier of quantum computers, and the only company building both annealing and gate‑model quantum computers. Our mission is...
-
Site Reliability Engineer
1 week ago
(s): Canada : Ontario : Toronto Scotiabank Global Site Full timeRequisition ID: 244027Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture.Overview: As a Site Reliability Engineer (SRE), you will join the Digital Engineering Operations team, responsible for ensuring the operations and reliability of Scotiabank digital applications. You will have the opportunity to drive...
-
Site Reliability Engineer
1 week ago
(s): Canada : Ontario : Toronto Scotiabank Global Site Full timeRequisition ID: 244026Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture.Overview: As a Site Reliability Engineer (SRE), you will join the Digital Engineering Operations team, responsible for ensuring the operations and reliability of Scotiabank digital applications. You will have the opportunity to drive...
-
Site Reliability Engineer
3 weeks ago
, , Canada SPECTRAFORCE Full timeJob Title: DevOps/Site Reliability Engineer Duration: 12+ months Core hours of the position: somewhat flexible, but able to attend meetings and collaborate with team members between 8 am Pacific and 3 pm Pacific. Team members are located in Pacific, Mountain, Central, and East time zones Top 3 items to see on resumes 5+ years of experience in DevOps, Site...
-
Site Reliability Engineer
3 weeks ago
, , Canada Blue Signal Search Full timeDirect message the job poster from Blue Signal Search Sr. Executive Recruiter at Blue Signal Search Site Reliability Engineer Location: Remote, Canada Our client is a fast‑growing provider of AI‑driven edge‑computing platforms that keep industrial operations safe, smart, and always on. Their distributed hardware and software suite processes...
-
Senior Site Reliability Engineer
2 weeks ago
Eastern Canada|Ontario|Montreal|Quebec|Nova Scotia|Vancouver|Calgary|Winnipeg|British Columbia|Manitoba|Edmonton|Saskatoon|Ottawa|Saint John|White Rock|Kitchener|Halifax|Coquitlam|Burnaby|St. John's Targeted Talent Full timeWe are looking for an experienced Senior Site Reliability Engineer for our client. This is a permanent position that is remote to start with later relocation to Calgary or Winnipeg. Our client is a global enterprise company with a product that you've likely used. Experience with coding/software development, along with Site Reliability will be the key...
-
Senior Site Reliability Engineer
4 weeks ago
, BC, Canada Orion Innovation Full timeOverview Senior Site Reliability Engineer (SRE) with Kubernetes and Rancher. Full-time role focused on building and maintaining highly resilient, secure systems, including in air-gapped environments. Responsibilities System Architecture & Management: Design, architect, and maintain highly reliable, multi-tenant systems using Kubernetes and related tools...
-
Senior Site Reliability Engineer
5 days ago
, , Canada Medium Full timeWe believe communication belongs to everyone. We exist to democratize phone service. TextNow is evolving the way the world connects and that's because we're made up of people with curious minds who bring an optimistic, yet critical lens into the work we do. We're the largest provider of free phone service in the nation. And we're just getting started. Join...