Staff Site Reliability Engineer, Database
4 weeks ago
Staff Site Reliability Engineer, Database Who We Are: Alpaca is a US-headquartered self-clearing broker-dealer and brokerage infrastructure for stocks, ETFs, options, crypto, fixed income, 24/5 trading, and more. Our recent Series C funding round brought our total investment to over $170 million, fueling our ambitious vision. Amongst our subsidiaries, Alpaca is a licensed financial services company, serving hundreds of financial institutions across 40 countries with our institutional-grade APIs. This includes broker-dealers, investment advisors, wealth managers, hedge funds, and crypto exchanges, totalling over 6 million brokerage accounts. Our global team is a diverse group of experienced engineers, traders, and brokerage professionals who are working to achieve our mission of opening financial services to everyone on the planet . We're deeply committed to open-source contributions and fostering a vibrant community, continuously enhancing our award-winning, developer-friendly API and the robust infrastructure behind it. Alpaca is proudly backed by top-tier global investors, including Portage Ventures, Spark Capital, Tribe Capital, Social Leverage, Horizons Ventures, Unbound, SBI Group, Derayah Financial, Elefund, and Y Combinator. Our Team Members: We're a dynamic team of 230+ globally distributed members who thrive working from our favorite places around the world, with teammates spanning the USA, Canada, Japan, Hungary, Nigeria, Brazil, the UK, and beyond We're searching for passionate individuals eager to contribute to Alpaca's rapid growth. If you align with our core values—Stay Curious, Have Empathy, and Be Accountable—and are ready to make a significant impact, we encourage you to apply. Your Role: As a Site Reliability Engineer (SRE) at Alpaca, you will ensure the reliability, scalability, and performance of our systems and services. You will work closely with development, operations and devops teams to build and maintain robust applications, ensuring they run smoothly and efficiently. This role requires a blend of software engineering and operations skills, with a strong ability to troubleshoot technical issues and resolve problems before they impact our users. Things You Get To Do: Triage difficult technical problems and implement solutions Improve our observability stack (monitoring, logging, profiling) Incident Management: Respond to and resolve incidents in a timely manner, conducting post-incident reviews to identify and implement improvements. Collaboration: Work closely with development teams to ensure new features and services are designed with reliability and scalability in mind. Capacity Planning: Monitor system capacity and performance, making recommendations and implementing changes to handle future growth. Who you are (must-haves): 5+ years of experience in Site Reliability Engineering, Performance Engineering, or similar roles. 5+ years of experience with multi-terabyte scale PostgreSQL clusters. Proven track record of managing and maintaining large-scale, high-availability, and high-performance PostgreSQL database. Experience designing and implementing SLIs, SLOs, and SLAs for internal systems and databases. Experience with troubleshooting PostgreSQL performance problems and slow queries. Extensive experience with efficient schema design and efficient query design. Experience migrating multi-terabyte tables into more efficient schemas. Proficient with Go. Proficient with Prometheus. Proficient with Linux. Knowledgeable in trading/fintech domains. Experience with low-latency systems. Experience with distributed tracing. Experience scaling PostgreSQL clusters rapidly. Experience with pgx, gorm, or sqlc. How We Take Care of You: Competitive Salary & Stock Options New Hire Home-Office Setup: One-time USD $500 Monthly Stipend: USD $150 per month via a Brex Card Alpaca is proud to be an equal opportunity workplace dedicated to pursuing and hiring a diverse workforce. Interested in building your career at Alpaca? Get future opportunities sent straight to your email. #J-18808-Ljbffr
-
Staff Database Reliability Engineer
7 days ago
Canada Achievers Full timeAbout Achievers Achievers offers more than just a thank you program. Our employee recognition and rewards software inspires employees to recognize everyone, every day, everywhere. With 4.3 million global users, we empower employees across 190 countries. Visit us at to learn more and check out our platform in action. Join our team of A-players who bring...
-
Staff Database Reliability Engineer
3 days ago
Canada Achievers Full timeAbout Achievers Achievers offers more than just a thank you program. Our employee recognition and rewards software inspires employees to recognize everyone, every day, everywhere. With 4.3 million global users, we empower employees across 190 countries. Visit us at to learn more and check out our platform in action. Join our team of A-players who bring...
-
Staff Database Reliability Engineer
4 weeks ago
Canada Creek Medium Full timeAbout Achievers Achievers offers more than just a thank you program. Our employee recognition and rewards software inspires employees to recognize everyone, every day, everywhere. With 4.3 million global users, we empower employees across 190 countries. Visit us at achievers.com to learn more and check out our platform in action. Join our team of A-players...
-
Senior Database Site Reliability Engineer
4 weeks ago
, , Canada Ll Oefentherapie Full timeOracle’s Health Application and Infrastructure Database Services Team is a client-facing organization supporting more than 400 customer databases across multiple geographies. SLA adherence is mission-critical, as any lapse directly impacts customers who rely on OHAI to run their healthcare businesses. In alignment with our mission, we are focused on...
-
Staff Infrastructure Site Reliability Engineer
4 weeks ago
, , Canada Remoteworldwide Full timeStaff Infrastructure Site Reliability Engineer Staff Infrastructure Site Reliability Engineer Posted: 04/05/2025 Anywhere in the world Remote Senior About the Team: Netlify’s SRE team is scaling to meet the demands of our rapidly growing platform and user base. Our SRE team is responsible for ensuring the reliability, scalability, and efficiency of...
-
Site Reliability Engineer
2 days ago
(s): Canada : Ontario : Toronto Scotiabank Global Site Full timeRequisition ID: 244026Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture.Overview: As a Site Reliability Engineer (SRE), you will join the Digital Engineering Operations team, responsible for ensuring the operations and reliability of Scotiabank digital applications. You will have the opportunity to drive...
-
Site Reliability Engineer
1 day ago
(s): Canada : Ontario : Toronto Scotiabank Global Site Full timeRequisition ID: 244027Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture.Overview: As a Site Reliability Engineer (SRE), you will join the Digital Engineering Operations team, responsible for ensuring the operations and reliability of Scotiabank digital applications. You will have the opportunity to drive...
-
Staff Site Reliability Engineer
1 week ago
Remote - USA & Canada Boulevard Full timeWho is Boulevard? Boulevard provides the first and only client experience platform for appointment-based, self-care businesses. We empower our customers to give their clients more of the magical moments that matter most.Before launching in 2016, our founders spent months interviewing salon managers and working behind front desks to understand their pain...
-
Remote - United States, Remote - Canada Paxos Full timeAbout Paxos Today's financial infrastructure is archaic, expensive, inefficient and risky — supporting a system that leaves out more people than it lets in. So we're rebuilding it. We're on a mission to open the world's financial system to everyone by enabling the instant movement of any asset, any time, in a trustworthy way. For over a decade, we've...
-
Database Reliability Engineer
4 weeks ago
, , Canada PointClickCare Full timeAt PointClickCare our mission is simple: to help providers deliver exceptional care. And that starts with our people. As a leading health tech company that’s founder-led and privately held, we empower our employees to push boundaries, innovate, and shape the future of healthcare. With the largest long‑term and post‑acute care dataset and a Marketplace...