Site Reliability Engineer II
4 weeks ago
Overview

The Company You’ll Join
Carta develops purpose-built software that transforms traditional accounting into a powerful growth engine. Carta’s world-class fund administration platform supports nearly 7,000 funds and SPVs, and represents nearly $130B in assets under management in venture capital and private equity. Trusted by more than 40,000 companies, Carta also helps private businesses in over 160 countries manage their cap tables, valuations, taxes, equity programs, compensation, and more. Together, Carta is setting a new standard as the end-to-end platform for private markets. Our best-in-class solution for fund management seamlessly integrates investor and portfolio company insights via a suite of tools designed from the ground up to support the strategic impact of the fund CFO. For more information about our offices and culture, check out our Carta careers page.

The Problems You’ll Solve
At Carta, our employees set out on a mission to unlock the power of equity ownership for more people in more places. We believe that the problems we solve today unlock the opportunities of tomorrow. As a Site Reliability Engineer II, you’ll work to:
Build and scale our internal platform offerings (compute, storage, and networking services) to ensure the reliability and performance of our applications.
Design and implement monitoring, alerting, and incident response systems.
Collaborate with application software engineers (as needed) to guide their design and ensure it scales for what Carta needs in the long run.
Act as an agent of change and push boundaries to incrementally improve our systems as we expand globally.

The Team You’ll Work With
You’ll be joining the Infrastructure Engineering team at Carta. The Infrastructure Engineering team is responsible for providing secure, reliable, scalable, and performant infrastructure to Carta’s customers and developers.
We are Software and Infrastructure Engineers who specialize in cloud computing, networking, systems design and architecture, storage, real-time data telemetry, and the associated automation, tooling, and processes. We possess a breadth and depth of knowledge about Carta’s infrastructure and industry-wide best practices that translates into leverage for Carta’s business.

About You
You are excited by the idea of developing scalable, reliable, and efficient infrastructure that powers the entire company. We’re looking for strong communicators who enjoy collaborating to solve complex problems. Familiarity with infrastructure best practices on performance, reliability, and security, and with their associated tools, is appreciated. Our stack is Python, Java, Terraform, gRPC, Docker, Kubernetes, and Postgres, running on AWS. Come join us.
Cloud Platforms: Extensive experience with cloud services such as AWS, Google Cloud Platform, or Azure, including services like EC2, S3, RDS, and Lambda. Experience with Kubernetes or other container orchestration is preferred.
Infrastructure as Code (IaC): Proficient in using tools such as Terraform, Ansible, or CloudFormation for managing and provisioning cloud infrastructure.
Networking: Experience with networking concepts and tools, including Container Network Interface (CNI) and network policy implementations. Experience with proxies and service mesh is a big plus.
Monitoring and Observability: Strong knowledge of monitoring tools and practices, such as Prometheus, Grafana, ELK Stack, or Datadog, and the ability to set up and maintain comprehensive monitoring solutions.
Software Development: Proficiency in Python, with the ability to write efficient, maintainable, and scalable code.
Database Management: Strong knowledge of PostgreSQL, including performance tuning, replication, backup, and recovery.
API Services: Experience in designing, deploying, and maintaining API services, with a strong understanding of RESTful and/or GraphQL API design principles.
Knowledge of service mesh technologies such as Istio, Cilium, or Linkerd is appreciated though not essential.
Experience operating CI/CD and its associated best practices is also appreciated though not essential.

We are an equal opportunity employer and are committed to providing a positive interview experience for every candidate. If accommodations due to a disability or medical condition are needed, please connect with the talent partner via email. Interested in data privacy? Check out our policies on Privacy and CA Candidate Privacy. Please note that all official communications from us will come from an @carta.com domain.
-
Site Reliability Engineer
3 days ago
Alton, Canada Apptoza Inc. Full time
Hi, hope you are doing great. If you are fine with the JD below, please share your updated resume ASAP. Site Reliability Engineer Location: Toronto (onsite) Duration: 6 months Exp Required: 10 Years Job Description: Job Title: SRE Technical/Functional Skills • 8+ years of overall IT experience. • Advanced Linux / Unix support experience required. •...
-
Senior Site Reliability Engineer
3 weeks ago
Alton, Canada Canonical Full time
Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation and IoT. Our customers...
-
Lead Site Reliability Administrator
3 weeks ago
Alton, Canada OpenText Full time
Join OpenText as a Lead Site Reliability Administrator. OpenText is a global leader in information management, where innovation, creativity, and collaboration are the key components of our corporate culture. As a member of our team, you will have the opportunity to partner with the most highly regarded companies in the world, tackle complex issues, and...
-
Alton, Canada Manulife Full time
Join our Group Benefits Engineering Team! We are passionate about building software that solves problems. We count on our team to empower our users with a rich feature set, high availability, and stellar performance level to pursue their missions. The successful applicant will be involved with application support, application deployment, business requirement...
-
Platform Reliability Engineer
2 weeks ago
Alton, Canada Manulife Insurance Malaysia Full time
Platform...
-
Lead Site Reliability Administrator
4 weeks ago
Alton, Canada OpenText Full time
Site Reliability Administrator. OpenText - The Information Company. OpenText is a global leader in information management, where innovation, creativity and collaboration form the core of our culture. As part of our team you will partner with world-renowned companies, tackle complex issues and help shape the future of digital transformation. YOUR IMPACT: The role...
-
Alton, Canada Menlo Ventures Full time
A leading technology firm in Southwestern Ontario seeks a Senior QA Automation Engineer II to enhance quality across engineering teams. You'll define and implement automation frameworks, partner with developers, and mentor junior team members. The ideal candidate has extensive automation experience, strong programming skills, and a passion for improving...
-
Backend Engineer II: Scale AI Compute
3 weeks ago
Alton, Canada Zepl Full time
A leading AI solutions firm is seeking a Backend Engineer II for their AI Compute team in Northwestern Ontario. In this role, you'll develop and support compute solutions, ensuring performance and security for AI workloads. Ideal candidates will have strong Kubernetes expertise and a solid background in Python development, along with experience in CI/CD...
-
Senior Observability
2 weeks ago
Alton, Canada Manulife Full time
A leading financial services provider is seeking a Senior Infrastructure Reliability Engineer to build and maintain scalable systems and ensure optimal monitoring of infrastructure. The role requires over 8 years of experience in infrastructure reliability engineering, plus automation skills using Terraform and Ansible. This position offers a competitive...
-
Senior Observability
3 weeks ago
Alton, Canada Manulife Financial Full time
A leading financial services provider based in Southwestern Ontario is seeking an experienced Infrastructure Reliability Engineer. The role involves building and maintaining scalable systems, automating processes, and ensuring infrastructure performance. Ideal candidates should possess over 8 years of experience in infrastructure reliability engineering,...