Software Engineer, Data Platform

1 week ago


Kitchener, Ontario, Canada Tbwa ChiatDay Inc Full time
Software Engineer, Developer Experience & MLOps

About Dialpad

Dialpad is the leading Ai-powered customer communications platform creating human-first, Ai-enhanced solutions that will drive the next wave of how businesses communicate with and serve their customers. Enterprise customers like Randstad, Remax, Mizuho, Cigna, T-Mobile, Johns Hopkins, Motorola, Warby Parker, Panera Bread, and Netflix use Dialpad and its Ai capabilities to deliver amazing customer experiences. Supported by notable investors such as Andreessen Horowitz, Google Ventures, and ICONIQ Capital, Dialpad is a dynamic force in Ai technology with a rapidly expanding presence. Visit dialpad.com to learn more.

About the team

Dialpad's Ai Engineering team works centrally alongside Data Science, Telephony, and Product Engineering teams to produce The Good Ai. In this role, you'll leverage and acquire a broad skill set ranging from Distributed Systems Engineering, DevOps, MLOps, and Data Engineering to deliver functionality essential to powering Dialpad's Ai products.

Your role

As a Software Engineer – AI Developer Experience & ML Platform, you will design, build, and optimize the infrastructure, tooling, and workflows that enable engineers and data scientists to efficiently develop, deploy, and scale AI-powered applications. Your role will be multifaceted, spanning both developer experience (DevEx)—streamlining development, testing, and deployment—and MLOps & ML platform engineering—ensuring scalable, reliable, and high-performance AI/ML workloads.

What you'll do

First Week

- Merge your first PR & learn the review process: Make a small contribution, go through the code review process, and get familiar with team coding standards.
- Learn CI/CD workflows & deployment process: Understand how changes move from development to production, including CircleCI, Terraform, and GitOps workflows.
- Test, deploy, and monitor a change in production: Push a minor update, observe logs, metrics, traces and alerts, and ensure smooth rollout.
- Meet the team & key stakeholders: Get to know your immediate team and cross-functional teams (ML engineers, data scientists, platform engineers).

First 3 Months

- Work directly on Dialpad's AI/ML pipelines, Vertex AI and Kubernetes-based dev environments to enhance platform performance.
- Optimize developer workflows, including CI/CD pipelines (CircleCI) and infrastructure (Terraform), to accelerate AI and ML deployments.
- Strengthen observability and debugging (Grafana, Loki, OpenTelemetry) for better insights and faster issue resolution.
- Collaborate with cross-functional teams to identify bottlenecks, ship quick wins, and demonstrate measurable improvements.

First 6 Months

- Streamlining ML deployments, data ingestion, and environment setup through internal CLI tools, templates, and dashboards to empower self-serve developer workflows.
- Automating ML model testing and deployment rollbacks to refine and automate CI/CD for AI/ML, including improving GitOps workflows with CircleCI and Terraform for increased reliability.
- Using Grafana, Loki, OpenTelemetry, and Vertex AI Model Monitoring to enhance AI/ML observability and monitoring by expanding logging, tracing, and real-time metrics.
- Optimizing Kubernetes and cloud workflows to improve GKE-based AI workloads, autoscaling policies, and resource efficiency to address growing ML and data pipeline demands.

First 12 Months

- Optimize AI compute cost and efficiency by implementing autoscaling, spot instance scheduling, and GPU/TPU resource optimization to balance performance and cost.
- Build a self-serve AI infrastructure by developing internal developer tooling, dashboards, or APIs that enable engineers and data scientists to easily deploy models and manage data pipelines.
- Enable AI-driven analytics at scale by ensuring real-time AI insights power customer-facing features with sub-second query latencies in Pinot, BigQuery, and Dataflow.
- Automate infrastructure provisioning by expanding GitOps-driven automation for deploying and managing AI workloads, Kubernetes clusters, and cloud resources.

Technologies you know

- Kubernetes & Cloud Infrastructure – Managing GKE, Terraform, Cloud Workstations, and IAM for scalable AI/ML workloads.
- CI/CD & GitOps – Automating deployments with CircleCI, Terraform, and Cloud Build to streamline AI/ML workflows.
- ML Pipeline & Data Processing – Working with Vertex AI Pipelines, MLFlow, Apache Beam, Spark, Dataflow, Pub/Sub, and BigQuery to enable real-time AI analytics.
- Observability & Monitoring – Implementing Grafana, Loki, OpenTelemetry, and Vertex AI Model Monitoring for debugging and tracking AI performance.
- Model Deployment & Serving – Understanding Kubeflow, TensorFlow Serving, Triton Inference Server, and strategies for scalable ML inference.

Skills you'll bring

- You have a Bachelor's Degree in Computer Science, Software Engineering, Mathematics, or a related field, or equivalent work experience.
- You have 3+ years of experience in DevOps, MLOps, Developer Experience, or related roles.
- You have strong fundamentals in software engineering, cloud infrastructure, and distributed systems.
- You thrive in a collaborative, distributed team and can work effectively across time zones.
- You have experience building and maintaining AI/ML infrastructure, CI/CD pipelines, or developer tooling.
- You enjoy automating and optimizing development workflows, from CI/CD pipelines to AI/ML deployments.
- You take a data-driven approach to system reliability, ensuring observability, monitoring, and performance tracking.
- You believe in choosing the right tool for the job, balancing scalability, efficiency, and maintainability.
- You are comfortable working across infrastructure, AI/ML pipelines, and developer tooling to support high-scale applications.
- You enjoy continuous learning and knowledge-sharing, improving both your skills and your team's capabilities.
- You are fluent in English and communicate complex technical concepts clearly.

Bonus Points For

- A track record of Open Source contributions in DevOps, MLOps, or AI tooling.
- Experience in the Python ecosystem and related ML/DevOps libraries.
- Hands-on expertise with cloud providers such as Google Cloud Platform (GCP) or AWS.
- Experience with GitOps workflows and tools like ArgoCD, Flux, or Terraform.
- Familiarity with AI/ML observability, model monitoring, and real-time inference optimization.

Benefits, time-off, and wellness

An apple a day keeps the doctor away—and it doesn't hurt that we offer flexible time off and great options for medical, dental, and vision plans for all employees. Along with that, employees also receive a monthly stipend to help cover your cell phone bill, home internet bill, and we reimburse for gym membership costs, a variety of wellness events, and more

Dialpad offers reimbursement for expenses related to professional development, up to an annual limit per calendar year.

Culture
We've been named a Top Workplace seven times, and a big part of this is because of our collaborative culture that elevates our teammates, celebrates wins, and brings together passion and talent.

Diversity, Equity, and Inclusion (DEI) at Dialpad

At Dialpad, we are passionate aboutDoing the Right Thing.This means we are committed to building a values-driven culture that celebrates identity, inclusion and belonging. As a global company, it's our responsibility to come together to create a culture where all Dialers canWork Beautifully,Delight Our Users,andInnovate Continuouslyto bring our world-class product to life.

Every Voice Matters at Dialpad. We build community through our Employee Resource Groups, company-wide celebrations, service days, and a robust internal learning & development program focused on the success of our Dialers.

Don't meet every single requirement? Studies have shown that women and marginalized groups are less likely to apply to jobs unless they meet every single qualification. At Dialpad we are dedicated to building an inclusive and authentic workplace, so if you're excited about this role but your past experience doesn't align perfectly with every qualification in the job description, we encourage you to apply anyways. You may be just the right candidate for this or other roles.

Dialpad is an equal-opportunity employer. We are dedicated to creating a community of inclusion and an environment free from discrimination or harassment.

#J-18808-Ljbffr

  • Kitchener, Ontario, Canada Faire Full time

    Faire is a cutting-edge online wholesale marketplace that is revolutionizing the way small businesses operate. Our platform connects independent retailers with suppliers from around the world, helping them discover new products and grow their customer base.As a software engineer on our machine learning team, you will play a critical role in developing and...


  • Kitchener, Ontario, Canada Alert Labs Inc. Full time

    About the RoleThis is a senior software development role within our platform team.You will be responsible for designing and developing backend software projects, collaborating with cross-functional teams, and mentoring junior developers.Our platform uses Node.js/TypeScript on the backend and relies primarily on MongoDB for data storage.We are looking for...


  • Kitchener, Ontario, Canada ApplyBoard Full time

    ApplyBoard simplifies the study abroad search, application, and acceptance process by connecting international students, recruitment partners, and educational institutions on one intuitive and personalized platform. ApplyBoard is a mission-driven, hyper-growth organization. It has been attracting dedicated individuals for more than eight years who are...

  • AI Platform Engineer

    2 weeks ago


    Kitchener, Ontario, Canada Faire Full time

    Faire is an online wholesale marketplace built on the belief that the future is local — independent retailers around the globe are doing more revenue than Walmart and Amazon combined, but individually, they are small compared to these massive entities. At Faire, we're using the power of tech, data, and machine learning to connect this thriving community of...

  • AI Platform Engineer

    3 weeks ago


    Kitchener, Ontario, Canada Faire Full time

    Faire is an online wholesale marketplace built on the belief that the future is local — independent retailers around the globe are doing more revenue than Walmart and Amazon combined, but individually, they are small compared to these massive entities. At Faire, we're using the power of tech, data, and machine learning to connect this thriving community of...


  • Kitchener, Ontario, Canada Faire Full time

    Our platform at Faire is built on a foundation of machine learning and data science, empowering small businesses to succeed in a rapidly changing retail landscape. As a key member of our engineering team, you will contribute to the development and maintenance of our machine learning systems.This includes designing and building highly scalable systems for...


  • Kitchener, Ontario, Canada Faire Full time

    Faire is an online wholesale marketplace that empowers small businesses to succeed in a rapidly changing retail landscape. Our platform connects independent retailers with suppliers from around the world, helping them discover new products and grow their customer base.As a key member of our engineering team, you will play a critical role in developing and...


  • Kitchener, Ontario, Canada Equator Studios Full time

    About the JobWe're seeking an experienced software developer to join our team as a Geospatial Data Platform Developer. As part of this role, you will be responsible for developing and implementing new software features, implementing front-end designs, building and optimizing Large Language Model (LLM) prompts, and writing clean, maintainable code.Develop and...


  • Kitchener, Ontario, Canada Tbwa ChiatDay Inc Full time

    Dialpad is the leading Ai-powered customer communications platform creating human-first, Ai-enhanced solutions that will drive the next wave of how businesses communicate with and serve their customers. Enterprise customers like Randstad, Remax, Mizuho, Cigna, T-Mobile, Johns Hopkins, Motorola, Warby Parker, Panera Bread, and Netflix use Dialpad and its Ai...


  • Kitchener, Ontario, Canada Untether AI Full time

    The early productization team works at the frontiers of AI technologies including areas such as large language model generative AI, autonomous vehicles, and next-generation silicon. We operate at the intersection between hardware and software. Our work helps shape both the hardware and software solutions that underlie Untether AI technology, and we are...


  • Kitchener, Ontario, Canada Tbwa ChiatDay Inc Full time

    About the TeamOur AI Engineering team works centrally alongside Data Science, Telephony, and Product Engineering teams to produce The Good Ai. We're looking for a talented DevOps Engineer to help us deliver functionality essential to powering Dialpad's AI products.Key ResponsibilitiesLeverage a broad skill set ranging from Distributed Systems Engineering,...


  • Kitchener, Ontario, Canada Faire Full time

    Company OverviewFaire is an e-commerce platform that empowers small businesses and entrepreneurs around the world to succeed. Our mission is to create a fair and transparent marketplace where independent retailers can grow their business without the need for massive resources.We use technology and data to level the playing field, connecting brands and...


  • Kitchener, Ontario, Canada Tbwa ChiatDay Inc Full time

    Dialpad is the leading Ai-powered customer communications platform creating human-first, Ai-enhanced solutions that will drive the next wave of how businesses communicate with and serve their customers. Enterprise customers like Randstad, Remax, Mizuho, Cigna, T-Mobile, Johns Hopkins, Motorola, Warby Parker, Panera Bread, and Netflix use Dialpad and its Ai...


  • Kitchener, Ontario, Canada Tbwa ChiatDay Inc Full time

    Job DescriptionWe're looking for a talented AI Developer Experience Engineer to join our team at Dialpad. As a key member of our Ai Engineering group, you'll play a critical role in designing and building the infrastructure, tooling, and workflows that enable engineers and data scientists to develop, deploy, and scale Ai-powered applications.About the...


  • Kitchener, Ontario, Canada Faire Full time

    About FaireFaire is a leading online wholesale marketplace that empowers independent retailers and brands to thrive in a rapidly changing retail landscape. Our mission is to connect this vibrant community of entrepreneurs across the globe, leveraging technology and data to drive growth and success.We're passionate about creating a fairer and more sustainable...


  • Kitchener, Ontario, Canada Miovision Technologies, Inc. Full time

    Job SummaryWe are seeking a skilled software developer to join our team and contribute to the development of transportation data and traffic management solutions.The successful candidate will have experience with C or C++ programming languages, embedded security systems, and distributed host applications that communicate with hardware devices.The role...


  • Kitchener, Ontario, Canada ApplyBoard Full time

    ApplyBoard simplifies the study abroad search, application, and acceptance process by connecting international students, recruitment partners, and educational institutions on one intuitive and personalized platform. ApplyBoard is a mission-driven, hyper-growth organization. It has been attracting dedicated individuals for more than eight years who are...


  • Kitchener, Ontario, Canada Radical Imaging LLC Full time

    Work with known imaging industry leaders...Grow open-source projects in medical imaging...Enable cutting-edge tools to advance healthcare outcomes...When you join Radical Imaging, you'll get to do all of the above and so much moreWho We AreStarted in 2010, Radical Imaging is a growing consulting company that specializes in delivering custom web and cloud...


  • Kitchener, Ontario, Canada Radical Imaging LLC Full time

    Work with known imaging industry leaders...Grow open-source projects in medical imaging...Enable cutting-edge tools to advance healthcare outcomes...When you join Radical Imaging, you'll get to do all of the above and so much moreWho We AreStarted in 2010, Radical Imaging is a growing consulting company that specializes in delivering custom web and cloud...

  • Data Engineer Leader

    7 hours ago


    Kitchener, Ontario, Canada Edjuster Full time

    Edjuster - Building Social-First Ecosystems for BrandsWe specialize in creating innovative, data-driven solutions to help brands connect with modern consumers. Our team is passionate about using technology to drive business growth and customer engagement.Job OverviewThe Senior Data Engineer will lead the design, development, and maintenance of efficient,...