Senior AI Platform Engineer

4 weeks ago

BC Canada Semantic Enterprise AI Full time

Senior AI Platform Engineer About the Role Semantic Enterprise AI (SEAI) builds next‑generation Decision Engine workflows that integrate machine learning, agentic automation, and advanced reasoning tools into enterprise products that empower organizations to make better upside decisions faster. As a Senior AI Platform Engineer, you will architect and build the foundational platform infrastructure that powers SEAI’s Decision Engine at enterprise scale. You’ll drive technical architecture decisions, establish platform patterns and best practices, and build critical systems that enable reliable multi‑agent orchestration for Fortune 1000 clients. This is a senior technical role where you shape the platform’s technical direction, solve complex distributed systems challenges, and build infrastructure that supports client‑critical AI workflows. What You’ll Do Architect and implement highly scalable multi‑agent orchestration platforms that handle thousands of concurrent agent executions with sub‑second latency requirements. Design advanced state management systems for distributed agent coordination, including checkpoint/recovery mechanisms, distributed locking strategies, and event sourcing architectures for full workflow reproducibility. Build sophisticated configuration management systems that enable declarative workflow definitions with versioning, A/B testing capabilities, canary deployments, and automatic rollback mechanisms. Architect zero‑trust security models for agent‑to‑agent communication, including mTLS implementation, service mesh integration, secret rotation systems, and fine‑grained RBAC for multi‑tenant isolation. Design and implement advanced observability platforms specifically for AI workflows, including distributed tracing across agent boundaries, custom metrics for LLM performance, cost attribution systems, and automated anomaly detection. Create sophisticated evaluation frameworks that combine multiple validation strategies (rule‑based, statistical, LLM‑as‑judge) with automatic performance regression detection and workflow reliability scoring. Build intelligent resource optimization systems including predictive scaling for agent workloads, intelligent request routing based on model capabilities, and cost‑aware execution planning for LLM inference. Design fault‑tolerant integration patterns for external services, including circuit breakers, intelligent retry mechanisms with exponential backoff, and graceful degradation strategies when downstream services fail. Architect data pipeline infrastructure for agent context management, including vector database optimization, semantic caching layers, and efficient state hydration for long‑running workflows. Required Qualifications Bachelor’s or Master’s degree in Computer Science, Distributed Systems, or related technical field (or equivalent practical experience). 8+ years of production engineering experience with 5‑7 years specifically focused on platform infrastructure and distributed systems architecture. Expert‑level Python proficiency with deep understanding of async programming, concurrency patterns, and performance optimization at scale. Extensive production experience with modern agent frameworks (LangChain, LlamaIndex, AutoGen, CrewAI) and workflow orchestration systems (Temporal, Cadence, Airflow, Prefect) including custom extensions and performance tuning. Advanced cloud architecture expertise (AWS, GCP, or Azure) including serverless patterns, container orchestration (ECS, GKE, AKS), service mesh implementations, and multi‑region deployment strategies. Deep Infrastructure‑as‑Code expertise with production experience managing complex multi‑environment deployments using Terraform or other tools. Proven track record designing and building enterprise multi‑tenant platforms with production experience in data isolation patterns, tenant resource quotas, cross‑tenant security boundaries, and compliance framework implementation. Expert‑level distributed systems knowledge including consensus algorithms, distributed transactions, event‑driven architectures, and sophisticated service failure management. Production experience with advanced observability stacks including distributed tracing (OpenTelemetry), time‑series databases (Prometheus, InfluxDB), log aggregation at scale, and custom instrumentation for AI/ML workloads. Strong background in platform reliability engineering including SLI/SLO definition, load testing frameworks, and incident response automation. Preferred Qualifications Experience building production LLM infrastructure including prompt caching systems, semantic routing, model gateway design, and inference optimization strategies (batching, quantization, distillation). Deep knowledge of distributed state machines, workflow DAG optimization, dynamic task scheduling, and building domain‑specific languages (DSLs) for workflow definition. Production experience with vector databases (Pinecone, Weaviate, Qdrant) including index optimization, hybrid search strategies, and scaling to billions of embeddings. Background in AI safety and governance including prompt injection detection, output validation frameworks, PII redaction systems, and audit trail implementation for regulatory compliance. Experience with advanced testing strategies for AI systems including property‑based testing, metamorphic testing, adversarial testing, and building synthetic test data generation pipelines. Track record of technical leadership including driving architecture reviews, creating technical RFCs, establishing engineering standards, and mentoring teams on complex technical topics. Contributions to open‑source projects in the agent/LLM/workflow orchestration space, published technical articles, or conference speaking experience. Relevant advanced certifications (AWS Solutions Architect Professional, Google Cloud Professional Architect, CKS, or similar). We value diverse perspectives and encourage all qualified candidates to apply, even if you don’t match every qualification perfectly. *We are currently seeking candidates who are legally authorized to work in the United States or Canada. Preference will be given to applicants located in Washington, Oregon, or British Columbia. We are committed to providing equal employment opportunities and do not discriminate based on race, color, religion, sex, national origin, age, disability, or genetic information.* *Salary Range: Up to $137,000 USD (US) / $190,000 CAD (Canada), depending on experience and location.* #J-18808-Ljbffr

Senior AI Platform Engineer: Scalable Orchestration

4 weeks ago

, BC, Canada Semantic Enterprise AI Full time

A leading AI solutions provider in Canada seeks a Senior AI Platform Engineer to architect and implement a robust foundational platform. You'll drive technical architecture decisions and solve complex challenges while building critical systems. The role requires a minimum of 8 years in production engineering, expert-level Python proficiency, and extensive...
Staff Platform Engineer

4 weeks ago

, BC, Canada Inworld AI Full time

2 days ago Be among the first 25 applicants Direct message the job poster from Inworld AI At Inworld, we believe the processes of building, scaling, and evolving applications are monsters that consume value before it can reach users. Our mission is to solve evolution and transform static software into AI systems that autonomously evolve to better serve their...
Platform Engineer

1 week ago

, BC, Canada Inworld AI Full time

A dynamic AI company is seeking a Platform Engineer to build and scale their AI engine. This role requires collaboration with backend and ML engineers to maintain cloud infrastructure, manage CI/CD pipelines, and enhance operational efficiency. Ideal candidates will have extensive experience in software engineering, Kubernetes, and infrastructure as code....
Senior NodeJS Engineer

2 weeks ago

, BC, Canada Inworld AI Full time

Direct message the job poster from Inworld AI At Inworld, we believe the processes of building, scaling, and evolving applications are monsters that consume value before it can reach users. Our mission is to solve evolution and transform static software into AI systems that autonomously evolve to better serve their users. We are building an intelligent...
Senior AI Platform Engineer

1 week ago

, , Canada NegotiateAI Inc. Full time

Join to apply for the Senior AI Platform Engineer role at NegotiateAI Inc. Get AI-powered advice on this job and more exclusive features. About NegotiateAI NegotiateAI is building an enterprise AI platform to modernize procurement for manufacturing and industrial companies. Procurement today is fragmented, manual, and opaque. We’re changing that by giving...
Senior Software Engineer

3 hours ago

, BC, Canada Stellar AI Full time

Senior Software Engineer Join to apply for the Senior Software Engineer role at Stellar AI . Get AI-powered advice on this job and more exclusive features. This range is provided by Stellar AI. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Base pay range $70.00/hr - $70.00/hr Location Fully remote:...
Senior Frontend Engineer

4 hours ago

, BC, Canada Inworld AI Full time

Senior Frontend (Web) Software Engineer At Inworld, we believe the processes of building, scaling, and evolving applications are monsters that consume value before it can reach users. Our mission is to solve evolution and transform static software into AI systems that autonomously evolve to better serve their users. We are building an intelligent runtime to...
Lead Full-Stack Engineer for AI Platform

2 weeks ago

, , Canada Sequencr AI Full time

A forward-thinking tech company in Canada is seeking a Senior Full Stack Engineer to lead the development of their proprietary AI platform. Your responsibilities will include full-stack application development and cloud infrastructure management. The ideal candidate will have proven experience in both front-end and back-end technologies, including React and...
Staff Platform Engineer

4 hours ago

, BC, Canada Inworld AI Full time

About Inworld At Inworld, we believe that the benefits of AI should extend beyond business workflows to the applications and experiences that we enjoy every day. We began by pushing the frontier of lifelike, interactive characters for games and entertainment, pioneering realtime conversational AI at scale. Today, we apply that expertise to provide the...
Senior Software Development Engineer in Test

4 weeks ago

, BC, Canada Inworld AI Full time

Senior Software Development Engineer in Test (SDET) Senior Software Development Engineer in Test (SDET) This range is provided by Inworld AI. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Base pay range $120,000.00/yr - $160,000.00/yr Direct message the job poster from Inworld AI At Inworld, we...

Americas

Europe

Asia / Oceania

Africa

Senior AI Platform Engineer