Senior AI Platform Engineer
2 days ago
Senior AI Platform Engineer About the Role Semantic Enterprise AI (SEAI) builds next‑generation Decision Engine workflows that integrate machine learning, agentic automation, and advanced reasoning tools into enterprise products that empower organizations to make better upside decisions faster. As a Senior AI Platform Engineer, you will architect and build the foundational platform infrastructure that powers SEAI’s Decision Engine at enterprise scale. You’ll drive technical architecture decisions, establish platform patterns and best practices, and build critical systems that enable reliable multi‑agent orchestration for Fortune 1000 clients. This is a senior technical role where you shape the platform’s technical direction, solve complex distributed systems challenges, and build infrastructure that supports client‑critical AI workflows. What You’ll Do Architect and implement highly scalable multi‑agent orchestration platforms that handle thousands of concurrent agent executions with sub‑second latency requirements. Design advanced state management systems for distributed agent coordination, including checkpoint/recovery mechanisms, distributed locking strategies, and event sourcing architectures for full workflow reproducibility. Build sophisticated configuration management systems that enable declarative workflow definitions with versioning, A/B testing capabilities, canary deployments, and automatic rollback mechanisms. Architect zero‑trust security models for agent‑to‑agent communication, including mTLS implementation, service mesh integration, secret rotation systems, and fine‑grained RBAC for multi‑tenant isolation. Design and implement advanced observability platforms specifically for AI workflows, including distributed tracing across agent boundaries, custom metrics for LLM performance, cost attribution systems, and automated anomaly detection. Create sophisticated evaluation frameworks that combine multiple validation strategies (rule‑based, statistical, LLM‑as‑judge) with automatic performance regression detection and workflow reliability scoring. Build intelligent resource optimization systems including predictive scaling for agent workloads, intelligent request routing based on model capabilities, and cost‑aware execution planning for LLM inference. Design fault‑tolerant integration patterns for external services, including circuit breakers, intelligent retry mechanisms with exponential backoff, and graceful degradation strategies when downstream services fail. Architect data pipeline infrastructure for agent context management, including vector database optimization, semantic caching layers, and efficient state hydration for long‑running workflows. Required Qualifications Bachelor’s or Master’s degree in Computer Science, Distributed Systems, or related technical field (or equivalent practical experience). 8+ years of production engineering experience with 5‑7 years specifically focused on platform infrastructure and distributed systems architecture. Expert‑level Python proficiency with deep understanding of async programming, concurrency patterns, and performance optimization at scale. Extensive production experience with modern agent frameworks (LangChain, LlamaIndex, AutoGen, CrewAI) and workflow orchestration systems (Temporal, Cadence, Airflow, Prefect) including custom extensions and performance tuning. Advanced cloud architecture expertise (AWS, GCP, or Azure) including serverless patterns, container orchestration (ECS, GKE, AKS), service mesh implementations, and multi‑region deployment strategies. Deep Infrastructure‑as‑Code expertise with production experience managing complex multi‑environment deployments using Terraform or other tools. Proven track record designing and building enterprise multi‑tenant platforms with production experience in data isolation patterns, tenant resource quotas, cross‑tenant security boundaries, and compliance framework implementation. Expert‑level distributed systems knowledge including consensus algorithms, distributed transactions, event‑driven architectures, and sophisticated service failure management. Production experience with advanced observability stacks including distributed tracing (OpenTelemetry), time‑series databases (Prometheus, InfluxDB), log aggregation at scale, and custom instrumentation for AI/ML workloads. Strong background in platform reliability engineering including SLI/SLO definition, load testing frameworks, and incident response automation. Preferred Qualifications Experience building production LLM infrastructure including prompt caching systems, semantic routing, model gateway design, and inference optimization strategies (batching, quantization, distillation). Deep knowledge of distributed state machines, workflow DAG optimization, dynamic task scheduling, and building domain‑specific languages (DSLs) for workflow definition. Production experience with vector databases (Pinecone, Weaviate, Qdrant) including index optimization, hybrid search strategies, and scaling to billions of embeddings. Background in AI safety and governance including prompt injection detection, output validation frameworks, PII redaction systems, and audit trail implementation for regulatory compliance. Experience with advanced testing strategies for AI systems including property‑based testing, metamorphic testing, adversarial testing, and building synthetic test data generation pipelines. Track record of technical leadership including driving architecture reviews, creating technical RFCs, establishing engineering standards, and mentoring teams on complex technical topics. Contributions to open‑source projects in the agent/LLM/workflow orchestration space, published technical articles, or conference speaking experience. Relevant advanced certifications (AWS Solutions Architect Professional, Google Cloud Professional Architect, CKS, or similar). We value diverse perspectives and encourage all qualified candidates to apply, even if you don’t match every qualification perfectly. *We are currently seeking candidates who are legally authorized to work in the United States or Canada. Preference will be given to applicants located in Washington, Oregon, or British Columbia. We are committed to providing equal employment opportunities and do not discriminate based on race, color, religion, sex, national origin, age, disability, or genetic information.* *Salary Range: Up to $137,000 USD (US) / $190,000 CAD (Canada), depending on experience and location.* #J-18808-Ljbffr
-
Senior AI Platform Engineer
3 weeks ago
, BC, Canada Semantic Enterprise AI Full timeEmployee Experience Advocate | Shaping Positive Work Cultures | Trusted HR Advisor About the Role Semantic Enterprise AI (SEAI) builds next-generation Decision Engine workflows that integrate machine learning, agentic automation, and advanced reasoning tools into enterprise products that empower organizations to make better upside decisions faster. As a...
-
, BC, Canada Semantic Enterprise AI Full timeA leading AI solutions provider in Canada seeks a Senior AI Platform Engineer to architect and implement a robust foundational platform. You'll drive technical architecture decisions and solve complex challenges while building critical systems. The role requires a minimum of 8 years in production engineering, expert-level Python proficiency, and extensive...
-
AI Platform Engineer
3 weeks ago
, BC, Canada Semantic Enterprise AI Full timeDirect message the job poster from Semantic Enterprise AI Employee Experience Advocate | Shaping Positive Work Cultures | Trusted HR Advisor About the Role Semantic Enterprise AI (SEAI) builds next-generation Decision Engine workflows that integrate machine learning, agentic automation, and advanced reasoning tools into enterprise products that empower...
-
, BC, Canada Semantic Enterprise AI Full timeA leading AI technology firm in British Columbia is seeking a Senior AI Platform Engineer to lead the technical architecture of their AI Decision Engine. The role involves building scalable systems, designing security models, and tackling complex distributed challenges. The ideal candidate has over 8 years of experience in platform engineering and expertise...
-
Staff Platform Engineer
4 days ago
, BC, Canada Inworld AI Full time2 days ago Be among the first 25 applicants Direct message the job poster from Inworld AI At Inworld, we believe the processes of building, scaling, and evolving applications are monsters that consume value before it can reach users. Our mission is to solve evolution and transform static software into AI systems that autonomously evolve to better serve their...
-
, BC, Canada Semantic Enterprise AI Full timeA technology solutions provider in British Columbia is seeking an experienced AI Platform Engineer to architect and build their platform infrastructure. The role involves managing agent frameworks, cloud architecture, and ensuring reliability at scale. Ideal candidates will have 5-7 years of experience in distributed systems and expertise in Python....
-
Senior Platform Engineer
3 weeks ago
, , Canada ExaCare AI Full timeWe are a trailblazing health tech company on a mission to revolutionize the nursing home & post‑acute space. Our innovative AI software is transforming the admissions process and care delivery in these settings. We’ve raised $10.35M to date and are experiencing rapid growth. We are looking for a Senior Platform Engineer to join our growing team. About...
-
Senior NodeJS Engineer
3 weeks ago
, BC, Canada Inworld AI Full timeDirect message the job poster from Inworld AI At Inworld, we believe the processes of building, scaling, and evolving applications are monsters that consume value before it can reach users. Our mission is to solve evolution and transform static software into AI systems that autonomously evolve to better serve their users. We are building an intelligent...
-
Senior Software Engineer
2 weeks ago
, BC, Canada Stellar AI Full timeSenior Software Engineer Join to apply for the Senior Software Engineer role at Stellar AI . Get AI-powered advice on this job and more exclusive features. This range is provided by Stellar AI. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Base pay range $70.00/hr - $70.00/hr Location Fully remote:...
-
Senior Frontend Engineer
2 weeks ago
, BC, Canada Inworld AI Full timeSenior Frontend (Web) Software Engineer At Inworld, we believe the processes of building, scaling, and evolving applications are monsters that consume value before it can reach users. Our mission is to solve evolution and transform static software into AI systems that autonomously evolve to better serve their users. We are building an intelligent runtime to...