Senior DevOps Engineer
2 weeks ago
TextLayer helps enterprises and funded startups deploy advanced AI systems without rewriting their infrastructure. We work with organizations across fintech, healthtech, and other sectors to bridge the gap between AI potential and practical implementation.
Our approach combines deep technical expertise with proven frameworks like TextLayer Core to accelerate development and ensure production-ready results. From bespoke AI workflows to agentic systems, we help clients adopt AI that actually works in their existing tech stacks.
We're on a mission to help address the implementation gap that over 85% of enterprise clients experience in adding AI to their operations and products. We're looking for sharp, curious people who want to meaningfully shape how we build, operate, and deliver.
If you're excited to work on foundational AI infrastructure, solve complex problems for diverse clients, and help define what agentic software looks like in practice, we'd love to meet you.
The Role
The Senior DevOps Engineer will architect production-grade monitoring, logging, and tracing systems designed specifically for AI workloads; implement OpenTelemetry-based data collection pipelines; build robust deployment workflows using Infrastructure as Code (IaC); and create resilient observability solutions that provide deep insight into LLM applications and conversational AI systems. Observability into AI systems is critical to building and maintaining the infrastructure that powers our client engagements, and it plays a key role in an upcoming product launch.
Key Responsibilities
- Design and maintain OpenTelemetry-based observability infrastructure for distributed AI systems and LLM applications
- Build and scale ELK stack deployments (Elasticsearch, Logstash, Kibana) for log aggregation, search, and visualization of AI application data
- Implement comprehensive tracing and monitoring solutions for LLM inference, RAG pipelines, and AI Agent workflows
- Develop and maintain data ingestion pipelines for processing high-volume telemetry data from AI applications
- Configure and optimize OpenSearch clusters for real-time analytics and trace reconstruction of conversational flows
- Deploy and manage LLM observability platforms like Langfuse, OpenLLMetry, and custom monitoring solutions
- Implement Infrastructure as Code (Terraform, CloudFormation) for reproducible observability and application stack deployments
- Build automated alerting and incident response systems for AI application performance and reliability
- Collaborate with engineering teams to instrument AI applications with proper telemetry and observability hooks
- Optimize data retention policies, indexing strategies, and query performance for large-scale observability data
What You Will Bring
To succeed in this role, you'll need deep expertise in observability infrastructure, hands-on experience with OpenTelemetry and the ELK stack, and a strong understanding of AI/ML system monitoring challenges. You should be passionate about building scalable, reliable infrastructure that provides actionable insights into complex AI workloads.
Required Qualifications
- 4+ years of DevOps/Infrastructure engineering experience with a focus on observability and monitoring
- Expert-level experience with OpenTelemetry implementation, configuration, and custom instrumentation
- Production experience with the ELK stack (Elasticsearch, Logstash, Kibana), including cluster management and optimization
- Strong knowledge of distributed tracing, metrics collection, and log aggregation architectures
- Experience with container orchestration (Kubernetes, Docker) and cloud infrastructure (AWS/GCP/Azure)
- Proficiency with Infrastructure as Code tools (Terraform, Ansible, CloudFormation)
- Experience building high-throughput data ingestion pipelines and real-time analytics systems
- Strong scripting skills (Python, Bash/Sh) for automation and tooling
- Knowledge of observability best practices, SLI/SLO definitions, and incident response
- Experience with monitoring tools like Prometheus, Grafana, or Datadog
Bonus Points
- Experience with LLMOps observability tools (Langfuse, LiteLLM, Weights & Biases, Phoenix, Braintrust)
- Experience with Golang (Go), Rust, or C/C++
- Knowledge of AI/ML system monitoring patterns and LLM application telemetry
- Experience with OpenSearch and ClickHouse for analytics workloads
- Familiarity with conversational AI analytics and trace reconstruction techniques
- Experience instrumenting LLM applications, RAG systems, or AI Agent workflows
- Background in time-series databases and vector search optimization
- Contributions to open-source observability or LLMOps projects
- Knowledge of eval-driven development and automated AI system testing frameworks
Compensation Range: CA$200K - CA$220K
-
Senior DevOps Engineer
3 days ago
Ottawa, Ontario, Canada Global Talent Alliance, Canada Full time $120,000 - $180,000 per year
Job #24-301 G-TAC Employer partner is looking for a Senior DevOps Engineer to promote automation efforts and manage infrastructure of their CloudGen Access product. You will be working with the product development team to create automation procedures that can make their product more robust and error-free while supporting a continuous delivery...
-
Senior DevOps Engineer
7 days ago
Ottawa, Ontario, Canada LogicsT Technologies Full time $120,000 - $180,000 per year
Job Description
Primary Responsibilities
Design, implement, and manage scalable, secure, and highly available cloud infrastructure using Azure services. Monitor, optimize, and troubleshoot cloud resources and performance. Work with Azure compute, storage, database, and networking services including Azure VMs, AKS, Azure SQL, Blob Storage, VNETs, VPNs, and...
-
Senior DevOps Engineer
6 days ago
Ottawa, Ontario, Canada Ciena Full time $102,000 - $164,200 per year
As the global leader in high-speed connectivity, Ciena is committed to a people-first approach. Our teams enjoy a culture focused on prioritizing a flexible work environment that empowers individual growth, well-being, and belonging. We're a technology company that leads with our humanity, driving our business priorities alongside meaningful social,...
-
Senior DevOps Engineer
1 week ago
Ottawa, Ontario, Canada Ciena Full time $102,800 - $164,200
As the global leader in high-speed connectivity, Ciena is committed to a people-first approach. Our teams enjoy a culture focused on prioritizing a flexible work environment that empowers individual growth, well-being, and belonging. We're a technology company that leads with our humanity, driving our business priorities alongside meaningful social,...
-
DevOps Engineer
3 days ago
Ottawa, Ontario, Canada Tulloch Full time $80,000 - $120,000 per year
Come Join Us
"We want to build an organization where everyone loves their job and their leaders care for them."
Over the last 30 years, TULLOCH has built a robust multi-disciplinary consulting engineering firm recognized Canada-wide for its strengths in the diverse service offerings and commitment to excellence. TULLOCH's innovative use of emerging...
-
Senior DevOps Engineer
2 weeks ago
Ottawa, Ontario, Canada TextLayer Full time $120,000 - $140,000 per year
About TextLayer
TextLayer helps enterprises and funded startups deploy advanced AI systems without rewriting their infrastructure. We work with organizations across fintech, healthtech, and other sectors to bridge the gap between AI potential and practical implementation. Our approach combines deep technical expertise with proven frameworks like TextLayer Core...
-
DevOps Engineer
7 days ago
Ottawa, Ontario, Canada Fujitsu Full time $80,000 - $140,000 per year
Fujitsu Canada
Fujitsu Canada is seeking a full-time, permanent DevOps Engineer to join our innovative team supporting enterprise-scale Knowledge Management (KM) modernization projects. This role is ideal for a technical professional with hands-on experience in secure, scalable cloud solutions leveraging Azure, DevSecOps, and containerization. Top primary...
-
DevOps Engineer
1 week ago
Ottawa, Ontario, Canada Fujitsu Full time $80,000 - $120,000 per year
Description
Fujitsu Canada
Fujitsu Canada is seeking a full-time, permanent DevOps Engineer to join our innovative team supporting enterprise-scale Knowledge Management (KM) modernization projects. This role is ideal for a technical professional with hands-on experience in secure, scalable cloud solutions leveraging Azure, DevSecOps, and containerization. Top...
-
DevOps Engineer
7 days ago
Ottawa, Ontario, Canada Fujitsu Full time $80,000 - $120,000 per year
Description
Fujitsu Canada
Fujitsu Canada is seeking a full-time, permanent DevOps Engineer to join our innovative team supporting enterprise-scale Knowledge Management (KM) modernization projects. This role is ideal for a technical professional with hands-on experience in secure, scalable cloud solutions leveraging Azure, DevSecOps, and containerization. Top...
-
Senior DevOps Migration Engineer
1 week ago
Ottawa, Ontario, Canada Sopra Steria Full time $120,000 - $180,000 per year
Company Description
Sopra Steria is a European leader in consulting, digital services, and software development, supporting its clients in their digital transformation through innovative and collaborative solutions. With 50,000 employees in nearly 30 countries and a revenue of €5.1 billion in 2022, we are committed to achieving sustainable results and...