Platform Engineer
2 days ago
Job Title: Platform Engineer (Cloud Reliability Engineer)Reports to: Director, Global OperationsBased in: Ottawa, ONTerm:Full TimeAbout Nanometrics:With 40 years of seismic technology and industry application experience, we are a global, award-winning company providing monitoring solutions and equipment for studying artificial and natural seismicity. From mission-critical seismic arrays, tsunami and early earthquake warning systems in over 90 countries across the globe to induce seismicity monitoring in the energy sector. We specialize in full-service, integrated solutions for studying artificial and natural seismicity, including turnkey seismic networks, industry-leading precision instrumentation, complete data processing, analysis services, and software applications. At Nanometrics, we take pride in fostering a culture of innovation, collaboration, and excellence. We are passionate about making a global impact through cutting-edge technology while staying rooted in values of intentional innovation, trust, ethics, and stability.About the role: This is an exciting opportunity for a motivated and experienced Platform Engineer to evolve, enhance and lead the technological footprint of our Seismic Monitoring Services portfolio. Nanometrics provides a top tier portfolio of tools and services which is supported by a continuously evolving cloud based platform.The Platform Engineer / Cloud Reliability Engineer ensures the reliability, performance, and operational excellence of cloud-hosted seismic monitoring and data processing services. This role blends software engineering, cloud infrastructure management, and SRE practices to build resilient systems, reduce manual toil through automation, and improve observability across AWS and Kubernetes ecosystems.The successful candidate will use Terraform or similar Infrastructure-as-Code technologies (Pulumi, AWS CDK, CloudFormation, OpenTofu) to deliver consistent, automated, scalable infrastructure.Responsibilities:Cloud Reliability & ResilienceEnsure uptime, performance, and reliability of AWS-hosted services and Kubernetes workloadsImplement self-healing patterns, automated rollbacks, health checks, and safe-deployment strategiesParticipate in on-call rotation and lead first-response triage for cloud and platform incidentsBuild and maintain service-level indicators (SLIs) and service-level objectives (SLOs)Automation & Infrastructure EngineeringDevelop automation for cloud operations using Python, Bash, and IaC (Terraform)Reduce operational toil through automated runbooks, event-driven remediation, and system orchestrationImprove deployment reliability in collaboration with Platform Engineering and R&D teamsImplement and refine configuration standards, CI/CD hygiene, and environment stabilityObservability & Operational IntelligenceMaintain and extend observability stack (Prometheus, Grafana, InfluxDB, OpenTelemetry)Tune alerts for accuracy, reduce noise, and implement actionable alerting tied to SLOsAnalyze logs, metrics, and traces to detect reliability issues and validate system behaviorBuild dashboards that provide real-time visibility into system health and reliability trendsOperational ExcellenceSupport release processes, platform upgrades, and cloud infrastructure changesConduct root-cause analysis and drive post-incident corrective actionsMaintain operational documentation, runbooks, and environment validation workflowsCollaborate cross-functionally with NetOps, Platform Engineering, Field Ops, and R&DRequirements:Education and ExperienceBachelor's degree or higher in Software Engineering, Computer Science, or related field.7+ years experience in software development3+ years hands-experience working with cloud providers like AWS, etc and cloud-native technologies like Kubernetes, Helm, etc. and related technologies including observability platforms.Experience with database operations (MySQL, PostgreSQL, MongoDB, Redis) in cloud and on-prem environments.Cloud & InfrastructureStrong experience with AWS (EC2, S3, IAM, VPC, EKS/ECS, CloudWatch)Solid understanding of Kubernetes, Helm charts, and container orchestrationFamiliarity with hybrid cloud environments (cloud + on-prem integration)Infrastructure as Code & AutomationHands-on experience with TerraformScripting skills in Python and BashAbility to build automated workflows and cloud operations toolingCI/CD & Deployment EngineeringExperience with deployment pipelines (Jenkins, Bitbucket Pipelines, ArgoCD)Familiarity with GitOps workflowsUnderstanding of build systems (Maven, Gradle)Monitoring & ObservabilityExperience with monitoring/metrics/logging tools such as Prometheus, Grafana, InfluxDBFamiliarity with OpenTelemetry for distributed tracingAbility to diagnose performance issues in distributed systemsReliability Engineering ConceptsKnowledge of SLOs/SLIs/error budgetsIncident management principlesUnderstanding of resilience patterns (retry, circuit breakers, autoscaling, etc.)Why Nanometrics? We are a global leader in seismic solutions and a Canada's Best Managed Companies Platinum member. We value sustainable growth that benefits our employees, our community, and the environment. Maximize your productivity with our flexible hybrid work model. Our centrally located office space offers a stimulating environment for collaboration and focused work. Plus, enjoy a convenient commute with easy access to biking paths and public transportation.Engage in virtual and onsite social events centered around collaboration, learning, and fun, including volunteer events, celebrations, and team-building activities. Our comprehensive group benefits program includes RRSP matching, health/dental benefits, a corporate bonus program, education assistance, and a health spending account. Our Employee Assistance Program (EAP) provides services and support for health, work-life solutions, legal guidance, financial resources, wellness tools, and more.Enjoy a competitive leave program, including a holiday shutdown (December 25 to January 1). Grow your career with learning and development opportunities. Collaborate with high-performing teams and some of the industry's top minds.
-
Senior Platform Engineer
2 weeks ago
Ottawa, Canada Facilisgroup Full timeJob Description Job Description Senior Platform Engineer - Product Infrastructure Facilisgroup is a leading technology provider in the Promotional Products industry. We build software-as-a-service solutions that help promotional products distributors become more efficient and grow their sales. Over $1 billion in sales are processed through Facilisgroup's...
-
Senior Platform Engineer
2 weeks ago
Ottawa, Canada Facilisgroup Full timeJob Description Job Description Senior Platform Engineer - Product Infrastructure Facilisgroup is a leading technology provider in the Promotional Products industry. We build software-as-a-service solutions that help promotional products distributors become more efficient and grow their sales. Over $1 billion in sales are processed through Facilisgroup's...
-
Platform Engineer
1 week ago
Ottawa, Canada Nanometrics Inc. Full timePlatform Engineer (Cloud Reliability Engineer) Reports to: Director, Global Operations Based in Ottawa, ON Term: Full Time About Nanometrics With 40 years of seismic technology and industry application experience, we are a global, award‑winning company providing monitoring solutions and equipment for studying artificial and natural seismicity. From...
-
Platform Engineer
7 days ago
Ottawa, Canada Nanometrics Inc. Full timePlatform Engineer (Cloud Reliability Engineer) Reports to: Director, Global Operations Based in Ottawa, ON Term: Full Time About Nanometrics With 40 years of seismic technology and industry application experience, we are a global, award‑winning company providing monitoring solutions and equipment for studying artificial and natural seismicity. From...
-
Platform Engineer
5 days ago
Ottawa, Canada Nanometrics Inc. Full timePlatform Engineer (Cloud Reliability Engineer) Reports to: Director, Global Operations Based in Ottawa, ON Term: Full Time About Nanometrics With 40 years of seismic technology and industry application experience, we are a global, award‑winning company providing monitoring solutions and equipment for studying artificial and natural seismicity. From...
-
Site Reliability Engineer
5 days ago
Ottawa, Canada Targeted Talent Full timeJob Description We are looking for an experienced Site Reliability Engineer or Platform Operations Engineer for our client. This is a permanent position that is remote to start with later relocation to Calgary or Winnipeg. Our client is a global enterprise company with a product that you've likely used. You Will: Own development projects, providing technical...
-
Site Reliability Engineer
5 days ago
Ottawa, Canada Targeted Talent Full timeJob Description We are looking for an experienced Site Reliability Engineer or Platform Operations Engineer for our client. This is a permanent position that is remote to start with later relocation to Calgary or Winnipeg . Our client is a global enterprise company with a product that you've likely used. You Will: Own development projects, providing...
-
Engineering Manager
5 days ago
Ottawa, Canada Canonical Full timeJoin to apply for the Engineering Manager - Data Platform role at Canonical 1 month ago Be among the first 25 applicants Join to apply for the Engineering Manager - Data Platform role at Canonical Get AI-powered advice on this job and more exclusive features. Canonical is building a comprehensive suite of multi-cloud and on-premise data solutions for the...
-
Senior Engineer
2 weeks ago
Ottawa, Canada Wind River Full timeJoin to apply for the Senior Engineer - Cloud Platform role at Wind River. Job Title: Senior Engineer – Wind River Conductor. About the Opportunity Wind River Systems is building Wind River Studio for Operators, delivering an integrated cloud platform, unifying infrastructure, orchestration, and analytics capabilities so operators can deploy and manage...
-
Senior Engineer
4 weeks ago
Ottawa, Canada Wind River Full timeJoin to apply for the Senior Engineer - Cloud Platform role at Wind River . Job Title: Senior Engineer – Wind River Conductor. About the Opportunity Wind River Systems is building Wind River Studio for Operators, delivering an integrated cloud platform, unifying infrastructure, orchestration, and analytics capabilities so operators can deploy and manage...