Senior Cloud Infrastructure
2 weeks ago
CanCap Group Inc. is part of a privately owned Canadian national financial services company with multiple verticals across automotive, consumer, and merchant lending portfolios. We manage the entire lifecycle of the finance receivable, from credit adjudication through to contract administration, customer service, default management, and post charge-off recoveries.

We are a company of innovators: we learn from each other, respect each other, and create together. When it comes to our customers, partners, and each other, we are always motivated by doing the “right thing”. We are always looking for the best people and the right methods to meet this goal, and we look to the future for growth.

What Your Day and Week Could Look Like

Reporting to the Head of Data Platform, the Senior Cloud Infrastructure - Data Platform will be responsible for designing, implementing, and maintaining our Google Cloud-based Databricks environment to support the company’s AI, data analytics, and business intelligence needs. A key focus of this role is managing Databricks permissions, user access, security policies, and workspace configurations to ensure a well-governed, scalable, and secure data environment deployed on GCP. You will ensure the Databricks infrastructure is properly maintained to support business-critical data operations and our ETL pipelines built using Data Build Tool (DBT). The ideal candidate will handle the deployment, configuration, and administration of cloud-native platforms and AI tool integrations, ensuring secure, scalable, and efficient environments for data and analytics workloads.

Key Responsibilities
• Provision and manage Databricks workspaces across development, test, and production environments.
• Set up and configure GCP services, including networking, IAM, and Cloud Storage.
• Implement and maintain IAM policies and role-based access controls across GCP and Databricks environments.
• Support applications and projects with infrastructure design decisions and monitoring solutions.
• Perform complex application programming activities, including coding, testing, debugging, documenting, maintaining, and modifying complex applications.
• Collaborate with data engineering and security teams to ensure compliance with cloud security best practices and organizational standards.
• Contribute to building AI/ML infrastructure, including feature stores, model training pipelines, and model deployment frameworks.
• Build automation around infrastructure provisioning, CI/CD pipelines, observability, and cost management.
• Ensure platform security, availability, and compliance through robust access control, auditing, and data governance practices.
• Monitor platform usage, optimize performance, and ensure high availability and reliability of cloud environments.
• Automate platform operations using Infrastructure as Code (IaC) tools like Terraform.
• Conduct security reviews, manage secrets, and monitor audit logs for anomalous activity.
• Assist in incident response and troubleshooting across platform layers.
• Maintain documentation related to platform architecture, procedures, standards, configurations, permissions structures, and access control policies.
• Develop and enforce best practices for Databricks security, governance, and resource optimization.
• Design and implement scalable Databricks workspaces and clusters, optimizing cost efficiency and performance.
• Participate in design reviews with peers and stakeholders to determine the best Databricks configurations and integrations.
• Monitor and optimize Databricks notebooks, jobs, and workflows to enhance efficiency and reliability.
• Maintain and contribute to documentation and best practices for Databricks environments.
• Troubleshoot and resolve Databricks platform issues, identifying root causes of system performance bottlenecks.
• Create and review technical design documents, understand how the design will be used in the code development process, and facilitate meetings to design, troubleshoot, and execute projects.

What You Bring
• 6+ years of experience in platform administration, engineering, or cloud roles.
• Strong hands-on experience with Databricks setup, provisioning, and workspace administration.
• Solid understanding of Google Cloud Platform (GCP) services and architecture.
• Proven experience in managing IAM (preferably in GCP and Databricks).
• Strong knowledge of cloud security principles, encryption, and compliance standards.
• Expertise in Terraform, CI/CD pipelines, and DevOps practices.
• Proficiency in scripting languages like Python, Shell, or Bash.
• Excellent problem-solving, communication, and documentation skills.
• Foundational knowledge of common ETL tools.
• Strong expertise in Databricks or other data lakehouses.
• Experience managing user access, permissions, security settings, and governance policies within Databricks.
• Experience working with large-scale data processing, data structures, and algorithms.
• Strong platform engineering experience and experience implementing scalable distributed systems.
• Experience building and maintaining DevOps pipelines with tools such as Jenkins and GitHub Actions.
• Experience with MLOps orchestration tools such as Airflow, Kubeflow, Dagster, Flyte, or Metaflow.
• Experience implementing monitoring solutions to identify system bottlenecks and production issues.
• Hands-on experience building and deploying hybrid environments spanning on-prem and major cloud environments, such as GCP and AWS.
• Strong experience implementing Infrastructure as Code in Terraform, CloudFormation, or Google Cloud Deployment Manager templates.
• Experience working with Databricks, especially GCP Databricks.

Preferred Qualifications
• Databricks certification.
• Experience with multi-cloud environments.
• Exposure to data engineering pipelines and ML platform management.
• Understanding of networking concepts (VPC, firewall rules, peering, etc.) in GCP.
• Hands-on experience in MLOps, DataOps, or platform SRE practices.
• Knowledge of data governance, lineage, and privacy frameworks (e.g., GDPR, HIPAA).

Nice to Have
• Master's degree in Computer Science or a related technical field.
• Proficiency in performance tuning, large-scale data analysis, and debugging.

What You Can Expect From Us

Our Employee Experience is aimed at supporting and inspiring our talented team through:
• A passionate team dedicated to supporting and empowering others.
• An environment where creative, innovative thinking is encouraged.
• Health and dental benefits.

Work Location & Remote Flexibility

This role follows a hybrid model, requiring employees to work 50% in-office, with flexibility to work remotely or from the office on other days. The company has two office locations:
• Downtown Toronto (Church Street): the tech team is primarily based here.
• Mississauga: another office location, but less frequently used by the tech team.

CanCap is an equal opportunity employer and values diversity. We are committed to building and evolving a team reflecting a variety of backgrounds, perspectives, and skills. To be considered for employment, you will need to successfully pass a criminal background check and validate your work experience.
-
Canada QuickNode Full time
A leading cloud-based infrastructure company is seeking a Senior Infrastructure Engineer to architect and implement next-generation infrastructure platforms. This remote role focuses on building robust systems using automation and ensuring performance and reliability while managing hybrid cloud environments. The ideal candidate has a minimum of 5 years’...
-
Cloud Infrastructure
3 weeks ago
BC, Canada Thrive Health Full time
At Thrive Health, we’re on a mission to make healthcare work better for everyone. Our digital care coordination platform connects people, data, and care across the entire health journey, empowering individuals and health professionals alike. As an AI-first company, we’re building products that place people at the centre of care by enhancing care...
-
NU, Canada Donna Cona Inc. Full time
Overview
Reference #: 8081
Location: Nunavut (Remote)
Type: Sub-contract
Donna Cona Inc. is currently seeking a Senior Solution Architect – Cloud, Applications & Infrastructure, for one of our key government clients. The Solution Architect – Cloud, Applications & Infrastructure will lead the design and delivery of secure, scalable cloud-based solutions —...
-
Senior Cloud Infrastructure Engineer
4 days ago
Toronto, Ontario, MR E, Canada Signal 1 Full time
We're looking for a Senior Cloud Infrastructure Engineer to build and maintain our platform on Azure. You'll play an essential role in making sure our healthcare AI solutions work reliably and securely.
What You'll Do:
• Design and implement stable Azure-based infrastructure for our services
• Create automation tools for deployment and system monitoring
•...
-
Cloud Infrastructure Engineer
2 weeks ago
Canada Inovatec Systems Corporation Full time
About Inovatec:
Inovatec is an exciting growth company based in Vancouver, BC, established in 2006. We are North America's leading provider of cloud-based software solutions for the automotive, motorcycle, powersports, and equipment financing industries. Our solutions are used by some of the largest banks, credit unions, and finance companies in Canada and...
-
Cloud Infrastructure Architect
2 weeks ago
Canada Toparo Full time
We are looking for a highly skilled Azure Infrastructure Architect with a minimum of 5 years of experience in architecting and designing complex cloud infrastructure solutions on Microsoft Azure. The ideal candidate will possess deep expertise in Azure services and provide guidance on best practices for implementing and managing cloud-based solutions....
-
Remote Senior Cloud Cost
3 weeks ago
PE, Canada Affirm Full time
A financial technology company is seeking a Senior Manager for Software Engineering in Cloud Cost Management. This remote role requires leading a team to develop cost management frameworks, collaborate on financial targets, and implement strategies for managing substantial infrastructure costs. The ideal candidate will have over 10 years of experience in...
-
Senior Cloud Security Developer
1 week ago
Canada Coveo Full time
Design threat detection at cloud scale. ⚙️ At Coveo, we’re building advanced security engineering capabilities to protect our people, platforms, and customers. As a Senior Cloud Threat Detection Developer, you will design and implement detection strategies deeply integrated into our...
-
Senior Cloud Solution Architect
3 weeks ago
Canada Nesto Cloud Full time
A leading provider of mortgage technology is seeking an experienced Solution Architect to drive innovation within their cloud-based mortgage platform. This fully remote position welcomes candidates nationwide, promoting a culture of diversity and high performance. The ideal candidate will have extensive software engineering and architectural experience,...
-
Senior Software Engineer, Engineering
1 week ago
Canada Spectro Cloud Full time
Who We Are
Spectro Cloud aims to make infrastructure boundaryless for the enterprise, from data center to edge and every platform in between. We provide solutions that help enterprises run applications on Kubernetes, their way, anywhere. Established by a team of multi-cloud management experts and industry veterans with a track record of success, we're at the...