DataBricks Data Architect
5 hours ago
Job Summary
The Databricks Data Architect is a senior technical leader responsible for building and optimizing a robust data platform in a financial services environment. In this full-time role, you will lead a team of 10+ data engineers and own the end-to-end architecture and implementation of the Databricks Lakehouse platform. You will collaborate closely with application development and analytics teams to design scalable data solutions that drive business insights. This position demands deep expertise in Databricks (Azure), hands-on experience with PySpark and Delta Lake, and strong leadership to ensure best practices in data engineering, performance tuning, and governance.
Key Responsibilities
- Own the Databricks platform architecture and implementation, ensuring the environment is secure, scalable, and optimized for the organizations data processing needs. Design and oversee the Lakehouse architecture leveraging Delta Lake and Apache Spark.
- Implement and manage Databricks Unity Catalog for unified data governance. Ensure fine-grained access controls and data lineage tracking are in place to secure sensitive financial data and comply with industry regulations.
- Provision and administer Databricks clusters (in Azure), including configuring cluster sizes, auto-scaling, and auto-termination settings. Set up and enforce cluster policies to standardize configurations, optimize resource usage, and control costs across different teams and projects.
- Collaborate with analytics teams to develop and optimize Databricks SQL queries and dashboards. Tune SQL workloads and caching strategies for faster performance and ensure efficient use of the query engine.
- Lead performance tuning initiatives for Spark jobs and ETL pipelines. Profile data processing code (PySpark/Scala) to identify bottlenecks and refactor for improved throughput and lower latency. Implement best practices for incremental data processing with Delta Lake, and ensure compute cost efficiency (e.g., by optimizing cluster utilization and job scheduling).
- Work closely with application developers, data analysts, and data scientists to understand requirements and translate them into robust data pipelines and solutions. Ensure that data architectures support analytics, reporting, and machine learning use cases effectively.
- Integrate Databricks workflows into the CI/CD pipeline using Azure DevOps and Git. Develop automated deployment processes for notebooks, jobs, and clusters (infrastructure-as-code) to promote consistent releases. Manage source control for Databricks code (using Git integration) and collaborate with DevOps engineers to implement continuous integration and delivery for data projects.
- Collaborate with security and compliance teams to uphold data governance standards. Implement data masking, encryption, and audit logging as needed, leveraging Unity Catalog and Azure security features to protect sensitive financial data.
- Stay up-to-date with the latest Databricks features and industry best practices. Proactively recommend and implement improvements (such as new performance optimization techniques or cost-saving configurations) to continuously enhance the platforms reliability and efficiency.
- Bachelors degree in Computer Science, Information Systems, or a related field
- 7+ years of experience in data engineering, data architecture, or related roles, with a track record of designing and deploying data pipelines and platforms at scale.
- Significant hands-on experience with Databricks (preferably Azure Databricks) and the Apache Spark ecosystem. Proficient in building data pipelines using PySpark/Scala and managing data in Delta Lake format.
- Strong experience working with cloud data platforms (Azure preferred, or AWS/GCP). Familiarity with Azure data services (such as Azure Data Lake Storage, Azure Blob Storage, etc.) and managing resources in an Azure environment.
- Advanced SQL skills with the ability to write and optimize complex queries. Solid understanding of data warehousing concepts and performance tuning for SQL engines.
- Proven ability to optimize ETL jobs and Spark processes for performance and cost efficiency. Experience tuning cluster configurations, parallelism, and caching to improve job runtimes and resource utilization.
- Demonstrated experience implementing data security and governance measures. Comfortable configuring Unity Catalog or similar data catalog tools to manage schemas, tables, and fine-grained access controls. Able to ensure compliance with data security standards and manage user/group access to data assets.
- Experience leading and mentoring engineering teams. Excellent project leadership abilities to coordinate multiple projects and priorities. Strong communication skills to effectively collaborate with cross-functional teams and present architectural plans or results to stakeholders.
Preferred
- Databricks Certified Data Engineer Professional or Databricks Certified Data Engineer Associate. Equivalent certifications in cloud data engineering or architecture (e.g., Azure Data Engineer, Azure Solutions Architect)
- Exposure to related big data and streaming tools such as Apache Kafka/Event Hubs, Apache Airflow or Azure Data Factory for orchestration, and BI/analytics tools (e.g., Power BI) is advantageous.
- Experience implementing CI/CD pipelines for data projects. Familiarity with Databricks Repos, Jenkins, or other CI tools for automated testing and deployment of data pipelines.
Tools & Technologies
- Databricks Lakehouse Platform: Databricks Workspace, Apache Spark, Delta Lake, Databricks SQL, MLflow (for model tracking).
- Data Governance: Databricks Unity Catalog for data cataloging and access control; Azure Active Directory integration for identity management.
- Programming & Data Processing: PySpark and Python for building data pipelines and Spark Jobs; SQL for querying and analytics;
- Cloud Services (Azure-focused): Azure Databricks, Azure Data Lake Storage (ADLS Gen2), Azure Blob Storage, Azure Synapse or SQL Database, Azure Key Vault (for secrets).
- DevOps & CI/CD: Azure DevOps (Azure Pipelines) for build/release pipelines, Git for version control (GitHub or Azure Repos); experience with Terraform or ARM templates for infrastructure-as-code is a plus.
- Other Tools: Project and workflow management tools (JIRA or Azure Boards), monitoring tools (Azure Log Analytics, Spark UI or Databricks performance monitoring), and collaboration tools for documentation and design (Figma, Visio, Lucidcharts etc.).
-
Specialist Solutions Architect
1 week ago
Toronto, Ontario, Canada Databricks Full time $120,000 - $180,000 per yearP-1363Location: Toronto, ONAs a Specialist Solutions Architect (SSA) - ML Engineering, you will be the trusted technical ML expert to both Databricks customers and the Field Engineering organization. You will work with Solution Architects to guide customers in architecting production-grade ML applications on Databricks, while aligning their technical roadmap...
-
Solutions Architect
2 weeks ago
Toronto, Ontario, Canada Databricks Full time $160,500 - $224,700Location: Toronto, ONAs a Solutions Architect on the Canada team focused on Digital Native customers, you will shape the future of the big data landscape by working with the most sophisticated data engineering and data science teams in the world.Reporting to the Field Engineering Manager, you will collaborate with customers, product teams, and the...
-
Azure Databricks Architect
10 hours ago
Toronto, Ontario, Canada Themesoft Inc. Full time US$100,000 - US$150,000 per yearRole: Azure Databricks ArchitectToronto , Hybrid 2 days/weekAzure Databricks Architect, Data Modeler with Lake house and Data as a Product knowledge,
-
Databricks Data Engineer
2 days ago
Toronto, Ontario, Canada CloudTech Innovations Full time $120,000 - $180,000 per yearJob Title: Data Engineer – DatabricksLocation:Onsite – Toronto, CanadaEmployment Type:ContractAbout the RoleWe are seeking an experienced Data Engineerwith a strong background inDatabricks,Apache Spark, andmodern cloud data platforms. The ideal candidate has over5 years of experiencedesigning, developing, and maintaining scalable data pipelines and...
-
Databricks Data Engineer
1 week ago
Toronto, Ontario, Canada Rubicon Path Full time $120,000 - $180,000 per yearWho we are looking forThis is a hands-on Data Engineer position. We are looking for candidate with good knowledge on Bigdata technology and strong development experience with Databricks. What you will be responsible forAs Data Engineer you willDesign & Develop custom high throughput and configurable frameworks/librariesArchitect and implement scalable data...
-
Databricks Lead/Architect
8 hours ago
Toronto, Ontario, Canada Exdonuts Full time US$100,000 - US$180,000 per yearDatabricks Lead/ArchitectLocation : Toronto, CanadaWork model : Hybrid ( 4 days to office)Job Description:• This will be a mix of support and development role, not pure development, but also lot of coordination.• this role is a mix of Business and Technical as my team support Cloud Consumption (which includes operational support for Analytics Zone...
-
Azure Data Architect
1 week ago
Toronto, Ontario, Canada Tata Consultancy Services (TCS) Full time US$120,000 - US$180,000 per yearInclusion without Exception:Tata Consultancy Services (TCS) is an equal opportunity employer, and embraces diversity in race, nationality, ethnicity, gender, age, physical ability, neurodiversity, and sexual orientation, to create a workforce that reflects the societies we operate in. Our continued commitment to Culture and Diversity is reflected in our...
-
Sr. Data Architect
6 days ago
Toronto, Ontario, Canada Source Code Full time $120,000 - $180,000 per yearSr. Data Architect - GOAPRDJP Eleventh Floor StreetEdmontonPrimarily work remotely but must be available for on-site meetings as required (Meetings may occur up to 3-4 times per fiscal month, but the actual frequency will depend on the specific initiative and will be determined on an on-demand basis.)Contract 4+ months1 Opening/ 3...
-
Data Architect
1 week ago
Toronto, Ontario, Canada E-Solutions Full time $120,000 - $180,000 per yearWE are hiring for below roleRole : Data ArchitectLocation : Toronto, ON M5V 3H6 (5 days onsite)Strong experience in unified data modeling, re-usable data pipeline framework, data governance, Payments / AML experienceAbout 15-20 Years experienceTech Stack – Snowflake, Python, Pyspark, big data technologies.Strong ETL and data architecture experience in...
-
Data Architect
1 week ago
Toronto, Ontario, Canada Kumaran Systems Full time $120,000 - $180,000 per yearArchitect and develop scalable data pipelines using Java and Go.Design, implement, and optimize workflows in Azure Data Factory.Build and maintain analytics solutions in Databricks.Oversee Data Lake architecture and management.Establish and enforce data quality frameworks and controls.Monitor and improve data integrity, accuracy, and consistency.Collaborate...