Lead Data Engineer
1 week ago
Job Title: Lead Data Engineer – Python, PySpark & SQL
Location: Canada
Job Type: Full-time contract

Responsibilities
Build scalable data ingestion and transformation pipelines using Python, PySpark, and SQL.
Process raw CSV/text files from AWS S3, including header validation, schema checks, and malformed-file detection.
Convert raw data into structured DataFrames and implement reusable data quality checks.
Develop advanced transformations using SQL/PySpark (window functions, LAG(), grouping logic, date gap detection, etc.).
Deploy and tune PySpark applications on AWS EMR, optimizing executor memory, cores, shuffle behavior, and cluster performance.
Work with AWS services such as S3, EMR, Glue, Lambda, and IAM.
Debug performance issues (OOM errors, shuffle spill, GC problems) and improve pipeline reliability.
Lead design discussions and code reviews, and mentor junior engineers.

Required Skills
8+ years of experience in Data Engineering.
Expert Python (file processing, scripting, validation automation).
Strong PySpark (DataFrames, job tuning, distributed processing).
Advanced SQL (analytical functions, performance tuning).
Hands‑on with AWS data stack: S3, EMR, Glue, Lambda.
Strong understanding of Spark memory allocation, YARN container usage, and EMR resource tuning.
Excellent debugging, communication, and problem‑solving skills.

Nice to Have
Airflow or Databricks experience.
Terraform or CloudFormation.
Experience with data lake formats (Delta, Iceberg, Hudi).

Seniority level: Mid-Senior level
Employment type: Contract
Job function: Information Technology
-
Lead Data Engineer
1 week ago
Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Surrey, Victoria, London, Halton Hills, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada
North Eastern Services
Full time
About Fusemachines
Fusemachines is a leading AI strategy, talent, and education services provider. Founded by Sameer Maskey Ph.D., Adjunct Associate Professor at Columbia University, Fusemachines has a core mission of democratizing AI. With a presence in 4 countries (Nepal, United States, Canada, and Dominican Republic) and more than 400 full‑time...
-
Lead Data Engineer
1 week ago
Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Surrey, Victoria, London, Halton Hills, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada
Fusemachines
Full time
About Fusemachines
Fusemachines is a leading AI strategy, talent, and education services provider founded by Sameer Maskey, Ph.D., Adjunct Associate Professor at Columbia University. With a presence in Nepal, the United States, Canada, and the Dominican Republic and more than 400 full‐time employees, Fusemachines is dedicated to democratizing AI and...
-
Lead Data Engineer
2 weeks ago
Calgary, Toronto, Montreal, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Surrey, Victoria, London, Halton Hills, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada
Lumenalta
Full time
Lead Data Engineer - Databricks (Remote) at Lumenalta
We help global enterprises launch digital products that reach millions of users. Our work spans massive datasets, scalable pipelines, and critical business challenges across industries.
Base Pay Range: CA$90,000 – CA$181,000 per year
What You’ll Do
Lead technical direction...
-
Senior Design Engineer
2 weeks ago
Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Surrey, Victoria, London, Halton Hills, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada
Redpanda Data
Full time
Redpanda is pioneering the Agentic Data Plane (ADP) - a new category in AI infrastructure that makes it simple and secure to connect AI agents with enterprise data and systems. Built on a multi‑modal data streaming engine, Redpanda empowers agentic applications that reason and act in real‑time with speed, autonomy, and precision. Global leaders including...
-
Lead Data Engineer
3 weeks ago
Calgary, Toronto, Montreal, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada
Lumenalta
Full time
Lead Data Engineer - Databricks (Remote) at Lumenalta
We help global enterprises launch digital products that reach millions of users. Our work spans massive datasets, scalable pipelines, and critical business challenges across industries.
What You’ll Do
Lead technical direction for data engineering initiatives. Architect scalable data pipelines and...
-
Senior Quality Engineer
2 weeks ago
Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Surrey, Victoria, London, Halton Hills, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada
NTT DATA
Full time
Req ID: 343699
NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now. We are currently seeking a Senior Quality Engineer - Remote to join our team in Toronto, Ontario (CA-ON), Canada (CA). Senior Quality Engineer –...
-
Manager/Lead - Data Engineering
4 weeks ago
Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada
HRB
Full time
Manager/Lead Data Engineering
We are a boutique, rapidly growing, GCP (Google Cloud Platform) consulting company based out of Toronto. We work with GCP’s top customers (banking, telco, energy, retail, etc.) to help them with cloud transformation, security, analytics, ML, and data governance. Clients usually engage us to solve their most challenging business...
-
Lead Data Engineer
1 week ago
Calgary, Toronto, Montreal, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Surrey, Victoria, London, Halton Hills, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada
Lumenalta
Full time
This range is provided by Lumenalta. Your actual pay will be based on your skills and experience; talk with your recruiter to learn more.
Base pay range: CA$90,000.00/yr - CA$181,000.00/yr
What We're Working On
We help global enterprises launch digital products that reach millions of users. Our work spans massive...
-
MDM Lead Data Engineer
4 weeks ago
Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada
Persistent Systems
Full time
We are an AI‑led, platform‑driven Digital Engineering and Enterprise Modernization partner, combining deep technical expertise and industry experience to help our clients anticipate what’s next. Our offerings and proven solutions create a unique competitive advantage for our clients by giving them the power to see beyond and rise above. We work with...
-
Senior Data Engineer
3 weeks ago
Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada
BET99
Full time
A leading online sportsbook and casino is seeking a Senior Data Engineer to own and scale their data platform. The role focuses on building a reliable data warehouse in Snowflake, implementing data quality monitoring, and collaborating with analytical teams. Ideal candidates will have strong SQL and Python skills, along with expertise in data integrations....