Lead Data Scientist- Nlp, Llm and Genai

1 week ago


Toronto, Canada S&P Global Full time

**About the Role**:
**Grade Level (for internal use)**: 11

**The Role**: Lead Data Scientist
- NLP, LLM and GenAI

**Responsibilities**:
**ML, Gen AI, NLP, LLM Model Development**: Design and develop custom ML, Gen AI, NLP, LLM Models for batch and stream processing-based AI ML pipelines. Model components will include data ingestion, preprocessing, search and retrieval, Retrieval Augmented Generation (RAG), NLP/LLM model development, fine-tuning and prompt engineering and ensure the solution meets all technical and business requirements. Work closely with other members of data science, MlOps, technology teams in the design, development, and implementation of the ML model solutions.
**ML, NLP, LLM Model Evaluation**: Work closely with the other data science team members to develop, validate, and maintain robust evaluation solutions and tools to evaluate model performance, accuracy, consistency, reliability, during development, UAT. Implement model optimizations to improve system efficiency.
**NLP, LLM, Gen AI Model Deployment**: Work closely with the MLOps team for the deployment of machine learning models into production environments, ensuring reliability and scalability.
**Internal Collaboration**: Collaborate closely with product teams, business stakeholders, Mlops, machine learning engineers, and software engineers to ensure smooth integration of machine learning models into production systems.
**Documentation**: Write and Maintain comprehensive documentation of ML modeling processes and procedures for reference and knowledge sharing.
**Develop Models Based on Standards and Best Practices**: Ensure that the models are designed and developed while adhering to specified standards, governance and best practices in ML model development as specified by senior Data Science and MLOps leads.
**Assist in Problem Solving**: Troubleshoot complex issues related to machine learning model development and data pipelines and develop innovative solutions.

**What We’re Looking For**:
Bachelor's / Master’s in Computer Science, Mathematics or Statistics, Computational linguistics, Engineering, or a related field.
2+ years of professional hands-on experience leveraging large sets of structured and unstructured data to develop data-driven tactical and strategic analytics and insights using ML, NLP, computer vision solutions.
Demonstrated 2+ years hands-on experience with Python, Hugging Face, TensorFlow, Keras, PyTorch, Spark or similar statistical tools. Expert in python programming.
2+ years hands-on experience developing natural language processing (NLP) models, ideally with transformer architectures.
2+ years of experience with implementing information search and retrieval at scale, using a range of solutions ranging from keyword search to semantic search using embeddings.
Knowledge of and measurable hands-on experience with developing or tuning Large Language Models (LLM) and Generative AI (GAI)
Experienced with NLP, LLMs (extractive and generative), fine-tuning and LLM model development. Familiar with higher level trends in LLMs and open-source platforms
**Nice to have**: Experience with contributing to Github and open source initiatives or in research projects and/or participation in Kaggle competitions.

**Compensation/Benefits Information**:
S&P Global states that the anticipated base salary range for this position is $100,200 - $215,000. Base salary ranges may vary by geographic location.

This role is eligible to receive S&P Global benefits.

About S&P Global Ratings
At S&P Global Ratings, our analyst-driven credit ratings, research, and sustainable finance opinions provide critical insights that are essential to translating complexity into clarity so market participants can uncover opportunities and make decisions with conviction. By bringing transparency to the market through high-quality independent opinions on creditworthiness, we enable growth across a wide variety of organizations, including businesses, governments, and institutions.
**S&P Global Ratings is a division of S&P Global (NYSE**: SPGI). S&P Global is the world’s foremost provider of credit ratings, benchmarks, analytics and workflow solutions in the global capital, commodity and automotive markets. With every one of our offerings, we help many of the world’s leading organizations navigate the economic landscape so they can plan for tomorrow, today.

What’s In It For You?

**Our Purpose**:
Progress is not a self-starter. It requires a catalyst to be set in motion. Information, imagination, people, technology-the right combination can unlock possibility and change the world.

Our world is in transition and getting more complex by the day. We push past expected observations and seek out new levels of understanding so that we can help companies, governments and individuals make an impact on tomorrow. At S&P Global we transform data into Essential Intelligence®, pinpointing risks and opening possibilities. We Accelerate Progress.

**Our Peo


  • Data Scientist

    1 week ago


    Toronto, Canada S&P Global Full time

    **About the Role**: **Grade Level (for internal use)**: 09 **The Role**: Data Scientist - NLP, LLM and GenAI **Responsibilities**: **ML, Gen AI, NLP, LLM Model Development**: Design and develop custom ML, Gen AI, NLP, LLM Models for batch and stream processing-based AI ML pipelines. Model components will include data ingestion, preprocessing, search and...


  • Toronto, Canada Thomson Reuters Full time

    Join to apply for the Lead Applied Scientist, NLP/GenAI role at Thomson Reuters. Lead Applied Scientist, Document Understanding Document understanding is a foundational intelligence layer that powers every major capability across our legal AI platform—from search and information extraction to agentic reasoning in products like Westlaw, PracticalLaw, and...


  • Toronto, Canada Thomson Reuters Full time

    Join to apply for the Lead Applied Scientist, NLP/GenAI role at Thomson Reuters . Lead Applied Scientist, Document Understanding Document understanding is a foundational intelligence layer that powers every major capability across our legal AI platform—from search and information extraction to agentic reasoning in products like Westlaw, PracticalLaw, and...


  • Toronto, Canada Thomson Reuters Full time

    Join to apply for the Lead Applied Scientist, NLP/GenAI role at Thomson Reuters. Lead Applied Scientist, Document Understanding Document understanding is a foundational intelligence layer that powers every major capability across our legal AI platform—from search and information extraction to agentic reasoning in products like Westlaw, PracticalLaw, and...


  • Toronto, Canada S&P Global Full time

    **About the Role**: **Grade Level (for internal use)**: 12 **The Role**: Associate Director of Data Science - RAG, NLP, LLM and GenAI **Responsibilities and Impact**: **ML, Gen AI, NLP, LLM Model Development**: Design and develop custom ML, Gen AI, NLP, LLM Models for batch and stream processing-based AI ML pipelines. Model components will include data...


  • Toronto, Canada Thomson Reuters Full time

    Join to apply for the Senior Applied Scientist, NLP/IR/GenAI role at Thomson Reuters . Are you excited about working at the forefront of applied research in an industry setting? Thomson Reuters Labs in Canada is seeking scientists with a passion for solving problems using state-of-the-art natural language processing, information retrieval, and generative AI....


  • Toronto, Canada Thomson Reuters Full time

    Join to apply for the Senior Applied Scientist, NLP/IR/GenAI role at Thomson Reuters.Are you excited about working at the forefront of applied research in an industry setting? Thomson Reuters Labs in Canada is seeking scientists with a passion for solving problems using state-of-the-art natural language processing, information retrieval, and generative AI....


  • Toronto, Canada Thomson Reuters Full time

    Join to apply for the Senior Applied Scientist, NLP/IR/GenAI role at Thomson Reuters.Are you excited about working at the forefront of applied research in an industry setting? Thomson Reuters Labs in Canada is seeking scientists with a passion for solving problems using state-of-the-art natural language processing, information retrieval, and generative AI....


  • Toronto, Canada Mindlance Full time

    A leading technology firm in Toronto is seeking a Senior Data Engineer to develop large-scale data pipelines for LLM and GenAI applications. The role requires mentoring junior engineers and collaborating with data scientists. Candidates should have over 5 years in data engineering, with strong skills in Python and SQL. This mid-senior level contract position...

  • Senior Data Scientist

    8 hours ago


    Toronto, Canada Intact Full time

    A leading insurance company in Toronto is seeking a Senior Data Scientist to develop AI solutions and lead NLP projects. The ideal candidate will have over 5 years of experience in data science, and a Master's degree, focusing on transforming data into actionable insights. This full-time role emphasizes collaboration in a cross-functional team environment, a...