Summer Intern, AI Evaluation

18 hours ago


Toronto, Ontario, Canada Armilla AI Full time
The Role: AI Evaluation & Testing

We're looking for a motivated summer intern to join our team and gain hands-on experience developing AI evaluation and testing frameworks. As a Summer Intern focused on AI Evaluation & Testing, you'll work directly with our AI assessment teams to build the tools, methodologies, and infrastructure needed to systematically evaluate and test AI systems, particularly Large Language Models (LLMs). This is a unique opportunity to work at the forefront of AI risk management while developing valuable technical skills in a startup environment.

Role Responsibilities

In this exciting internship, you'll:

  • Assist in designing and implementing evaluation frameworks to test AI models, with a focus on Large Language Models (LLMs).
  • Conduct experiments to identify potential failure modes, biases, and vulnerabilities in AI systems.
  • Develop automated testing scripts and tools in Python to streamline AI evaluation processes.
  • Contribute to in-depth quantitative analysis and research, staying ahead of emerging AI risks and industry trends.
  • Support the team in building datasets, benchmarks, and evaluation metrics for AI risk assessment.
  • Gain exposure to the insurance industry and how AI systems are assessed for insurability.
What We're Looking For

We're seeking a candidate who brings:

  • Currently pursuing or recently completed a degree in Computer Science, Data Science, Machine Learning, Statistics, Mathematics, or a related field.
  • Strong programming skills in Python, with experience in libraries such as Pandas, NumPy, or similar tools.
  • Familiarity with machine learning concepts and interest in AI/LLM technologies (hands-on experience is a plus but not required).
  • A curious, analytical mindset with strong attention to detail and problem-solving abilities.
  • Ability to work both independently and collaboratively in a fast-paced startup environment.
  • Excellent communication skills and the ability to present technical findings clearly.
  • Enthusiasm for learning about AI safety, evaluation methodologies, and emerging risks in AI systems.
What's In It For You

Joining Armilla AI means:

  • Cutting-Edge Experience: Work hands-on with the latest AI technologies, particularly LLMs, in a real-world business context.
  • Meaningful Impact: Your work will directly contribute to how we assess and understand AI risks, shaping our product development and risk frameworks.
  • Mentorship & Learning: Learn from experienced AI and insurance professionals and gain insight into an emerging industry at the intersection of technology and risk management.
  • Startup Culture: Experience the dynamic, collaborative environment of a growing startup where your ideas and contributions are valued.
How to Apply

Excited about the opportunity to work on AI evaluation and testing at the frontier of AI risk management? We'd love to hear from you Please send your resume and a brief note outlining:

  1. Your relevant coursework, projects, or experience with AI/ML and Python
  2. Any examples of technical work (e.g., GitHub repositories, course projects, Kaggle competitions, or personal projects)
  3. What excites you about AI evaluation and why you're interested in joining Armilla AI this summer


  • Toronto, Ontario, Canada Exiger Full time

    AI Data Engineer Intern – TES Research & Delivery (Summer 2026)Location: Toronto, ON (Hybrid)Duration: June 8, 2026 – August 14, 2026Pay Rate: $25/hourRole SummaryExiger is seeking a highly motivated AI Data Engineer Intern to join our Tech Enabled Services (TES): Research & Delivery team for Summer 2026. Reporting to senior research and delivery...


  • Toronto, Ontario, Canada Exiger Full time

    AI Data Engineer Intern – TES Research & Delivery (Summer 2026)Location: Toronto, ON (Hybrid)Duration: June 8, 2026 – August 14, 2026Pay Rate: $25/hourRole SummaryExiger is seeking a highly motivated AI Data Engineer Intern to join our Tech Enabled Services (TES): Research & Delivery team for Summer 2026. Reporting to senior research and delivery...


  • Toronto, Ontario, Canada Dayforce Full time

    Job Title: Product & AI Intern Location: Virtual Duration: Summer 2026 – 4 months (May 2026 – August 2026) Availability: *Full-time availability of 37.5 – 40 hours weekly is required to be eligible for this opportunity. Benefits for Students: Experience working for one of the fastest growing Human Capital Management technology companies in the...

  • AI Engineer

    1 week ago


    Toronto, Ontario, Canada Armilla AI Full time

    The Role: Building & Evaluating AI SolutionsWe're looking for a talented and experienced AI Engineer to join our growing team. This is a pivotal role where you'll be instrumental in both building our core AI-powered tools and developing the programmatic platforms to rigorously evaluate AI systems. If you're passionate about applying robust engineering...


  • Toronto, Ontario, Canada Dayforce Full time

    Dayforce, a global leader in Human Capital Management (HCM) with headquarters in Toronto, Ontario, and Minneapolis, Minnesota, operates across North America, EMEA, and APJ regions. Our Cloud HCM platform, recognized for its unified database and continuous calculation engine, enhances efficiency, productivity, and compliance for global workforces. We are...


  • Toronto, Ontario, Canada Armilla AI Full time

    The RoleWe're seeking an exceptional Applied Scientist who bridges the worlds of deep AI research and practical, production-grade applications. As our Applied Scientist, AI Risk, you'll be instrumental in both advancing our understanding of AI systems and translating that knowledge into robust risk assessment and evaluation frameworks. This isn't just about...


  • Toronto, Ontario, Canada Dayforce Full time

    Job Title: AI Developer Intern Location: Hybrid – Dayforce Toronto Office (3 days/week) Duration: Summer Months (May 2026 – December 2026) Availability: *Full-time availability of 37.5 – 40 hours weekly is required to be eligible for this opportunity. Benefits for Students: Experience working for one of the fastest growing Human Capital Management...


  • Toronto, Ontario, Canada Rally Assets Full time

    Full job description, in English and French | Description complète du poste, en anglais et en français: AI Enablement Intern, Impact OperationsLength of Internship: May 4 - August 7, weeks)Compensation: $25 an hour, 37.5 hours a weekLocation: Toronto (in person or hybrid)Application deadline: Sunday, January 18, 2026Want to build, test, and refine AI tools...


  • Toronto, Ontario, Canada Dayforce Full time

    Dayforce, a global leader in Human Capital Management (HCM) with headquarters in Toronto, Ontario, and Minneapolis, Minnesota, operates across North America, EMEA, and APJ regions. Our Cloud HCM platform, recognized for its unified database and continuous calculation engine, enhances efficiency, productivity, and compliance for global workforces. We are...

  • Applied AI Engineer

    7 days ago


    Toronto, Ontario, Canada Boam AI Full time

    Ship production ML and agentic AI powering market leaders worldwideBoam AI builds managed data solutions that transform messy, unstructured signals from public, private, and proprietary sources into structured, reliable, and always up-to-date intelligence on millions of SMBs and enterprises worldwide. These agentic systems power CRMs, data warehouses, AI...