AI Evaluator

3 weeks ago

Toronto Montreal Calgary Vancouver Edmonton Old Toronto Ottawa Mississauga Quebec Winnipeg Halifax Saskatoon Burnaby Hamilton Victoria Surrey Halton Hills London Regina Markham Brampton Vaughan Kelowna Laval Southwestern Ontario R, Canada Braintrust Full time

Join to apply for the AI Evaluator / Annotator (Remote‑freelance, 100+ openings) role at Braintrust. Position Overview iMerit seeks detail‑oriented and analytically minded Multimodal GenAI Evaluation Analysts to perform highly nuanced evaluations of AI system outputs across different modalities: text, image, video, and multimodal interactions. Analysts will assess accuracy, appropriateness, quality, clarity, and cultural alignment of model outputs against complex guidelines, ensuring that results align with project standards and real‑world use cases. These evaluations will directly inform the development and fine‑tuning of advanced large language models (LLMs), vision models (LVMs), and multimodal AI systems. Role Responsibilities Evaluate outputs generated by LLMs across multiple modalities (text, image captions, video descriptions, and multimodal prompts). Assess quality against project‑specific criteria such as correctness, coherence, completeness, style, cultural appropriateness, and safety. Identify subtle errors, hallucinations, or biases in AI responses. Apply domain expertise and logical reasoning to resolve ambiguous or unclear outputs. Provide detailed written feedback, tagging, and scoring of outputs to ensure consistency across the evaluation team. Escalate unclear cases and contribute to refining evaluation guidelines. Collaborate with Project Managers and Quality Leads to meet accuracy, reliability, and turnaround benchmarks. Skills & Competencies Strong critical reading, observational, and evaluative skills across different modalities. Ability to articulate nuanced judgments with precision and clarity. Excellent English comprehension (CEFR B2 or above); additional languages a plus. Familiarity with LLMs, generative AI, and multimodal systems. Strong attention to detail and ability to apply guidelines consistently. Awareness of cultural and linguistic nuances, including potential bias and harm in AI outputs. Comfort with evolving workflows, rapid feedback cycles, and complex quality frameworks. Requirements Bachelor's degree/diploma or equivalent educational qualification. 1+ years of experience in data annotation, LLM evaluation, content moderation, or related AI/ML domains. Demonstrated experience working with data annotation tools and software platforms. Strong understanding of language and multimodal communication (instruction following in image generation, fact‑checking, narrative coherence in video, etc.). Ability to adapt quickly to changing project directions and fast‑paced work environments. Previous experience creating or annotating complex data specifically for Large Language Model (LLM) training. Prior exposure to generative AI, prompt engineering, or LLM fine‑tuning workflows is a plus. Familiarity with potential exposure to NSFW or sensitive content; comfortable working in environments where incidental exposure may occur. What We Offer Opportunities to shape the evaluation standards for next‑generation multimodal AI systems. Innovative and supportive global working environment. Competitive compensation and flexible remote working arrangements. Continuous learning and growth in applied AI evaluation. Selection Process You will receive an iMerit platform assessment (15–30 minutes). If successfully completed, you’ll be invited to join the first project. After onboarding, once you’ve completed 10 hours of work, a quality test will be conducted. If you pass the quality test, you’ll continue on a 3‑month project and will be invited to participate in upcoming projects. Note: You will complete a quick 15–30 minute assessment (requires downloading a browser extension, which can be removed once the assessment is completed). ID verification and background check are required. Onboarding will be completed through iMerit’s platform. Commitment Minimum 20 hours per week (flexible schedule). You may work more hours if desired. Hourly Rates Malaysia – $5/hr Mexico, Colombia, Brazil, Costa Rica – $8.50/hr Argentina, Poland, Bulgaria, Romania, Malta, Latvia, Lithuania, UAE – $13/hr Portugal, Italy, Greece, Spain – $15.50/hr Canada, Australia, New Zealand, United Kingdom, Ireland, US, Finland, France, Sweden, Belgium, Austria, Denmark, Germany, Luxembourg, Estonia – $22/hr #J-18808-Ljbffr

AI Evaluation Expert

2 weeks ago

Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Surrey, Victoria, London, Halton Hills, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Taskify AI Full time

AI Evaluation Expert (Remote) – Taskify AI Join a worldwide community of skilled professionals outsmarting AI with human creativity and analytical brilliance. As a Data Contributor, you will directly influence the quality and safety of cutting‑edge language models, including collaborations with industry-standard partners. Salary and Compensation Earn up...
AI Content Evaluator

2 weeks ago

Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Surrey, Victoria, London, Halton Hills, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Taskify AI Full time

A leading AI development firm is seeking English Speakers/Writers for a freelance role focused on evaluating and improving AI model outputs. You'll contribute to exciting AI projects and work on your own schedule, earning up to $35/hr. Ideal candidates hold a Bachelor's degree in writing and possess strong writing and critical thinking skills, with a keen...
Remote DBA for AI Systems Evaluation

2 weeks ago

Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Surrey, Victoria, London, Halton Hills, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Sepal AI Full time

A tech research firm in Canada is seeking experienced database administrators to influence AI systems evaluation standards. This fully remote short-term project offers hourly compensation ranging from $38 to $87 based on experience. Participants will review AI-generated database queries and provide expert insights into database administration for safety and...
Remote AI Content Evaluator

6 days ago

Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Surrey, Victoria, London, Halton Hills, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Taskify AI Full time

A technology-driven company in Canada seeks a Public Relations Specialist for a remote, part-time role involving content evaluation and feedback on AI-generated outputs. Ideal candidates should have strong written communication skills, an analytical mindset, and a keen attention to detail. Join us to help enhance the quality and clarity of next-generation AI...
AI Content Quality Evaluator — Remote, Flexible Hours

6 days ago

Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Surrey, Victoria, London, Halton Hills, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Taskify AI Full time

A leading AI evaluation company in Canada seeks a detail-oriented Writing Assessment Specialist for remote work. The role involves reviewing written material and evaluating AI-generated content to enhance quality. This part-time position offers competitive compensation based on performance and flexible hours. Candidates should have strong written...
Remote Accounting Clerk

3 weeks ago

Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Victoria, Surrey, Halton Hills, London, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada AI Jobs Full time

A global leader in AI training data is seeking an experienced Accounting Clerk to enhance AI models with accounting insights. Responsibilities include creating accounting prompts, developing grading rubrics, and evaluating AI-generated responses. Successful candidates will have a PhD or Master’s in Accounting, strong analytical skills, and excellent...
Remote Writing Evaluator

6 days ago

Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Surrey, Victoria, London, Halton Hills, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Taskify AI Full time

A leading AI evaluation company is looking for a Writing Evaluator to join their remote team. This part-time role involves reviewing AI-generated content for accuracy and clarity, providing structured feedback, and working independently. With competitive monthly compensation and flexible working hours, this is an excellent opportunity for detail-oriented...
Remote AI Content Evaluator

6 days ago

Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Surrey, Victoria, London, Halton Hills, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Taskify AI Full time

A leading technology firm is seeking a Literature Evaluator for a part-time remote role. In this position, you will review AI-generated content, assess the clarity and accuracy of written responses, and provide structured feedback. Ideal candidates are detail-oriented, possess strong written communication skills, and are comfortable working independently....
Literature Evaluator

6 days ago

Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Surrey, Victoria, London, Halton Hills, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Taskify AI Full time

Literature Evaluator (Remote) – Taskify AI Pay: Competitive monthly compensation up to $10,000, based on role and performance. We’re looking for detail‑oriented professionals to support a variety of content and AI‑related evaluation tasks. This role involves reviewing written material, analysing responses, and helping enhance the quality, accuracy,...
Writing Evaluator

6 days ago

Toronto, Montreal, Calgary, Vancouver, Edmonton, Old Toronto, Ottawa, Mississauga, Quebec, Winnipeg, Halifax, Saskatoon, Burnaby, Hamilton, Surrey, Victoria, London, Halton Hills, Regina, Markham, Brampton, Vaughan, Kelowna, Laval, Southwestern Ontario, R, Canada Taskify AI Full time

Join to apply for the Writing Evaluator (Remote) role at Taskify AI 1 day ago Be among the first 25 applicants Pay: Competitive monthly compensation up to $10,000, based on role and performance We’re looking for detail-oriented professionals to support a variety of content and AI-related evaluation tasks. This role involves reviewing written material,...

Americas

Europe

Asia / Oceania

Africa

AI Evaluator