🌿 Back to all jobs

🥝 Ai qa trainer - llm evaluation - freelance project

Invisible Expert Marketplace | South-Africa, South-Africa | Posted June 30, 2026

Job Description

Overview

Get AI-powered advice on this job and more exclusive features. We’re looking for AI QA trainers who specialize in model evaluation, LLM safety, prompt robustness, data quality assurance, multilingual and domain-specific testing, grounding verification, and compliance readiness checks. You’ll evaluate advanced language models on tasks such as hallucination detection, factual consistency, prompt-injection and jailbreak resistance, bias/fairness audits, chain-of-reasoning reliability, tool-use correctness, retrieval-augmentation fidelity, and end-to-end workflow validation. You will document every failure mode to raise the bar for quality.

On a typical day, you will converse with the model on real-world scenarios and evaluation prompts, verify factual accuracy and logical soundness, design and run test plans and regression suites, build clear rubrics and pass/fail criteria, capture reproducible error traces with root-cause hypotheses, and suggest improvements to promp...

Apply for This Position

Submit Application