Llm evaluator (model response analyst)

Odixcity Consulting • Remote, Remote, South-Africa • Posted June 23, 2026

Location Remote, Remote
Job Type Full-time
Category Other-General
Posted June 23, 2026

Job Title: LLM Evaluator (Model Response Analyst)

Location: Remote (Worldwide)

Job Summary: We are seeking a detail-oriented and analytical LLM Evaluator to assess, analyze, and improve the performance of large language models (LLMs). In this role, you will evaluate AI-generated content for accuracy, coherence, factual reliability, bias, safety, and alignment with defined guidelines. Responsibilities Evaluate and rank model-generated text based on complex rubrics covering dimensions such as factuality, coherence, safety, instruction‑following, and creativity. Review multiple model responses to the same prompt and determine which output a human would prefer, providing justifications for your choices. Provide clear, concise feedback to the modeling and training teams regarding recurring failure models observed during evaluation sessions. Attempt to “break” the model by crafting prompts designed to elicit biased, harmful, or insecure outputs to help patch safety vul...

Interested in this role?

Click the button below to start your application.

Apply Now