AI Evaluation Specialist | $35/hr Remote

Crossing Hurdles • Distrito Federal, Distrito Federal, Mexico • Posted June 19, 2026

Location: Distrito Federal, Distrito Federal
Job Type: Full-time
Category: Other-General
Posted: June 19, 2026
Responsibilities
  • Design and implement self-contained evaluation tasks, including prompts, supporting files, and detailed grading rubrics, to assess AI performance on practical computer-based workflows.
  • Define clear, unambiguous written criteria for successful and unsuccessful task completion across diverse administrative and workflow scenarios.
  • Meticulously observe and document AI agent behaviors, producing crisp, precise summaries and reports in high-quality English.
  • Iterate on and refine evaluation tasks and rubrics based on feedback and team collaboration to ensure robust benchmarking methodologies.
  • Collaborate with the customer's team to share insights and help drive continuous improvement in AI evaluation techniques.
Requirements
  • Have a minimum of experience in roles emphasizing written precision and structured thinking, such as paralegal, executive assistant, junior analyst, librarian, document archival specialist, rese...
