Senior Software Engineer for LLM Evaluation
Location
Remote
Job Type
Full-time
Category
Technology
Posted
June 05, 2026
Role Overview
As a Software Engineering evaluator, you will build the datasets used to train, benchmark, and improve large language models. You will work closely with researchers to curate code examples, write precise solutions, and refine AI-generated code across several programming languages, so that the resulting AI-driven coding solutions are reliable and efficient.
- Curate code examples, build solutions, and correct code in Python, JavaScript (including ReactJS), C/C++, Java, Rust, and Go for AI model training initiatives.
- Evaluate and refine AI-generated code to ensure efficiency, scalability, and reliability.
- Collaborate with cross-functional teams to improve AI-driven coding solutions and measure them against industry performance benchmarks.
- Develop agents to verify code quality and identify error patterns.
- Hypothesize on soft...