Post-Doctoral Research Visit F/M Benchmarks for Evaluating LLMs for Lean Elegance

INRIA • Paris, Île-de-France, France • Posted June 10, 2026

Location Paris, Île-de-France
Job Type CDD
Category Life Scientists
Posted June 10, 2026

Contexte et atouts du poste

The postdoc will be within the SIERRA team, which focuses on theoretical machine learning, statistics and optimization. There will be interactions with other teams within INRIA interested in AI for maths (ARGO, PICUBE, SCOOL) as well as at ENS (CSD).


Travel, equipment, and compute expenses will be covered within reasonable limits.

Mission confiée

Proposed research subject :


Frontier models have demonstrated rapid progress in producing correct Lean code, saturating existing Lean benchmarks of advanced problems in both the IMO and Putnam competitions, and can produce Fields Medal-level formalizations of research math. However, while Lean code that type checks might be reasonably declared correct, for formalizations to be useful to humans we need to extend our assessment beyond mere correctness to code quality, such as concision, transparency, maintainability, human readability, eleg...

Interested in this role?

Click the button below to start your application.

Apply Now