AI Alignment Engineer: RLHF & Reward Modeling
Location
Remote, Remote
Job Type
Full-time
Category
IT & Technology
Posted
June 17, 2026
Odixcity Consulting is hiring an RLHF Specialist to enhance and align AI models using reinforcement learning methodologies. This role involves designing feedback pipelines, generating high-quality preference data, and collaborating with machine learning engineers. Candidates should have at least 2 years of experience in relevant fields, strong Python skills, and familiarity with deep learning frameworks. The position is remote, allowing for global collaboration on cutting-edge AI technologies.
#J-18808-Ljbffr
#J-18808-Ljbffr