AI Alignment Engineer: RLHF & Reward Modeling

Odixcity Consulting • Remote, Remote, South-Africa • Posted June 17, 2026

Location Remote, Remote

Job Type Full-time

Category IT & Technology

Posted June 17, 2026

                Odixcity Consulting is hiring an RLHF Specialist to enhance and align AI models using reinforcement learning methodologies. This role involves designing feedback pipelines, generating high-quality preference data, and collaborating with machine learning engineers. Candidates should have at least 2 years of experience in relevant fields, strong Python skills, and familiarity with deep learning frameworks. The position is remote, allowing for global collaboration on cutting-edge AI technologies.
#J-18808-Ljbffr
            

Interested in this role?

Click the button below to start your application.

Apply Now