Researcher - Reinforcement Learning

Huawei Technologies Canada Co., Ltd. • Edmonton, Alberta, Canada • Posted March 23, 2026

Location Edmonton, Alberta

Job Type contract

Category Computer Occupations

Posted March 23, 2026

Job description
Huawei Canada has an immediate 12-month contract opening for a Reinforcement Learning Researcher.

About the team:
Founded in 2012, the Noah’s Ark lab has evolved into a prominent research organization with notable achievements in academia and industry. The lab’s mission focuses on advancing artificial intelligence and related fields to benefit the company and society. Driven by impactful, long-term projects, the aim is to enhance state-of-the-art research while integrating innovations into the company's products and services, including LLMs, RL, NLP, computer vision, AI theory, and Autonomous driving.
About the job:
Enabling Large Language Models (LLMs) to learn from experience, interaction, and environment feedback, moving beyond static fine-tuning toward continual, agentic self-improvement.
LLM post-training paradigms (e.g., RLHF, GRPO, reward-free methods, etc.).
<...
            

Interested in this role?

Click the button below to start your application.

Apply Now