Human-in-the-loop learning (HILL), Reinforcement learning from human feedback (RLHF), Reinforcement learning (RL)