volume_mute
What is a key advantage of reinforcement learning from human feedback (RLHF)?
publish date: 2025/09/27 06:57:33.283899 UTC
volume_mute
Correct Answer
It helps models align outputs with human preferences and judgements
Explanation
RLHF uses human input to guide AI behavior in complex scenarios. It supplements, but doesn't replace, model updates. RLHF helps systems adapt, not stay static. It still involves some form of labeling or feedback.
Reference
AWS Certified AI Practitioner (AIF-C01) Study Guide, Tom Taulli