volume_mute

What is a key advantage of reinforcement learning from human feedback (RLHF)?

publish date2025/09/27 06:57:33.283899 UTC

volume_mute

Correct Answer

It helps models align outputs with human preferences and judgements

Explanation

RLHF uses human input to guide AI behavior in complex scenarios.  It  supplements, but doesn't replace, model updates.  RLHF helps systems adapt, not stay static.  It still involves some form of labeling or feedback.

Reference

AWS Certified AI Practitioner (AIF-C01) Study Guide, Tom Taulli


Quizzes you can take where this question appears