Reinforcement Learning from Human Feedback (RLHF) in Notebooks from Hacker News on 2025-07-06 14:23 (#6YEZY)