Hacker Newsnew | past | comments | ask | show | jobs | submit | fromlogin
Reinforcement Learning from Human Feedback (rlhfbook.com)
133 points by onurkanbkrc 23 days ago | past | 5 comments
RLHF Book (rlhfbook.com)
479 points by jxmorris12 on Feb 1, 2025 | past | 37 comments

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: