Biapy Web Directory
1 result tagged rlhf
(WIP) A Little Bit of Reinforcement Learning from Human Feedback
https://rlhfbook.com/
Mon Feb 3 14:07:29 2025
A short introduction to RLHF and post-training focused on language models.
9027 links