https://www.lesswrong.com/posts/HLJoJYi52mxgomujc/realistic-reward-hacking-induces-different-and-deeper-1
Sharan Maiya
maius
Recent Activity
updated a model 5 days ago: maius/llama-3.3-70b-it-sarcasm-dpo-resample
published a model 5 days ago: maius/llama-3.3-70b-it-sarcasm-dpo-resample
updated a model 5 days ago: maius/llama-3.3-70b-it-sarcasm-dpo-reword