Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
NiuTrans
/
robust_visual_reward_model
like
5
Follow
NiuTrans
43
Safetensors
vision
DPO
RLHF
preference
feedback
reward model
preference model
arxiv:
2408.12109
License:
mit
Model card
Files
Files and versions
xet
Community
main
robust_visual_reward_model
/
figure
506 kB
Ctrl+K
Ctrl+K
2 contributors
History:
1 commit
gan-yang-zuzhu
upload models.
8810cfa
over 1 year ago
main_image.png
Safe
506 kB
upload models.
over 1 year ago