Datasets and model checkpoints for our Group Relative Reward Model (GRRM) framework
- GRRM: Group Relative Reward Modeling for Machine Translation
  Paper • 2602.14028 • Published
- double7/Qwen2.5-7B-GRRM
  Text Generation • 8B • Updated • 22
- double7/Qwen2.5-7B-MT-GRRM-Optimized
  Text Generation • 8B • Updated • 7
- double7/Qwen2.5-7B-MT-GRRM-Optimized-CLA
  Text Generation • 8B • Updated • 10