-
zhouxiangxin/Variational-Reasoning-32B-Acc
Text Generation • 33B • Updated • 2 -
zhouxiangxin/Variational-Reasoning-32B-GML
Text Generation • 33B • Updated • 5 -
zhouxiangxin/Variational-Reasoning-8B-Acc
Text Generation • 8B • Updated • 3 -
zhouxiangxin/Variational-Reasoning-8B-GML
Text Generation • 8B • Updated • 2
Xiangxin Zhou
zhouxiangxin
AI & ML interests
None yet
Recent Activity
authored
a paper
24 days ago
Rethinking the Trust Region in LLM Reinforcement Learning upvoted a paper 24 days ago
Rethinking the Trust Region in LLM Reinforcement Learning liked
a model 3 months ago
GSAI-ML/LLaDA-8B-Base