arxiv:2511.13524
Xiaoji Zheng
Student-Xiaoji
AI & ML interests
None yet
Recent Activity
liked
a model about 4 hours ago
nvidia/GR00T-N1.6-3B upvoted a paper about 12 hours ago
CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR upvoted a paper about 12 hours ago
OpenClaw-RL: Train Any Agent Simply by Talking Organizations
None yet