Shenzhi Wang
shenzhi-wang
AI & ML interests
Large Language Model, Reinforcement Learning, and AI Agents
Recent Activity
upvoted
a
paper
about 6 hours ago
OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration
upvoted
a
paper
about 8 hours ago
On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models
upvoted
a
paper
7 days ago
SWE-Universe: Scale Real-World Verifiable Environments to Millions