arxiv:2503.04625
ChengpengLi
AI & ML interests
LLMs for reasoning, reinforcement learning, recommendation systems, diffusion models
Recent Activity
Upvoted a paper about 13 hours ago: Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?
Upvoted a paper 2 months ago: Agentic Entropy-Balanced Policy Optimization
Upvoted a paper 3 months ago: Quantile Advantage Estimation for Entropy-Safe Reasoning
Organizations
None yet