In a Training Loop 🔄

Yulei Qin

yolay

https://yuleichin.github.io/

AI & ML interests

Medical Imaging, Computer Vision, Language Models

Recent Activity

updated a model 21 days ago

yolay/Youtu-Agent-RL-Maths-Qwen2.5-7B

updated a model 21 days ago

yolay/Youtu-Agent-RL-Search-Qwen2.5-7B

upvoted a paper 7 days ago

Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models

Paper • 2512.24618 • Published Dec 31, 2025 • 147

upvoted 3 collections 26 days ago

upvoted a collection 29 days ago

Ai-general

Collection

50 items • Updated 8 days ago • 3

upvoted 6 papers about 1 month ago

Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization

Paper • 2512.24615 • Published Dec 31, 2025 • 119

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published Dec 31, 2025 • 297

YOLO-Master: MOE-Accelerated with Specialized Transformers for Enhanced Real-time Detection

Paper • 2512.23273 • Published Dec 29, 2025 • 14

Nested Browser-Use Learning for Agentic Information Seeking

Paper • 2512.23647 • Published Dec 29, 2025 • 18

SmartSnap: Proactive Evidence Seeking for Self-Verifying Agents

Paper • 2512.22322 • Published Dec 26, 2025 • 39

ROCKET-1: Master Open-World Interaction with Visual-Temporal Context Prompting

Paper • 2410.17856 • Published Oct 23, 2024 • 52

upvoted 2 papers 2 months ago

SSA: Sparse Sparse Attention by Aligning Full and Sparse Attention Outputs in Feature Space

Paper • 2511.20102 • Published Nov 25, 2025 • 27

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Paper • 2511.16043 • Published Nov 20, 2025 • 109

upvoted 3 papers 3 months ago

Scalable Multi-Task Reinforcement Learning for Generalizable Spatial Intelligence in Visuomotor Agents

Paper • 2507.23698 • Published Jul 31, 2025 • 11

Agent Lightning: Train ANY AI Agents with Reinforcement Learning

Paper • 2508.03680 • Published Aug 5, 2025 • 133

LTD-Bench: Evaluating Large Language Models by Letting Them Draw

Paper • 2511.02347 • Published Nov 4, 2025 • 9

upvoted an article 3 months ago

Article

Aligning to What? Rethinking Agent Generalization in MiniMax M2

Oct 30, 2025

upvoted a paper 4 months ago

Training-Free Group Relative Policy Optimization

Paper • 2510.08191 • Published Oct 9, 2025 • 45

upvoted a collection 4 months ago

Reinforcement learning

Collection

103 items • Updated about 3 hours ago • 9

upvoted a paper 4 months ago

Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning

Paper • 2509.22601 • Published Sep 26, 2025 • 30
