XuQixin's picture

7 3

XuQixin

Racktic

·

Racktic

AI & ML interests

NLP, mutimodel

Recent Activity

updated a model 3 days ago

Sci-Agent/scientist_8b_all_4o_rag_truncation-step32

updated a model 3 days ago

Sci-Agent/scientist_8b_all_4o_rag_truncation-step28

updated a model 3 days ago

Sci-Agent/scientist_8b_all_4o_rag_truncation-step26

View all activity

Organizations

upvoted 2 papers 3 months ago

BrowserAgent: Building Web Agents with Human-Inspired Web Browsing Actions

Paper • 2510.10666 • Published Oct 12, 2025 • 27

UniVideo: Unified Understanding, Generation, and Editing for Videos

Paper • 2510.08377 • Published Oct 9, 2025 • 71

upvoted 2 papers 4 months ago

Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning

Paper • 2509.03646 • Published Sep 3, 2025 • 32

Reverse-Engineered Reasoning for Open-Ended Generation

Paper • 2509.06160 • Published Sep 7, 2025 • 150

upvoted 2 papers 7 months ago

RLPR: Extrapolating RLVR to General Domains without Verifiers

Paper • 2506.18254 • Published Jun 23, 2025 • 31

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published May 28, 2025 • 131

upvoted a paper 11 months ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3, 2025 • 61