Hugging Face

XuQixin

Racktic

AI & ML interests

NLP, multimodal

Recent Activity

updated a model 3 days ago
Sci-Agent/scientist_8b_all_4o_rag_truncation-step32
updated a model 3 days ago
Sci-Agent/scientist_8b_all_4o_rag_truncation-step28
updated a model 3 days ago
Sci-Agent/scientist_8b_all_4o_rag_truncation-step26

Organizations

Science Agent RL Data

upvoted 2 papers 3 months ago

BrowserAgent: Building Web Agents with Human-Inspired Web Browsing Actions

Paper • 2510.10666 • Published Oct 12, 2025 • 27

UniVideo: Unified Understanding, Generation, and Editing for Videos

Paper • 2510.08377 • Published Oct 9, 2025 • 71
upvoted 2 papers 4 months ago

Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning

Paper • 2509.03646 • Published Sep 3, 2025 • 32

Reverse-Engineered Reasoning for Open-Ended Generation

Paper • 2509.06160 • Published Sep 7, 2025 • 150
upvoted 2 papers 7 months ago

RLPR: Extrapolating RLVR to General Domains without Verifiers

Paper • 2506.18254 • Published Jun 23, 2025 • 31

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published May 28, 2025 • 131
upvoted a paper 11 months ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3, 2025 • 61