Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
dsadasd's picture
1 2 4

dsadasd

dqwdq
·

AI & ML interests

None yet

Recent Activity

upvoted a paper 29 days ago
Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards
liked a model about 2 months ago
zai-org/GLM-4.7
upvoted a paper 2 months ago
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices
View all activity

Organizations

None yet

upvoted a paper 29 days ago

Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

Paper • 2601.06021 • Published Jan 9 • 45
liked a model about 2 months ago

zai-org/GLM-4.7

Text Generation • 358B • Updated 12 days ago • 114k • • 1.91k
upvoted a paper 2 months ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 104
liked a dataset 4 months ago

Slyracoon23/arc-agi-sft-dataset-augmented

Viewer • Updated Jul 3, 2025 • 40k • 10 • 1
New activity in inclusionAI/Ring-1T 4 months ago

###

#9 opened 4 months ago by
dqwdq
liked 2 models 7 months ago

zai-org/GLM-4.5-Air

Text Generation • 110B • Updated Aug 11, 2025 • 154k • • 579

zai-org/GLM-4.5

Text Generation • 358B • Updated Aug 11, 2025 • 34k • • 1.4k
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs