dsadasd's picture

1 2 4

dsadasd

dqwdq

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 29 days ago

Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

liked a model about 2 months ago

zai-org/GLM-4.7

upvoted a paper 2 months ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

View all activity

Organizations

None yet

upvoted a paper 29 days ago

Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

Paper • 2601.06021 • Published Jan 9 • 45

liked a model about 2 months ago

zai-org/GLM-4.7

Text Generation • 358B • Updated 12 days ago • 114k • • 1.91k

upvoted a paper 2 months ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 104

liked a dataset 4 months ago

Slyracoon23/arc-agi-sft-dataset-augmented

Viewer • Updated Jul 3, 2025 • 40k • 10 • 1

New activity in inclusionAI/Ring-1T 4 months ago

###

#9 opened 4 months ago by

liked 2 models 7 months ago

zai-org/GLM-4.5-Air

Text Generation • 110B • Updated Aug 11, 2025 • 154k • • 579

zai-org/GLM-4.5

Text Generation • 358B • Updated Aug 11, 2025 • 34k • • 1.4k