39 203 50

KABI

dongguanting

https://dongguanting.github.io/

AI & ML interests

Reasoning and Alignment for Large Language Models

Recent Activity

upvoted a paper about 18 hours ago

DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation

upvoted a paper about 18 hours ago

STEP3-VL-10B Technical Report

upvoted a paper about 18 hours ago

Toward Efficient Agents: Memory, Tool learning, and Planning

View all activity

Organizations

upvoted 3 papers about 18 hours ago

DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation

Paper • 2601.09688 • Published 7 days ago • 119

STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published 7 days ago • 179

Toward Efficient Agents: Memory, Tool learning, and Planning

Paper • 2601.14192 • Published 1 day ago • 29

upvoted a paper about 19 hours ago

Agentic-R: Learning to Retrieve for Agentic Search

Paper • 2601.11888 • Published 5 days ago • 11

liked a model 6 days ago

MurrayTom/TS-Guard

8B • Updated 7 days ago • 23 • 7

upvoted a paper 6 days ago

ToolSafe: Enhancing Tool Invocation Safety of LLM-based agents via Proactive Step-level Guardrail and Feedback

Paper • 2601.10156 • Published 7 days ago • 22

authored a paper 9 days ago

ET-Agent: Incentivizing Effective Tool-Integrated Reasoning Agent via Behavior Calibration

Paper • 2601.06860 • Published 11 days ago • 15

upvoted a paper 9 days ago

ET-Agent: Incentivizing Effective Tool-Integrated Reasoning Agent via Behavior Calibration

Paper • 2601.06860 • Published 11 days ago • 15

authored a paper 10 days ago

EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis

Paper • 2601.05808 • Published 13 days ago • 35

upvoted a paper 10 days ago

EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis

Paper • 2601.05808 • Published 13 days ago • 35

liked a dataset 13 days ago

XXHStudyHard/EnvScaler-SFT-Traj-9K

Viewer • Updated 7 days ago • 9.02k • 88 • 5

upvoted 2 papers 14 days ago

Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting

Paper • 2601.02151 • Published 17 days ago • 99

ROI-Reasoning: Rational Optimization for Inference via Pre-Computation Meta-Cognition

Paper • 2601.03822 • Published 15 days ago • 22

liked a model 25 days ago

dongguanting/QwQ-32B-AEPO-DeepSearch

Text Generation • 33B • Updated Dec 20, 2025 • 4 • 1

upvoted a paper 26 days ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 86

upvoted a paper about 1 month ago

Seed-Prover 1.5: Mastering Undergraduate-Level Theorem Proving via Learning from Experience

Paper • 2512.17260 • Published Dec 19, 2025 • 49

liked a model about 1 month ago

dongguanting/Qwen3-8B-AEPO-DeepSearch

Text Generation • 8B • Updated Dec 20, 2025 • 13 • 2

updated 2 models about 1 month ago

dongguanting/Qwen3-8B-AEPO-DeepSearch

Text Generation • 8B • Updated Dec 20, 2025 • 13 • 2

dongguanting/QwQ-32B-AEPO-DeepSearch

Text Generation • 33B • Updated Dec 20, 2025 • 4 • 1

updated a collection about 1 month ago

AEPO

Collection

The official datasets and model checkpoints of AEPO • 5 items • Updated Dec 20, 2025 • 4

KABI

AI & ML interests

Recent Activity

Organizations

dongguanting's activity