Zhangchen Xu PRO

zhangchenxu

https://zhangchenxu.com/

AI & ML interests

LLM Data, Alignment, Post-Training, Safety

Recent Activity

liked a model 2 days ago

miromind-ai/MiroThinker-v1.5-30B

liked a model 2 days ago

miromind-ai/MiroThinker-v1.5-235B

liked a model 25 days ago

nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16

upvoted 2 papers 3 months ago

Efficient Long-context Language Model Training by Core Attention Disaggregation

Paper • 2510.18121 • Published Oct 20, 2025 • 122

Building a Foundational Guardrail for General Agentic Systems via Synthetic Data

Paper • 2510.09781 • Published Oct 10, 2025 • 26

upvoted an article 3 months ago

BigCodeArena: Judging code generations end to end with code executions

Article • Oct 7, 2025 • 19

upvoted 3 papers 3 months ago

CoDA: Agentic Systems for Collaborative Data Visualization

Paper • 2510.03194 • Published Oct 3, 2025 • 28

TOUCAN: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments

Paper • 2510.01179 • Published Oct 1, 2025 • 25

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

Paper • 2509.24002 • Published Sep 28, 2025 • 174

upvoted an article 6 months ago

SmolLM3: smol, multilingual, long-context reasoner

Article • Jul 8, 2025 • 746

upvoted a paper 7 months ago

Magistral

Paper • 2506.10910 • Published Jun 12, 2025 • 66

upvoted a collection 7 months ago

TinyV

8 items • Updated Jun 22, 2025 • 1

upvoted a paper 7 months ago

VisualSphinx: Large-Scale Synthetic Vision Logic Puzzles for RL

Paper • 2505.23977 • Published May 29, 2025 • 10

upvoted 2 papers 8 months ago

Personalized Safety in LLMs: A Benchmark and A Planning-Based Agent Approach

Paper • 2505.18882 • Published May 24, 2025 • 14

TinyV: Reducing False Negatives in Verification Improves RL for LLM Reasoning

Paper • 2505.14625 • Published May 20, 2025 • 13

upvoted an article 10 months ago

Open R1: Update #3

Article • Mar 11, 2025 • 296

upvoted a paper 10 months ago

KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for Coding

Paper • 2503.02951 • Published Mar 4, 2025 • 33

upvoted 3 collections 11 months ago

KodCode-V1

KodCode-V1 is the largest fully-synthetic open-source dataset providing verifiable solutions and tests for coding tasks. • 6 items • Updated Apr 2, 2025 • 5

Small Model Learnability Gap: Models

24 items • Updated Feb 24, 2025 • 2

Small Model Learnability Gap: Dataset

6 items • Updated Feb 21, 2025 • 3

upvoted a paper 11 months ago

Small Models Struggle to Learn from Strong Reasoners

Paper • 2502.12143 • Published Feb 17, 2025 • 39

upvoted a collection about 1 year ago

Magpie Reasoning Datasets

Reasoning datasets built by Magpie and its friends! • 8 items • Updated Jan 27, 2025 • 11

upvoted an article about 1 year ago

Fine-tune a SmolLM on domain-specific synthetic data from a LLM

Article • Jan 3, 2025 • 37