Reasoning models trained on synthetic data using reinforcement learning.
Yichao 'Peak' Ji
peakji
AI & ML interests
Agents, Small Language Models, Retrieval-Augmented Generation, Information Extraction
Recent Activity
liked
a model 3 days ago
Qwen/Qwen3.5-35B-A3B-Base liked
a model 8 days ago
Qwen/Qwen3.5-397B-A17B liked
a model about 1 month ago
mistralai/Ministral-3-14B-Base-2512