dooho lee

BlueYellowGreen

https://leedooho.com

BlueYellowGreen

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Qwen3 Technical Report

upvoted an article 5 days ago

🚀 DTS: A Candidate for the Best Parallel Reasoning in LLMs

upvoted a paper 5 days ago

LLaDA2.1: Speeding Up Text Diffusion via Token Editing

View all activity

Organizations

None yet

upvoted a paper 4 days ago

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14, 2025 • 332

upvoted an article 5 days ago

Article

🚀 DTS: A Candidate for the Best Parallel Reasoning in LLMs

5 days ago

•

upvoted 4 papers 5 days ago

LLaDA2.1: Speeding Up Text Diffusion via Token Editing

Paper • 2602.08676 • Published 7 days ago • 64

OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration

Paper • 2602.05400 • Published 11 days ago • 314

When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning

Paper • 2602.10560 • Published 6 days ago • 27

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

Paper • 2602.10604 • Published 5 days ago • 174

upvoted 5 articles 5 days ago

Article

Performant local mixture-of-experts CPU inference with GPU acceleration in llama.cpp

18 days ago

•

Article

Fine-Tuning FunctionGemma on TPU to Create a Virtual Fitness Coach in 10 Minutes, $0.50

14 days ago

•

Article

CRAFT: Continuous Reasoning and Agentic Feedback Tuning

11 days ago

•

Article

From Golden Gate Bridge to Broken JSON: Why Anthropic's SAE Steering Fails for Structured Output

9 days ago

•

Article

Training Qwen3 VL to label bbox : synthetic data, environment and training analysis

7 days ago

•

upvoted 4 papers 6 days ago

upvoted 4 papers 14 days ago

Shaping capabilities with token-level data filtering

Paper • 2601.21571 • Published 18 days ago • 26

ConceptMoE: Adaptive Token-to-Concept Compression for Implicit Compute Allocation

Paper • 2601.21420 • Published 18 days ago • 42

Idea2Story: An Automated Pipeline for Transforming Research Concepts into Complete Scientific Narratives

Paper • 2601.20833 • Published 19 days ago • 176

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published 17 days ago • 99

upvoted a paper 17 days ago

Qwen3-ASR Technical Report

Paper • 2601.21337 • Published 18 days ago • 35

dooho lee

AI & ML interests

Recent Activity

Organizations

BlueYellowGreen's activity

🚀 DTS: A Candidate for the Best Parallel Reasoning in LLMs

Performant local mixture-of-experts CPU inference with GPU acceleration in llama.cpp

Fine-Tuning FunctionGemma on TPU to Create a Virtual Fitness Coach in 10 Minutes, $0.50

CRAFT: Continuous Reasoning and Agentic Feedback Tuning

From Golden Gate Bridge to Broken JSON: Why Anthropic's SAE Steering Fails for Structured Output

Training Qwen3 VL to label bbox : synthetic data, environment and training analysis