wongyukim

kimwongyuda

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago

PhyCritic: Multimodal Critic Models for Physical AI

upvoted a paper 6 days ago

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

upvoted a paper 6 days ago

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Organizations

None yet

upvoted 5 papers 6 days ago

Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

Paper • 2601.06021 • Published Jan 9 • 47

upvoted 7 papers 7 days ago

DLLM-Searcher: Adapting Diffusion Large Language Model for Search Agents

Paper • 2602.07035 • Published 16 days ago • 30

TodoEvolve: Learning to Architect Agent Planning Systems

Paper • 2602.07839 • Published 11 days ago • 6

iGRPO: Self-Feedback-Driven LLM Reasoning

Paper • 2602.09000 • Published 10 days ago • 15

VideoWorld 2: Learning Transferable Knowledge from Real-world Videos

Paper • 2602.10102 • Published 9 days ago • 14

VLA-JEPA: Enhancing Vision-Language-Action Model with Latent World Model

Paper • 2602.10098 • Published 9 days ago • 17

Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems

Paper • 2602.08847 • Published 10 days ago • 24

SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

Paper • 2602.08234 • Published 10 days ago • 65

upvoted 7 papers 8 days ago

GISA: A Benchmark for General Information-Seeking Assistant

Paper • 2602.08543 • Published 10 days ago • 26

InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery

Paper • 2602.08990 • Published 10 days ago • 69

QuantaAlpha: An Evolutionary Framework for LLM-Driven Alpha Mining

Paper • 2602.07085 • Published 13 days ago • 181

Weak-Driven Learning: How Weak Agents make Strong Agents Stronger

Paper • 2602.08222 • Published 10 days ago • 256

Recurrent-Depth VLA: Implicit Test-Time Compute Scaling of Vision-Language-Action Models via Latent Iterative Reasoning

Paper • 2602.07845 • Published 11 days ago • 68

AIRS-Bench: a Suite of Tasks for Frontier AI Research Science Agents

Paper • 2602.06855 • Published 13 days ago • 70

LLaDA2.1: Speeding Up Text Diffusion via Token Editing

Paper • 2602.08676 • Published 10 days ago • 66

upvoted a paper 9 days ago

F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare

Paper • 2602.06717 • Published 13 days ago • 71
