RLAnything: Forge Environment, Policy, and Reward Model in Completely Dynamic RL System Paper • 2602.02488 • Published Feb 2
On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models Paper • 2602.03392 • Published Feb 3
Audio Flamingo 3: Advancing Audio Intelligence with Fully Open Large Audio Language Models Paper • 2507.08128 • Published Jul 10, 2025
Skywork-SWE: Unveiling Data Scaling Laws for Software Engineering in LLMs Paper • 2506.19290 • Published Jun 24, 2025
HardTests: Synthesizing High-Quality Test Cases for LLM Coding Paper • 2505.24098 • Published May 30, 2025
LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning Paper • 2506.18841 • Published Jun 23, 2025
SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks Paper • 2506.10954 • Published Jun 12, 2025
REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards Paper • 2505.24760 • Published May 30, 2025
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models Paper • 2505.24864 • Published May 30, 2025
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning Paper • 2506.01939 • Published Jun 2, 2025
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning Paper • 2505.24726 • Published May 30, 2025
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention Paper • 2506.13585 • Published Jun 16, 2025