Avi Basa's picture

217

Avi Basa

avahal

·

AI & ML interests

None yet

Recent Activity

commented on a paper 1 day ago

Evolving Deeper LLM Thinking

commented on a paper 1 day ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

commented on a paper 1 day ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

View all activity

Organizations

None yet

commented 20 papers 1 day ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17 • 115 •

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 123 •

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22 • 127 •

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 286 •

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14 • 301 •

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 430 •

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published Feb 20 • 174 •

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published Feb 13 • 191 •

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published Feb 20 • 193 •

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 212 •

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Paper • 2502.01061 • Published Feb 3 • 222 •

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 252 •

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18 • 144 •

ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Paper • 2503.11647 • Published Mar 14 • 145 •

RWKV-7 "Goose" with Expressive Dynamic State Evolution

Paper • 2503.14456 • Published Mar 18 • 153 •

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published Mar 26 • 168 •

Transformers without Normalization

Paper • 2503.10622 • Published Mar 13 • 171 •

Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders

Paper • 2503.03601 • Published Mar 5 • 232 •

OmniSVG: A Unified Scalable Vector Graphics Generation Model

Paper • 2504.06263 • Published Apr 8 • 182 •

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7 • 202 •