In a Training Loop 🔄

Lewis Tunstall PRO

lewtun

huggingface

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

Recent Activity

updated a dataset 4 days ago

lewtun/ml-intern-sessions

updated a Space 5 days ago

lewtun/ml-intern-abc12345

published a Space 5 days ago

lewtun/ml-intern-abc12345

liked a Space 5 days ago

physics-intern: an Autonomous Agent for Physics Research

Generate autonomous research reports for physics problems

liked a model 11 days ago

Zyphra/ZAYA1-8B

9B • Updated 6 days ago • 145k • 517

liked a Space 12 days ago

The ultimate guide to RL environments: building and scaling them in the LLM era

Building and scaling RL environments for LLM training

liked a Space 13 days ago

MoE Recipe Builder

Tetris-style recipe builder for Qwen3-30B-A3B MoE training

liked 2 Spaces 16 days ago

Hutter Prize Dashboard

Dashboard for the Hutter Prize (100MB) collab

Efficient Optimizer Live

Dashboard for the Efficient Optimizer challenge

liked 2 models 19 days ago

poolside/Laguna-XS.2

Text Generation • 33B • Updated 9 days ago • 46.4k • 255

lewtun/talkie-1930-13b-it-hf

Text Generation • 13B • Updated 19 days ago • 7.16k • 23

liked a Space 19 days ago

Talkie 1930

Chat with a 1930s‑style language model

liked 2 models 19 days ago

talkie-lm/talkie-1930-13b-it

Updated 24 days ago • 270

talkie-lm/talkie-1930-13b-base

Updated 24 days ago • 86

liked a model 23 days ago

deepseek-ai/DeepSeek-V4-Pro

Text Generation • 862B • Updated 11 days ago • 3.14M • 4k

liked a model 25 days ago

openai/privacy-filter

Token Classification • 1B • Updated 25 days ago • 248k • 1.45k

liked a Space 27 days ago

Traces Viewer

Explore and visualize trace logs in an interactive web viewer

liked a Space 28 days ago

Defeating the trainer-generator precision mismatch in TRL

Download research PDF (Pro access required)

liked a model about 1 month ago

Qwen/Qwen3.6-35B-A3B

Image-Text-to-Text • 36B • Updated 24 days ago • 5.48M • 1.79k

liked a dataset about 1 month ago

GenerTeam/pretrain_data_eukaryote

Viewer • Updated 6 days ago • 100 • 898 • 3

liked a Space about 1 month ago

Distilling 100B+ Models 40x Faster with TRL

TRL distillation for 100B+ teachers, 40x faster

liked a dataset about 2 months ago

arcinstitute/opengenome2

Preview • Updated Sep 20, 2025 • 8.5k • 140

liked a model about 2 months ago

arcee-ai/Trinity-Large-Thinking

Text Generation • 399B • Updated 3 days ago • 20.9k • 169