Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
13
15
6
Rémi Ouazan Reboul
ror
Follow
webxos's profile picture
aldakata's profile picture
Bajju360's profile picture
132 followers
·
178 following
remi-or
remiouazan
AI & ML interests
None yet
Recent Activity
reacted
to
qgallouedec
's
post
with 🔥
1 day ago
TRL v1.2 introduces the SSDTrainer 🚀 Simple Self-Distillation (SSD) from Apple's paper "Embarrassingly Simple Self-Distillation Improves Code Generation" is now available as an experimental trainer in TRL. The recipe is as minimal as the name suggests: sample completions from the model itself at a training-time temperature, then fine-tune on those raw, unverified samples with plain cross-entropy. No reward model. No verifier. No teacher model. No reinforcement learning. Just prompts and the model. ```python from trl.experimental.ssd import SSDConfig, SSDTrainer trainer = SSDTrainer( model="Qwen/Qwen3-4B-Instruct", args=SSDConfig(temperature=0.6, top_k=20, top_p=0.95), train_dataset=dataset, ) trainer.train() ``` v1.2 also ships expanded tool-calling support (LLaMA 3.1 / 3.2, DeepSeek-V3), another round of KTO ↔ DPO alignment getting us closer to promoting KTO to stable, a big GRPO simplification for overlong tool results, deprecation of `use_transformers_paged`, and key fixes for VLM response parsing. Full release notes: https://github.com/huggingface/trl/releases/tag/v1.2.0
liked
a Space
2 months ago
huggingface/eleusis-benchmark
upvoted
an
article
3 months ago
Architectural Choices in China's Open-Source AI Ecosystem: Building Beyond DeepSeek
View all activity
Organizations
ror
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
published
an
article
5 months ago
view article
Article
Continuous batching from first principles
+1
Nov 25, 2025
•
360
published
an
article
9 months ago
view article
Article
Creating custom kernels for the AMD MI300
Jul 9, 2025
•
54
published
an
article
10 months ago
view article
Article
Creating custom kernels for the AMD MI300
Jul 9, 2025
•
54