vikas (Vikas Kumar)

upvoted a collection 8 months ago

EmbeddingGemma

Collection

3 items • Updated Mar 12 • 118

upvoted 2 articles 10 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

+21

eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf

•

Jul 8, 2025

• 773

Article

Training and Finetuning Sparse Embedding Models with Sentence Transformers

tomaarsen, arthurbresnu

•

Jul 1, 2025

• 138

upvoted a collection 12 months ago

Deepseek Papers

Collection

Deepseek papers collection • 31 items • Updated about 18 hours ago • 341

upvoted an article 12 months ago

Article

The Transformers Library: standardizing model definitions

+2

lysandre, ArthurZ, pcuenq, julien-c

•

May 15, 2025

• 121

upvoted a paper about 1 year ago

Judging LLM-as-a-judge with MT-Bench and Chatbot Arena

Paper • 2306.05685 • Published Jun 9, 2023 • 43

upvoted an article about 1 year ago

Article

1 Billion Classifications

derek-thomas

•

Feb 13, 2025

• 45

upvoted 2 articles over 1 year ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

tomaarsen

•

Jan 15, 2025

• 229

Article

The 5 Most Under-Rated Tools on Hugging Face

derek-thomas

•

Aug 22, 2024

• 93

upvoted 2 papers almost 2 years ago

Transformer Explainer: Interactive Learning of Text-Generative Models

Paper • 2408.04619 • Published Aug 8, 2024 • 175

Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing

Paper • 2406.08464 • Published Jun 12, 2024 • 72

upvoted an article almost 2 years ago

Article

Finetuning PaliGemma with AutoTrain

abhishek

•

Jul 25, 2024

• 13

upvoted a collection almost 2 years ago

Gemma 2 2B Release

Collection

The 2.6B parameter version of Gemma 2. • 6 items • Updated Mar 12 • 84

upvoted 2 articles almost 2 years ago

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

mlabonne

•

Jul 29, 2024

• 371

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

+6

philschmid, osanseviero, alvarobartt, lvwerra, dvilasuero, reach-vb, marcsun13, pcuenq

•

Jul 23, 2024

• 241

upvoted a collection almost 2 years ago

🪐 SmolLM

Collection

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated May 5, 2025 • 249

upvoted 3 articles almost 2 years ago

Article

SmolLM - blazingly fast and remarkably powerful

+1

loubnabnl, anton-l, eliebak

•

Jul 16, 2024

• 455

Article

The Rise of Agentic Data Generation

mlabonne

•

Jul 15, 2024

• 89

Article

ColPali: Efficient Document Retrieval with Vision Language Models 👀

manu

•

Jul 5, 2024

• 317

upvoted a collection almost 2 years ago

Florence

Collection

5 items • Updated Mar 2 • 174

Vikas Kumar

AI & ML interests