5 7 39

Nik PRO

Malvinan

AI & ML interests

None yet

Recent Activity

upvoted a paper 8 days ago

Let ViT Speak: Generative Language-Image Pre-training

upvoted an article 30 days ago

NEO-unify: Building Native Multimodal Unified Models End to End

liked a Space about 2 months ago

HuggingFaceM4/FineVision

View all activity

Organizations

upvoted a paper 8 days ago

Let ViT Speak: Generative Language-Image Pre-training

Paper • 2605.00809 • Published 14 days ago • 32

upvoted an article 30 days ago

Article

NEO-unify: Building Native Multimodal Unified Models End to End

sensenova

•

Mar 5

• 161

liked a Space about 2 months ago

FineVision: Open Data is All You Need

📝

224

A new open-source dataset for training VLMs

liked a model 3 months ago

LiquidAI/LFM2-1.2B-Tool

Text Generation • 1B • Updated Mar 31 • 682 • 103

liked a Space 3 months ago

The Smol Training Playbook

📚

3.17k

The secrets to building world-class LLMs

liked 2 datasets 5 months ago

afaji/cvqa

Viewer • Updated Nov 27, 2024 • 10.4k • 3.86k • 37

neulab/CulturalGround

Updated Oct 23, 2025 • 1.22k • 18

updated a model 5 months ago

tofu-logs/vl-pythia-eva-1b

1B • Updated Dec 7, 2025 • 1

published a model 5 months ago

tofu-logs/vl-pythia-eva-1b

1B • Updated Dec 7, 2025 • 1

updated a model 5 months ago

tofu-logs/vl-pythia-eva-410m

0.7B • Updated Dec 7, 2025 • 1

published a model 5 months ago

tofu-logs/vl-pythia-eva-410m

0.7B • Updated Dec 7, 2025 • 1

updated a model 5 months ago

tofu-logs/vl-pythia-eva-160m

0.5B • Updated Dec 7, 2025 • 3

published a model 5 months ago

tofu-logs/vl-pythia-eva-160m

0.5B • Updated Dec 7, 2025 • 3

upvoted a paper 8 months ago

Visual Representation Alignment for Multimodal Large Language Models

Paper • 2509.07979 • Published Sep 9, 2025 • 84

upvoted a paper 11 months ago

BLIP3-KALE: Knowledge Augmented Large-Scale Dense Captions

Paper • 2411.07461 • Published Nov 12, 2024 • 23

Nik PRO

AI & ML interests

Recent Activity

Organizations

Malvinan's activity

NEO-unify: Building Native Multimodal Unified Models End to End

FineVision: Open Data is All You Need

The Smol Training Playbook