huihui-ai/Qwen2.5-VL-7B-Instruct-abliterated Image-Text-to-Text • 8B • Updated Nov 7, 2025 • 1.84k • 40
Manga Translator 📖 Translate manga panels into different languages while preserving text style • Running • 3
LightOnOCR: A 1B End-to-End Multilingual Vision-Language Model for State-of-the-Art OCR Paper • 2601.14251 • Published 18 days ago • 24
PaddleOCR-VL Collection Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model • 3 items • Updated Dec 16, 2025 • 26
PaddleOCR-VL-1.5 Collection Towards a Multi-Task 0.9B VLM for Robust In-the-Wild Document Parsing • 5 items • Updated 8 days ago • 8
LightOnOCR-2 🦉 Collection LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family • 12 items • Updated 17 days ago • 22
Youtu-VL: Unleashing Visual Potential via Unified Vision-Language Supervision Paper • 2601.19798 • Published 11 days ago • 41
Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models Paper • 2512.24618 • Published Dec 31, 2025 • 147
VLM-FO1: Bridging the Gap Between High-Level Reasoning and Fine-Grained Perception in VLMs Paper • 2509.25916 • Published Sep 30, 2025 • 6