This collection hosts a series of Vision Language Models (VLMs) fine-tuned for Optical Character Recognition (OCR) and Document Processing.
-
loay/Arabic-OCR-Qwen2.5-VL-7B-Vision
Image-to-Text • 8B • Updated • 119 • 3 -
loay/Arabic-OCR-DeepSeek-OCR-2
Image-to-Text • 3B • Updated • 37 -
loay/English-Document-OCR-Qwen3.5-2B
Image-Text-to-Text • 2B • Updated • 235 • 1 -
loay/English-Document-OCR-Qwen3.5-0.8B
Image-Text-to-Text • 0.8B • Updated • 4