Document to Markdown This collection contains models which convert text or multimodal documents to markdown format for various downstream tasks. rednote-hilab/dots.ocr Image-Text-to-Text • 3B • Updated Oct 31, 2025 • 232k • 1.22k numind/NuMarkdown-8B-Thinking Image-to-Text • 8B • Updated Nov 13, 2025 • 1.02M • 368 zai-org/GLM-4.5V Image-Text-to-Text • 108B • Updated Oct 25, 2025 • 31.9k • • 704 microsoft/kosmos-2.5 Image-Text-to-Text • 1B • Updated Aug 28, 2025 • 16.1k • 269
Document Datasets docling-project/DocLayNet Updated Jan 25, 2023 • 663 • 125 common-pile/caselaw_access_project Viewer • Updated Jun 6, 2025 • 5.52M • 1.49k • 205 llamaindex/vdr-multilingual-test Viewer • Updated Jan 10, 2025 • 15k • 179 • 3 PleIAs/common_corpus Viewer • Updated Jun 10, 2025 • 470M • 36.6k • 334
Document to Markdown This collection contains models which convert text or multimodal documents to markdown format for various downstream tasks. rednote-hilab/dots.ocr Image-Text-to-Text • 3B • Updated Oct 31, 2025 • 232k • 1.22k numind/NuMarkdown-8B-Thinking Image-to-Text • 8B • Updated Nov 13, 2025 • 1.02M • 368 zai-org/GLM-4.5V Image-Text-to-Text • 108B • Updated Oct 25, 2025 • 31.9k • • 704 microsoft/kosmos-2.5 Image-Text-to-Text • 1B • Updated Aug 28, 2025 • 16.1k • 269
Document Datasets docling-project/DocLayNet Updated Jan 25, 2023 • 663 • 125 common-pile/caselaw_access_project Viewer • Updated Jun 6, 2025 • 5.52M • 1.49k • 205 llamaindex/vdr-multilingual-test Viewer • Updated Jan 10, 2025 • 15k • 179 • 3 PleIAs/common_corpus Viewer • Updated Jun 10, 2025 • 470M • 36.6k • 334