ChartVerse: Scaling Chart Reasoning via Reliable Programmatic Synthesis from Scratch
Abstract
ChartVerse is a framework that synthesizes complex charts and reliable reasoning data using novel metrics and answer-first paradigms to improve vision-language model performance.
Chart reasoning is a critical capability for Vision Language Models (VLMs). However, the development of open-source models is severely hindered by the lack of high-quality training data. Existing datasets suffer from a dual challenge: synthetic charts are often simplistic and repetitive, while the associated QA pairs are prone to hallucinations and lack the reasoning depth required for complex tasks. To bridge this gap, we propose ChartVerse, a scalable framework designed to synthesize complex charts and reliable reasoning data from scratch. (1) To address the bottleneck of simple patterns, we first introduce Rollout Posterior Entropy (RPE), a novel metric that quantifies chart complexity. Guided by RPE, we develop a complexity-aware chart coder that autonomously synthesizes diverse, high-complexity charts via executable programs. (2) To guarantee reasoning rigor, we develop truth-anchored inverse QA synthesis. Diverging from standard generation, we adopt an answer-first paradigm: we extract deterministic answers directly from the source code, generate questions conditioned on these anchors, and enforce strict consistency verification. To further elevate difficulty and reasoning depth, we filter samples based on model fail-rate and distill high-quality Chain-of-Thought (CoT) reasoning. We curate ChartVerse-SFT-600K and ChartVerse-RL-40K using Qwen3-VL-30B-A3B-Thinking as the teacher. Experimental results demonstrate that ChartVerse-8B achieves state-of-the-art performance, notably surpassing its teacher and rivaling the stronger Qwen3-VL-32B-Thinking.
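The answer-first idea and the fail-rate filter described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the `QASample` class, the `normalize`, `fail_rate`, and `keep_hard_samples` helpers, the exact-match comparison, the toy `sales` data, and the `min_fail`/`max_fail` thresholds are all hypothetical choices made for illustration. The sketch assumes the anchor answer is computed from the same data the chart program plots, and that fail-rate is the fraction of sampled model answers that disagree with that anchor.

```python
from dataclasses import dataclass


@dataclass
class QASample:
    question: str
    answer: str          # anchor answer computed from the chart's source data
    rollouts: list[str]  # answers sampled from a model for this question


def normalize(text: str) -> str:
    # Hypothetical matching rule: case-insensitive exact match after trimming.
    return text.strip().lower()


def fail_rate(sample: QASample) -> float:
    """Fraction of rollouts that disagree with the anchor answer."""
    if not sample.rollouts:
        raise ValueError("need at least one rollout")
    target = normalize(sample.answer)
    misses = sum(normalize(r) != target for r in sample.rollouts)
    return misses / len(sample.rollouts)


def keep_hard_samples(samples, min_fail=0.25, max_fail=1.0):
    """Keep samples whose fail-rate lies in [min_fail, max_fail].

    The thresholds are illustrative assumptions, not values from the paper.
    """
    return [s for s in samples if min_fail <= fail_rate(s) <= max_fail]


# Answer-first: derive the answers from the data behind the chart,
# then attach questions to those anchors.
sales = {"Q1": 120, "Q2": 95, "Q3": 140, "Q4": 110}  # toy chart data
highest = max(sales, key=sales.get)
lowest = min(sales, key=sales.get)

samples = [
    QASample("Which quarter has the highest sales?", highest,
             ["Q3", "Q3", "q3 ", "Q3"]),
    QASample("Which quarter has the lowest sales?", lowest,
             ["Q2", "Q4", "Q1", "Q2"]),
]

hard = keep_hard_samples(samples)
for s in hard:
    print(s.question, fail_rate(s))
```

In this toy run the first question is answered correctly by every rollout and is dropped as too easy, while the second is missed by half of the rollouts and is kept. A real pipeline would likely need a more tolerant answer matcher (numeric tolerance, unit handling) than the exact-match rule assumed here.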
Community
High-quality synthetic Chart data and strong Chart reasoning model.