WildTableBench: Benchmarking Multimodal Foundation Models on Table Understanding In the Wild Paper • 2605.01018 • Published 12 days ago • 1
ProBench: Judging Multimodal Foundation Models on Open-ended Multi-domain Expert Tasks Paper • 2503.06885 • Published Mar 10, 2025 • 4