MMLongCite: A Benchmark for Evaluating Fidelity of Long-Context Vision-Language Models Paper • 2510.13276 • Published Oct 15, 2025
Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm Paper • 2511.04570 • Published Nov 6, 2025 • 211
MAC-SLU: Multi-Intent Automotive Cabin Spoken Language Understanding Benchmark Paper • 2512.01603 • Published Dec 1, 2025
Agent2World: Learning to Generate Symbolic World Models via Adaptive Multi-Agent Feedback Paper • 2512.22336 • Published 26 days ago • 2
The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning Paper • 2601.06002 • Published 12 days ago • 48
The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning Paper • 2601.06002 • Published 12 days ago • 48
Towards Reasoning Era: A Survey of Long Chain-of-Thought for Reasoning Large Language Models Paper • 2503.09567 • Published Mar 12, 2025
Efficient Process Reward Model Training via Active Learning Paper • 2504.10559 • Published Apr 14, 2025 • 13
End-to-end Task-oriented Dialogue: A Survey of Tasks, Methods, and Future Directions Paper • 2311.09008 • Published Nov 15, 2023
OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation Paper • 2505.23885 • Published May 29, 2025
AI4Research: A Survey of Artificial Intelligence for Scientific Research Paper • 2507.01903 • Published Jul 2, 2025 • 5
Cross-lingual Prompting: Improving Zero-shot Chain-of-Thought Reasoning across Languages Paper • 2310.14799 • Published Oct 23, 2023
CCHall: A Novel Benchmark for Joint Cross-Lingual and Cross-Modal Hallucinations Detection in Large Language Models Paper • 2505.19108 • Published May 25, 2025
AutoPR: Let's Automate Your Academic Promotion! Paper • 2510.09558 • Published Oct 10, 2025 • 52
Aware First, Think Less: Dynamic Boundary Self-Awareness Drives Extreme Reasoning Efficiency in Large Language Models Paper • 2508.11582 • Published Aug 15, 2025 • 1
Beyond Correctness: Evaluating Subjective Writing Preferences Across Cultures Paper • 2510.14616 • Published Oct 16, 2025 • 12
COIG-Writer: A High-Quality Dataset for Chinese Creative Writing with Thought Processes Paper • 2510.14763 • Published Oct 16, 2025 • 13
ECM: A Unified Electronic Circuit Model for Explaining the Emergence of In-Context Learning and Chain-of-Thought in Large Language Model Paper • 2502.03325 • Published Feb 5, 2025 • 1
CoMT: A Novel Benchmark for Chain of Multi-modal Thought on Large Vision-Language Models Paper • 2412.12932 • Published Dec 17, 2024 • 1
Text2World: Benchmarking Large Language Models for Symbolic World Model Generation Paper • 2502.13092 • Published Feb 18, 2025 • 13