Beyond Memorization: A Multi-Modal Ordinal Regression Benchmark to Expose Popularity Bias in Vision-Language Models Paper • 2512.21337 • Published 4 days ago • 23
SCOPE: Prompt Evolution for Enhancing Agent Effectiveness Paper • 2512.15374 • Published 11 days ago • 5