GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators Paper • 2512.19682 • Published Dec 22, 2025 • 19
Beyond Binary Preference: Aligning Diffusion Models to Fine-grained Criteria by Decoupling Attributes Paper • 2601.04300 • Published Jan 7, 2026 • 3
AutoTool: Dynamic Tool Selection and Integration for Agentic Reasoning Paper • 2512.13278 • Published Dec 15, 2025
RLAnything: Forge Environment, Policy, and Reward Model in Completely Dynamic RL System Paper • 2602.02488 • Published Feb 2, 2026 • 33
TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular Reasoning Paper • 2510.06217 • Published Oct 7, 2025 • 66
TAP4LLM: Table Provider on Sampling, Augmenting, and Packing Semi-structured Data for Large Language Model Reasoning Paper • 2312.09039 • Published Dec 14, 2023