MihailSlutsky's picture

MihailSlutsky

MihailSlutsky

·

AI & ML interests

None yet

Recent Activity

liked a dataset 4 days ago

theairlabcmu/TartanGround

upvoted a paper 5 days ago

Imagine-then-Plan: Agent Learning from Adaptive Lookahead with World Models

upvoted a paper 5 days ago

Flow Equivariant World Models: Memory for Partially Observed Dynamic Environments

View all activity

Organizations

None yet

liked a dataset 4 days ago

theairlabcmu/TartanGround

Updated Oct 14, 2025 • 22.7k • 2

upvoted 19 papers 5 days ago

Imagine-then-Plan: Agent Learning from Adaptive Lookahead with World Models

Paper • 2601.08955 • Published 12 days ago • 13

Flow Equivariant World Models: Memory for Partially Observed Dynamic Environments

Paper • 2601.01075 • Published 22 days ago • 6

V-DPM: 4D Video Reconstruction with Dynamic Point Maps

Paper • 2601.09499 • Published 11 days ago • 9

LSRIF: Logic-Structured Reinforcement Learning for Instruction Following

Paper • 2601.06431 • Published 15 days ago • 12

ToolSafe: Enhancing Tool Invocation Safety of LLM-based agents via Proactive Step-level Guardrail and Feedback

Paper • 2601.10156 • Published 10 days ago • 24

Action100M: A Large-scale Video Action Dataset

Paper • 2601.10592 • Published 10 days ago • 27

Transition Matching Distillation for Fast Video Generation

Paper • 2601.09881 • Published 10 days ago • 31

Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding

Paper • 2601.10611 • Published 10 days ago • 26

CoF-T2I: Video Models as Pure Visual Reasoners for Text-to-Image Generation

Paper • 2601.10061 • Published 10 days ago • 30

DanQing: An Up-to-Date Large-Scale Chinese Vision-Language Pre-training Dataset

Paper • 2601.10305 • Published 10 days ago • 36

Beyond Static Tools: Test-Time Tool Evolution for Scientific Reasoning

Paper • 2601.07641 • Published 13 days ago • 45

Urban Socio-Semantic Segmentation with Vision-Language Reasoning

Paper • 2601.10477 • Published 10 days ago • 154

STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published 11 days ago • 185

PhyRPR: Training-Free Physics-Constrained Video Generation

Paper • 2601.09255 • Published 11 days ago • 3

AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts

Paper • 2601.11044 • Published 9 days ago • 33

Reasoning Models Generate Societies of Thought

Paper • 2601.10825 • Published 10 days ago • 11

ACoT-VLA: Action Chain-of-Thought for Vision-Language-Action Models

Paper • 2601.11404 • Published 9 days ago • 24

Your Group-Relative Advantage Is Biased

Paper • 2601.08521 • Published 12 days ago • 142

Future Optical Flow Prediction Improves Robot Control & Video Generation

Paper • 2601.10781 • Published 10 days ago • 19