-
Visual Representation Alignment for Multimodal Large Language Models
Paper • 2509.07979 • Published • 83 -
Language Models Can Learn from Verbal Feedback Without Scalar Rewards
Paper • 2509.22638 • Published • 70 -
Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models
Paper • 2510.05034 • Published • 50 -
Think-Then-Generate: Reasoning-Aware Text-to-Image Diffusion with LLM Encoders
Paper • 2601.10332 • Published • 28
Jeffrey Van de zande
Sexhuis
AI & ML interests
None yet
Recent Activity
updated
a collection
2 days ago
X1
new activity
3 months ago
openai/gpt-oss-safeguard-20b:streaming
updated
a collection
3 months ago
X1
Organizations
None yet