
Yong Jae Lee

yjlee0222


authored 3 papers 10 months ago

X-Fusion: Introducing New Modality to Frozen Large Language Models

Paper • 2504.20996 • Published Apr 29, 2025 • 13

YoChameleon: Personalized Vision and Language Generation

Paper • 2504.20998 • Published Apr 29, 2025 • 12

Efficient LLaMA-3.2-Vision by Trimming Cross-attended Visual Features

Paper • 2504.00557 • Published Apr 1, 2025 • 15
authored 4 papers over 1 year ago

TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models

Paper • 2410.10818 • Published Oct 14, 2024 • 16

Vinoground: Scrutinizing LMMs over Dense Temporal Reasoning with Short Videos

Paper • 2410.02763 • Published Oct 3, 2024 • 7

LLaRA: Supercharging Robot Learning Data for Vision-Language Policy

Paper • 2406.20095 • Published Jun 28, 2024 • 18

Matryoshka Multimodal Models

Paper • 2405.17430 • Published May 27, 2024 • 34
authored a paper about 2 years ago

Interfacing Foundation Models' Embeddings

Paper • 2312.07532 • Published Dec 12, 2023 • 12
authored 2 papers over 2 years ago

Improved Baselines with Visual Instruction Tuning

Paper • 2310.03744 • Published Oct 5, 2023 • 39

Generate Anything Anywhere in Any Scene

Paper • 2306.17154 • Published Jun 29, 2023 • 22