Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Yang's picture
1 9 3

Yang

diddytpq
·

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 4 months ago

MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe

Paper • 2509.18154 • Published Sep 16, 2025 • 52
upvoted a paper 6 months ago

4KAgent: Agentic Any Image to 4K Super-Resolution

Paper • 2507.07105 • Published Jul 9, 2025 • 105
upvoted a paper 7 months ago

GenRecal: Generation after Recalibration from Large to Small Vision-Language Models

Paper • 2506.15681 • Published Jun 18, 2025 • 39
upvoted an article 9 months ago
view article
Article

Fine-Tuning SigLIP2 for Image Classification

Mar 5, 2025
•
18
upvoted 3 papers about 1 year ago

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Paper • 2412.09596 • Published Dec 12, 2024 • 98

EXAONE 3.5: Series of Large Language Models for Real-world Use Cases

Paper • 2412.04862 • Published Dec 6, 2024 • 50

VisionZip: Longer is Better but Not Necessary in Vision Language Models

Paper • 2412.04467 • Published Dec 5, 2024 • 117
upvoted 2 papers over 1 year ago

Achieving Human Level Competitive Robot Table Tennis

Paper • 2408.03906 • Published Aug 7, 2024 • 28

Vision language models are blind

Paper • 2407.06581 • Published Jul 9, 2024 • 84
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs