Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
oguzhanercan 's Collections
World Models
Memory
Research
PassKto1
Finetuning Strategies
RAG
Embedding Space İnterpretability
MultiModal Reasoning
Transformer Optimization / LLM & VLLM etc
Large Language Models
Agentic Tools
Robotics
Reasoning
Auto Regressive Image Generation
Diffusion Language&MultiModal Modeling
Vision Reasoning
Subject Driven Generation Control
Representation Learning
Scene Generation
Training Theory
Image-Text Alignment
Efficent ML
Control Based Video Generation Models
Video Generation Backbone Models
Video Generation Style Models
Image-Video General Tasks
Generation Quality Enhancement
Diffusion/Flow Model Optimization
Voice
Datasets
Mobile Generative Models
Video Generation Control-Style Transfer
Diffusion-Score-Flow Guidance
Image Restoration (SR , Inpainting etc.)
General Theory
Image-Video MultiModal Understanding
Face Generation-Swap-Contol-Edit
Architectural Proposals
Generative Modeling Approachs
Image Editting
Video Generation
Diffusion Model Control
Image Generation

World Models

updated about 15 hours ago
Upvote
-

  • Emu3.5: Native Multimodal Models are World Learners

    Paper • 2510.26583 • Published Oct 30 • 108

  • Cambrian-S: Towards Spatial Supersensing in Video

    Paper • 2511.04670 • Published Nov 6 • 37
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs