Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
della20241 's Collections
image-recognize
temp
vision
multimedia
translator

multimedia

updated Nov 23, 2024
Upvote
-

  • Running on Zero
    MCP
    2.68k

    Background Removal

    🌘
    2.68k

    Remove background from images


  • Running on Zero
    Featured
    9.38k

    FLUX.1 [dev]

    🖥
    9.38k

    Generate stunning images from text descriptions


  • Running on Zero
    Featured
    2.69k

    Whisper

    📉
    2.69k

    Transcribe audio and YouTube videos into text instantly


  • Runtime error
    145

    Whisper JAX

    👀
    145

    Transcribe or translate audio from microphone, file, or YouTube


  • Runtime error
    Featured
    324

    Ovis1.6 Gemma2 9B

    🐑
    324

    Interact with a chatbot that understands text and images


  • Running on Zero
    Featured
    5.04k

    FLUX.1 [Schnell]

    🏎
    5.04k

    Generate unique images from text descriptions


  • Runtime error
    Featured
    2k

    Stable Diffusion 3.5 Large

    🏃
    2k

    Generate images with SD3.5


  • Running on Zero
    51

    Fast Whisper Turbo

    ⚡
    51

    Ultra-fast Whisper Turbo inference ⚡


  • Runtime error
    405

    UVR5 UI

    ⚡
    405

    Separate audio into stems using various models


  • Running on Zero
    221

    GPT SoVITS V2 Pro Plus

    🤗
    221

    Convert text to speech with reference audio guidance


  • Running on Zero
    Featured
    562

    Video Background Removal

    📽
    562

    Remove/Change background of video.


  • Running on Zero
    MCP
    Featured
    812

    Whisper Large V3

    🤫
    812

    Transcribe speech from audio or YouTube videos into text

Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs