Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Kreshnik 's Collections
music
OCR
3D
Language
Image
Voice
Papers
Model training

Voice

updated Mar 30
Upvote
-

  • microsoft/VibeVoice-1.5B

    Text-to-Speech • 3B • Updated Jan 22 • 226k • 2.38k

  • Configuration error
    Featured
    446

    FastVLM WebGPU

    🍎
    446

    Real-time video captioning powered by FastVLM


  • openbmb/VoxCPM-0.5B

    Text-to-Speech • Updated Sep 19, 2025 • 7.75k • 798

  • Running on CPU Upgrade
    84

    MiMo-Audio-Chat

    💬
    84

    Chat with Xiaomi MiMo-Audio using voice


  • FlashLabs/Chroma-4B

    Any-to-Any • Updated Jan 28 • 2.1k • 382

  • numind/NuMarkdown-8B-Thinking

    Image-to-Text • Updated Nov 13, 2025 • 76.4k • 452

  • CohereLabs/cohere-transcribe-03-2026

    Automatic Speech Recognition • Updated 13 days ago • 290k • 941
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs