Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
aikongfu 's Collections
embedding benchmark
AI agent
LLM
speech recognition
AI Coding
Computer Vision(Text to Image)
Text to Audio
Multimodal
Audio to text
Datasets
Text to Video
image to video

speech recognition

updated Nov 21, 2024
Upvote
-

  • Running on Zero
    Featured
    2.63k

    Whisper

    📉
    2.63k

    Transcribe audio files or YouTube videos into text


  • Runtime error
    Featured
    367

    Video Transcription Smart Summary

    ⚡
    367

    Generate summaries from YouTube videos or uploaded videos


  • Running on Zero
    Featured
    779

    Whisper Large V3

    🤫
    779

    Transcribe audio or YouTube videos into text


  • Paused
    845

    Video Dubbing (SoniTranslate)

    🌍
    845

    Video Dubbing with Open Source Projects


  • Build error
    272

    Faster Whisper Webui

    🚀
    272

    Transcribe audio to text with speaker diarization

Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs