Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
bezzam 's Collections
Omnilingual ASR (1,600+ Languages)
VibeVoice
Multimodel audio
Neural codecs
Speech recognition datasets
Text-to-speech datasets
DigiCam (CelebA)
DiffuserCam Mirflickr

Omnilingual ASR (1,600+ Languages)

updated 14 days ago

https://ai.meta.com/blog/omnilingual-asr-advancing-automatic-speech-recognition/

Upvote
1

  • Running on A100
    177

    Omnilingual ASR Media Transcription

    🌍
    177

    Transcribe audio or video into text in multiple languages


  • facebook/omnilingual-asr-corpus

    Viewer • Updated 11 days ago • 548k • 35.2k • 155

  • bezzam/omniASR-W2V-300M

    Automatic Speech Recognition • Updated 6 days ago

  • bezzam/omniASR-W2V-1B

    Updated 15 days ago

  • bezzam/omniASR-CTC-300M

    Automatic Speech Recognition • Updated 6 days ago

  • bezzam/omniASR-CTC-1B

    Updated 14 days ago
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs