samsam55's Collections

TTS & Speech to Text

updated Oct 16

  • Taming Text-to-Sounding Video Generation via Advanced Modality Condition and Interaction

    Paper • 2510.03117 • Published Oct 3 • 11

  • ResembleAI/chatterbox

    Text-to-Speech • Updated Sep 23 • 776k • 1.29k

  • thewh1teagle/phonikud

    0.3B • Updated Aug 24 • 164

  • UniMoE-Audio: Unified Speech and Music Generation with Dynamic-Capacity MoE

    Paper • 2510.13344 • Published Oct 15 • 61