TTS & Speech to Text - a samsam55 Collection

samsam55 's Collections

Reinforcement Learning Etc..

Run on CPU Optimizations

World View Creation (out painting 3D)

Visual Multi Modal LLM

TTS & Speech to Text

Misc

Agents

3D Models & Modeling

TTS & Speech to Text

updated Oct 16

Taming Text-to-Sounding Video Generation via Advanced Modality Condition and Interaction

Paper • 2510.03117 • Published Oct 3 • 11
ResembleAI/chatterbox

Text-to-Speech • Updated Sep 23 • 776k • • 1.29k
thewh1teagle/phonikud

0.3B • Updated Aug 24 • 164
UniMoE-Audio: Unified Speech and Music Generation with Dynamic-Capacity MoE

Paper • 2510.13344 • Published Oct 15 • 61