TTS - a Ambroser53 Collection

Ambroser53 's Collections

RAG

grpo

Embed

LoRA

Vision

Speech

active learning

SSM

RL

TTS

context

TTS

updated Sep 3

Autoregressive Speech Synthesis without Vector Quantization

Paper • 2407.08551 • Published Jul 11, 2024 • 17
Stable Audio Open

Paper • 2407.14358 • Published Jul 19, 2024 • 26
Zyphra/Zonos-v0.1-transformer

Text-to-Speech • Updated Jun 3 • 16.3k • 418
Slamming: Training a Speech Language Model on One GPU in a Day

Paper • 2502.15814 • Published Feb 19 • 69
LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis

Paper • 2505.02625 • Published May 5 • 22
microsoft/VibeVoice-1.5B

Text-to-Speech • 3B • Updated Sep 1 • 176k • 1.98k