Soul-AILab

community

https://www.soulapp.cn/soulX

AI & ML interests

None defined yet.

Recent Activity

tiamojames updated a model 26 days ago

Soul-AILab/SoulX-Podcast-1.7B-dialect

tiamojames updated a model 26 days ago

Soul-AILab/SoulX-Podcast-1.7B

tiamojames updated a Space 26 days ago

Soul-AILab/SoulX-Podcast-1.7B

Papers

SAC: Neural Speech Codec with Semantic-Acoustic Dual-Stream Quantization

tiamojames

updated 2 models 26 days ago

Soul-AILab/SoulX-Podcast-1.7B-dialect

Text-to-Speech • 2B • Updated 26 days ago • 1.16k • 24

Soul-AILab/SoulX-Podcast-1.7B

Text-to-Speech • 2B • Updated 26 days ago • 2.3k • 215

tiamojames

updated a Space 26 days ago

SoulX Podcast 1.7B

Realistic Long-form Podcasts Generation with Dialectal

tiamojames

in Soul-AILab/SoulX-Podcast-1.7B 27 days ago

ZeroGPU worker error AttributeError

#2 opened 27 days ago by

tiamojames

updated a Space 27 days ago

SoulX Podcast 1.7B Dialect

Realistic Long-form Podcasts Generation with Dialectal

tiamojames

published 2 Spaces 28 days ago

SoulX Podcast 1.7B

Realistic Long-form Podcasts Generation with Dialectal

SoulX Podcast 1.7B Dialect

Realistic Long-form Podcasts Generation with Dialectal

tiamojames

in Soul-AILab/SoulX-Podcast-1.7B-Dialect 29 days ago

Apply for community grant: Academic project (gpu and storage)

#1 opened 29 days ago by

tiamojames

in Soul-AILab/SoulX-Podcast-1.7B 29 days ago

Apply for community grant: Academic project (gpu and storage)

#1 opened 29 days ago by

tiamojames

authored 3 papers 30 days ago

OSUM-EChat: Enhancing End-to-End Empathetic Spoken Chatbot via Understanding-Driven Spoken Dialogue

Paper • 2508.09600 • Published Aug 13

SenSE: Semantic-Aware High-Fidelity Universal Speech Enhancement

Paper • 2509.24708 • Published Sep 29

SoulX-Podcast: Towards Realistic Long-form Podcasts with Dialectal and Paralinguistic Diversity

Paper • 2510.23541 • Published Oct 27 • 13

worstchan

authored a paper about 1 month ago

SAC: Neural Speech Codec with Semantic-Acoustic Dual-Stream Quantization

Paper • 2510.16841 • Published Oct 19

Xinsheng-Wang

authored a paper 9 months ago

Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens

Paper • 2503.01710 • Published Mar 3 • 6

worstchan

authored a paper 11 months ago

SLAM-Omni: Timbre-Controllable Voice Interaction System with Single-Stage Training

Paper • 2412.15649 • Published Dec 20, 2024 • 1

worstchan

authored a paper about 1 year ago

SLAM-AAC: Enhancing Audio Captioning with Paraphrasing Augmentation and CLAP-Refine through LLMs

Paper • 2410.09503 • Published Oct 12, 2024

worstchan

authored a paper almost 2 years ago

EAT: Self-Supervised Pre-Training with Efficient Audio Transformer

Paper • 2401.03497 • Published Jan 7, 2024 • 1