metadata
language:
- zh
- en
tags:
- speech-synthesis
- speech-to-speech
- voice-conversion
- pytorch
- audio
- chinese-tts
- multi-speaker
- convolution
- encoder-decoder
license: apache-2.0
datasets:
- vctk
library_name: pytorch
Convbased
Github: https://github.com/Convbased/Convbased-Studio
This project focuses on training high-quality pre-trained models.
| Feature Extraction | Vocoder | Sample Rate 40k | Sample Rate 48k |
|---|---|---|---|
| contentvec | hifigannsf | β | β |
| contentvec | sifigan | β | β |
| contentvec | bigvgan | β | β |
| spin | hifigannsf | β | β |
| spin | sifigan | β | β |
| spin-v2 | bigvgan | β | β |
| chinese-hubert-base | hifigannsf | β | β |
Training code from Applio.
Dedicated to advancing Chinese speech synthesis technology. These base models have been used for fine-tuning most models at Convbased Studio.