metadata
language:
  - zh
  - en
tags:
  - speech-synthesis
  - speech-to-speech
  - voice-conversion
  - pytorch
  - audio
  - chinese-tts
  - multi-speaker
  - convolution
  - encoder-decoder
license: apache-2.0
datasets:
  - vctk
library_name: pytorch

Convbased

GitHub: https://github.com/Convbased/Convbased-Studio

This project focuses on training high-quality pre-trained base models for fine-tuning.

| Feature Extraction | Vocoder | Sample Rate 40k | Sample Rate 48k |
| --- | --- | --- | --- |
| contentvec | hifigannsf | ❌ | ✅ |
| contentvec | sifigan | ❌ | ✅ |
| contentvec | bigvgan | ✅ | ❌ |
| spin | hifigannsf | ❌ | ✅ |
| spin | sifigan | ❌ | ✅ |
| spin-v2 | bigvgan | ✅ | ❌ |
| chinese-hubert-base | hifigannsf | ❌ | ✅ |
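
The availability table above can also be written as a small lookup, which may help when checking a combination before starting a fine-tune. This is only a sketch: the names `AVAILABLE` and `is_available` are hypothetical and not part of Convbased Studio or Applio, and it assumes "40k" and "48k" mean 40000 Hz and 48000 Hz.

```python
# Availability of pre-trained models, transcribed from the table above.
# Keys are (feature_extraction, vocoder); values are the sample rates (Hz)
# marked as available. Assumption: "40k" = 40000 Hz and "48k" = 48000 Hz.
AVAILABLE = {
    ("contentvec", "hifigannsf"): {48000},
    ("contentvec", "sifigan"): {48000},
    ("contentvec", "bigvgan"): {40000},
    ("spin", "hifigannsf"): {48000},
    ("spin", "sifigan"): {48000},
    ("spin-v2", "bigvgan"): {40000},
    ("chinese-hubert-base", "hifigannsf"): {48000},
}


def is_available(feature_extraction: str, vocoder: str, sample_rate: int) -> bool:
    """Return True if the table lists a model for this combination.

    Hypothetical helper for illustration only; unknown combinations
    are reported as unavailable rather than raising an error.
    """
    return sample_rate in AVAILABLE.get((feature_extraction, vocoder), set())


if __name__ == "__main__":
    # Print every listed combination with its available sample rates.
    for (extractor, vocoder), rates in AVAILABLE.items():
        print(f"{extractor} + {vocoder}: {sorted(rates)}")
```

For example, `is_available("contentvec", "bigvgan", 40000)` is true under this table, while the same pair at 48000 Hz is not.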

The training code comes from Applio.

Dedicated to advancing Chinese speech synthesis technology. These base models have been used for fine-tuning most models at Convbased Studio.