Chatterbox Persian-Farsi

Training high-quality TTS with low-resource data

Chatterbox-TTS-Persian-Farsi is a text-to-speech (TTS) model trained on data that I cleaned, denoised, and filtered.

The total cost of training this TTS model was $150 on my cloud hardware.

If you find this model useful and high-quality and would like to support my work, you can support me on Ko-fi or like the model on Hugging Face.

Dataset: Thomcles/Persian-Farsi-Speech

Demo audio samples:

"سلام! به آزمایش تبدیل متن به گفتار خوش آمدید."

"سه سیب سرخ روی سینی سیمی است"

"دیروز در تهران باران شد، امروز آفتابی است"

"قیمت لپ‌تاپ جدید من پنجاه میلیون تومان است."

"علی، نرگس و یوسف به دانشگاه شیراز رفتند."

"لطفاً جملهٔ قبل را دوباره تکرار کن، دوباره تکرار کن، دوباره تکرار کن!"

💻 Inference Code

First, download the file t3_fa_v2.safetensors from Hugging Face and place it in the current directory.

from chatterbox import mtl_tts
import torchaudio as ta
from safetensors.torch import load_file as load_safetensors

device = "cpu"  # or "mps" or "cuda"

multilingual_model = mtl_tts.ChatterboxMultilingualTTS.from_pretrained(device=device)

# Load the Persian T3 weights from t3_fa_v2.safetensors in the current
# directory and swap them into the multilingual model.
t3_state = load_safetensors("t3_fa_v2.safetensors", device="cpu")
multilingual_model.t3.load_state_dict(t3_state)
multilingual_model.t3.to(device).eval()

persian_text = "سلام! به آزمایش تبدیل متن به گفتار خوش آمدید."
wav_persian = multilingual_model.generate(persian_text, language_id=None)
ta.save("test-fa.wav", wav_persian, multilingual_model.sr)
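
The inference code above expects t3_fa_v2.safetensors to be in the current directory. As a minimal sketch, a check like the one below could run before loading the weights so that a missing file produces a clear error. `require_weights` and `WEIGHTS_FILE` are hypothetical names introduced here for illustration; they are not part of chatterbox or safetensors.

```python
from pathlib import Path

# File name taken from the inference code above.
WEIGHTS_FILE = "t3_fa_v2.safetensors"


def require_weights(path: str = WEIGHTS_FILE) -> Path:
    """Return the absolute path to the weights file, or raise a clear error.

    Hypothetical helper for illustration only; not part of chatterbox.
    """
    weights = Path(path).resolve()
    if not weights.is_file():
        raise FileNotFoundError(
            f"Weights file not found: {weights}. "
            "Download it from Hugging Face and place it in the current directory."
        )
    return weights
```

With this helper, the loading line could be written as `load_safetensors(str(require_weights()), device="cpu")`.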