alytts
Generate speech from text using OpenAI API
Generate speech from text using OpenAI API
Text-to-Speech, Speech-to-Text, and Language Recognition
Clone a voice using a text and audio sample
Generate audio from text using pre-trained models
Create custom voice clones using text input
Create interactive music playlists with AI assistance
Generate audio effects from video using image caption
Generate voice from text with customizable audio source
A demo of MetaVoice 1B, a new TTS model by MetaVoice.
Convert text to speech
Run a web-based application
Convert audio to text
Convert voice to another voice
Generate or edit spoken audio from text
High-fidelity Text-To-Speech
Languages ru,en,zh-cn,ja,de,fr,it,pt,pl,tr,ko,nl,cs,ar,es,hu
Generate music powered by AI
Convert voice to text
Generate audio by cloning a voice
Generate speech from text using ElevenLabs voices
Clone a voice to speak any text
Generate audio from text prompts
Transcribe or translate audio files
Generate speech from text
Transcribe audio to text with speaker diarization
Generate speech from text using various voices
easy download youtube audios with gradio
Transform a report or document into an interview/discussion
Convert text to audio and vice versa
Generate music from text descriptions
Convert audio to text with ease and accuracy.
Restore degraded audio using a Transformer-based model
Generate audio from text using selected characters
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Whisper Transcribe MP3 files, use a GPU to convert faster!
Vocal and background audio separator
Audio-Driven Portrait Animations
Fixed fork of the original audio sr!
Generate speech from text with or without voice cloning
Generate speech from text
WebGPU text-to-Speech powered by OuteTTS and Transformers.js
Transform audio into text using a web-based model
Real-time in-browser speech recognition
High-quality speech synthesis powered by Kokoro TTS
Translate and synthesize speech to English
Make Custom Voices With KokoroTTS
Zero Shot voice cloning with llasa 3b (Unofficial Demo)
Analyze music to identify genre, instrument, mood, and more
Generate Podcast using Kokoro-TTS!
Blazingly Fast and Embarrassingly Simple Song Generation
Conversational speech generation
Generate text and speech from text, audio, images, and videos
Generate audio from text and video prompts
SText to Audio(Sound SFX) Generator
Demo for OpenF5-TTS
A Step Towards Music Generation Foundation Model
Expressive Zeroshot TTS
Extraction & Reconstruction for Efficient Speech Separation
Generate speech from text using various TTS services
Generate a custom song from lyrics and optional prompts
Generate waveform video from audio
Generate speech from text with customizable voice and speed
Audio Flamingo 3 Demo
Audio Flamingo 3 demo for multi-turn multi-audio chat
Generate speech from text with voice selection
Demo space for Mistral latest speech models
Search audio for relevant chunks
Higgs Audio Demo
State-of-the-art audio transcription in your browser
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Conversational speech generation
State-of-the-art TTS model under 25MB
Audio Gen, Audio Style Transfer and Audio InPainting
Convert audio to text with context and language options
Generate speech from text with voice options
Translate and transcribe live speech in real-time
Generate captions from audio
Generate speech from text using a reference audio sample
Free Text-To-Speech generator with Emotion control (OpenAI)
Generate speech from text using selected models
Demo of our new open source model maya1