Kokoro TTS
Upgraded to v1.0!
Easily expand image boundaries
Generate captions for images with various styles and options
Use NVIDIA H100 GPU
Scalable and Versatile 3D Generation from images
Generate speech from text using selected models
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Universal 3D World Reconstruction with Any Prior Prompting
Qwen-Image-2509-CharacterSheet
nanonets2 / dots.ocr / olmOCR2 / chandraOCR
Generate music from lyrics and prompts
Clarity AI Upscaler Reproduction
Generate high-quality videos from text prompts and images
Chatterbox TTS supporting 23 languages
Fast 8-step inference of Qwen Image Edit 2509
Generate analysis and response based on policy and prompt
Generate captions for images
Upscale low-resolution images to high resolution
The Ultimate Anime-themed SDXL model
VGGT (CVPR 2025)
Generate images from text prompts
4-step Qwen Image Edit 2509 w/ a local caption model.
Multimodal Instruction-based Editing and Generation
Text-to-3D and Image-to-3D Generation