Running on Zero MCP 395 Multimodal OCR 🍍 395 nanonets ocr2 / olmocr / qwen2vl ocr / aya vision / rolmocr
Runtime error 216 IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System 🎙 216 Generate speech from text using a reference audio
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 214k • 1.56k
Running on Zero Featured 2.79k F5-TTS 🗣 2.79k F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Running 577 NIST FRVT TOP 1 Face Recognition, Face Liveness Detection, Face Analysis 🥇 577 Compare and analyze faces in images
Running 172 MiniAiLive Face Recognition WebAPI Playground 🥇 172 Advanced 1:1 & 1:N Face Matching Technology, On-premise SDK