Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
HIT-TMG
/
UniMoE-Audio-Preview
like
8
Follow
HITsz-Text and Multimodal Generative Intelligence Group(TMG)
73
Safetensors
English
Chinese
uni_audio_rvq_qwen2_5vl_moe
MoE
Unified Generation
Speech and Music
Multi-modal
custom_code
arxiv:
2510.13344
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
1
main
UniMoE-Audio-Preview
/
imgs
/
Speech_Generation.png
Commit History
Upload 4 files
0565c27
verified
foggyforest
commited on
Oct 15