Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
HIT-TMG
/
UniMoE-Audio-Preview
like
8
Follow
HITsz-Text and Multimodal Generative Intelligence Group(TMG)
75
Safetensors
English
Chinese
uni_audio_rvq_qwen2_5vl_moe
MoE
Unified Generation
Speech and Music
Multi-modal
custom_code
arxiv:
2510.13344
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
1
main
UniMoE-Audio-Preview
/
imgs
/
VT2M.png
foggyforest
Upload 4 files
0565c27
verified
about 1 month ago
download
Copy download link
history
contribute
delete
Safe
92.8 kB