Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
bezzam
's Collections
Omnilingual ASR (1,600+ Languages)
VibeVoice
Multimodel audio
Neural codecs
Speech recognition datasets
Text-to-speech datasets
DigiCam (CelebA)
DiffuserCam Mirflickr
VibeVoice
updated
3 days ago
Upvote
-
bezzam/VibeVoice-1.5B
Text-to-Speech
•
3B
•
Updated
about 11 hours ago
•
1.37k
bezzam/VibeVoice-7B
Text-to-Speech
•
9B
•
Updated
about 11 hours ago
•
333
bezzam/VibeVoice-AcousticTokenizer
Feature Extraction
•
0.7B
•
Updated
2 days ago
•
36
bezzam/VibeVoice-SemanticTokenizer
Feature Extraction
•
0.3B
•
Updated
2 days ago
•
21
bezzam/vibevoice_samples
Viewer
•
Updated
about 11 hours ago
•
2
•
478
VibeVoice Technical Report
Paper
•
2508.19205
•
Published
Aug 26
•
123
Upvote
-
Share collection
View history
Collection guide
Browse collections