Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
espnet
/
powsm
like
5
Follow
ESPnet
311
Automatic Speech Recognition
ESPnet
4 datasets
multilingual
audio
phone-recognition
grapheme-to-phoneme
phoneme-to-grapheme
arxiv:
2510.24992
License:
cc-by-4.0
Model card
Files
Files and versions
xet
Community
1
Use this model
main
powsm
1.38 GB
1 contributor
History:
11 commits
cjli
update arxiv
b7676ce
about 1 month ago
data
add model files
about 1 month ago
exp
add train/feats_stats.npz
about 1 month ago
.gitattributes
Safe
1.52 kB
initial commit
about 1 month ago
README.md
Safe
3.45 kB
update arxiv
about 1 month ago
meta.yaml
Safe
341 Bytes
patch yaml file
about 1 month ago