This collection gathers audio datasets whose transcripts include non-verbal tags such as <laugh> and <sigh>.
Christopher Özbek
oezi13
AI & ML interests
Text-To-Speech
Recent Activity
published a model 30 days ago: oezi13/PlayDiffusion-nonverbal
updated a model 30 days ago: oezi13/PlayDiffusion-nonverbal
new activity about 1 month ago: nvidia/canary-1b-v2: Timestamp accuracy benchmarks?