facebook/wav2vec2-base-10k-voxpopuli-ft-hr
Automatic Speech Recognition
HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming
SAM Audio: Segment Anything in Audio