Motion2Language, unsupervised learning of synchronized semantic motion segmentation Paper • 2310.10594 • Published Oct 16, 2023 • 1
StreamUni: Achieving Streaming Speech Translation with a Unified Large Speech-Language Model Paper • 2507.07803 • Published Jul 10
Finetuning End-to-End Models for Estonian Conversational Spoken Language Translation Paper • 2407.03809 • Published Jul 4, 2024
TimeChat-Online: 80% Visual Tokens are Naturally Redundant in Streaming Videos Paper • 2504.17343 • Published Apr 24 • 13