DyPE: Dynamic Position Extrapolation for Ultra High Resolution Diffusion Paper • 2510.20766 • Published 11 days ago • 31
Advancing Speech Understanding in Speech-Aware Language Models with GRPO Paper • 2509.16990 • Published Sep 21 • 18
StressTest Collection Model and Data from the paper - StressTest: Can YOUR Speech LM Handle the Stress? • 4 items • Updated Jun 29 • 1
Joint Audio and Symbolic Conditioning for Temporally Controlled Text-to-Music Generation Paper • 2406.10970 • Published Jun 16, 2024 • 1
HebDB: a Weakly Supervised Dataset for Hebrew Speech Processing Paper • 2407.07566 • Published Jul 10, 2024
Auto-Regressive vs Flow-Matching: a Comparative Study of Modeling Paradigms for Text-to-Music Generation Paper • 2506.08570 • Published Jun 10 • 33