How2: A Large-scale Dataset for Multimodal Language Understanding Paper • 1811.00347 • Published Nov 1, 2018
Multimodal Large Language Models with Fusion Low Rank Adaptation for Device Directed Speech Detection Paper • 2406.09617 • Published Jun 13, 2024