MDD-Net: Multimodal Depression Detection through Mutual Transformer Paper • 2508.08093 • Published Aug 11
MMFformer: Multimodal Fusion Transformer Network for Depression Detection Paper • 2508.06701 • Published Aug 8
BaDLAD: A Large Multi-Domain Bengali Document Layout Analysis Dataset Paper • 2303.05325 • Published Mar 9, 2023
GNN-ViTCap: GNN-Enhanced Multiple Instance Learning with Vision Transformers for Whole Slide Image Classification and Captioning Paper • 2507.07006 • Published Jul 9 • 1
FusionEnsemble-Net: An Attention-Based Ensemble of Spatiotemporal Networks for Multimodal Sign Language Recognition Paper • 2508.09362 • Published Aug 12
A Signer-Invariant Conformer and Multi-Scale Fusion Transformer for Continuous Sign Language Recognition Paper • 2508.09372 • Published Aug 12