SDIF-DA: A Shallow-to-Deep Interaction Framework with Data Augmentation for Multi-Modal Intent Detection | IEEE Conference Publication | IEEE Xplore