GenEn-MNER: Enhancing Nested Chinese NER With Multimodal Fusion and Alignment via Speech-to-Text Generation | IEEE Journals & Magazine | IEEE Xplore