CAT+: Investigating and Enhancing Audio-Visual Understanding in Large Language Models | IEEE Journals & Magazine | IEEE Xplore