Multi-Granularity Relational Attention Network for Audio-Visual Question Answering | IEEE Journals & Magazine | IEEE Xplore