Adapting Multimodal Large Language Models for Video Question Answering by Capturing Question-critical and Coherent Moments | IEEE Journals & Magazine | IEEE Xplore