VTVBrain: A Two-stage Brain Encoding Model for Decoding Key Neural Responses in Multimodal Contexts | IEEE Conference Publication | IEEE Xplore