Learning to Exploit Temporal Structure for Biomedical Vision-Language Processing | IEEE Conference Publication | IEEE Xplore