A Comparison of Pre-trained Vision-and-Language Models for Multimodal Representation Learning across Medical Images and Reports | IEEE Conference Publication | IEEE Xplore