Aligning Multimodal Biomedical Images and Language via One Large Vision-Language Model | IEEE Conference Publication | IEEE Xplore