Instruction-Augmented Multimodal Alignment for Image-Text and Element Matching | IEEE Conference Publication | IEEE Xplore