Revolutionizing NLP: Multimodal Integration for Enhanced Image-to-Text Extraction | IEEE Conference Publication | IEEE Xplore