Multimodal Deep Learning: Integrating Text and Image Embeddings with Attention Mechanism | IEEE Conference Publication | IEEE Xplore