Learning to Align Arabic and English Text to Remote Sensing Images Using Transformers | IEEE Conference Publication | IEEE Xplore