Multimodal Fusion Methods with Vision Transformers for Remote Sensing Semantic Segmentation | IEEE Conference Publication | IEEE Xplore