A Unified Multi-modal Structure for Retrieving Tracked Vehicles through Natural Language Descriptions | IEEE Conference Publication | IEEE Xplore