Multi-Modal Object Tracking with Vision-Language Adaptive Fusion and Alignment | IEEE Conference Publication | IEEE Xplore