Audio-Enhanced Text-to-Video Retrieval using Text-Conditioned Feature Alignment | IEEE Conference Publication | IEEE Xplore