DuST: Dual Swin Transformer for Multi-modal Video and Time-Series Modeling | IEEE Conference Publication | IEEE Xplore