SODA: Structural Pre-Decoupling and Co-Aligning for Video Compositional Representation | IEEE Journals & Magazine | IEEE Xplore