CM-PIE: Cross-Modal Perception for Interactive-Enhanced Audio-Visual Video Parsing | IEEE Conference Publication | IEEE Xplore