MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis | IEEE Conference Publication