Realistic Lip-Sync Generation from Text for Multimodal Applications | IEEE Conference Publication | IEEE Xplore