Audio to Deep Visual: Speaking Mouth Generation Based on 3D Sparse Landmarks


Abstract:

Having a system that automatically generates a talking mouth in sync with input speech would enhance speech communication and enable many novel applications. This article presents a new model that generates 3D talking-mouth landmarks from Chinese speech. We use sparse 3D landmarks to model mouth motion; they are easy to capture and provide sufficient lip accuracy. The 4D mouth-motion dataset was collected with our self-developed facial capture device, filling a gap in Chinese speech-driven lip datasets. Experimental results show that the generated talking landmarks achieve accurate, smooth, and natural 3D mouth movements.
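The abstract describes a pipeline that maps per-frame audio features to sequences of sparse 3D mouth landmarks. The actual model architecture is not specified here, so the following is only a minimal sketch of that data flow: the landmark count, feature dimension, stand-in linear regressor, and smoothing step are all assumptions for illustration, not the paper's method.

```python
import numpy as np

# Hypothetical sizes -- not taken from the paper, chosen only to
# illustrate the audio-to-landmark data flow.
NUM_LANDMARKS = 20      # sparse 3D mouth landmarks per frame (assumed)
AUDIO_FEAT_DIM = 13     # e.g. MFCC coefficients per audio frame (assumed)

rng = np.random.default_rng(0)
W = rng.standard_normal((AUDIO_FEAT_DIM, NUM_LANDMARKS * 3)) * 0.01
b = np.zeros(NUM_LANDMARKS * 3)

def audio_to_landmarks(audio_feats: np.ndarray) -> np.ndarray:
    """Map per-frame audio features (T, F) to 3D mouth landmarks (T, K, 3).

    A stand-in linear regressor followed by temporal smoothing; the real
    model would be a learned network trained on the 4D mouth-motion data.
    """
    T = audio_feats.shape[0]
    raw = audio_feats @ W + b  # (T, K*3)
    # Moving-average smoothing over time, a simple proxy for the
    # smooth, natural motion the paper reports.
    kernel = np.ones(3) / 3.0
    smoothed = np.apply_along_axis(
        lambda col: np.convolve(col, kernel, mode="same"), 0, raw)
    return smoothed.reshape(T, NUM_LANDMARKS, 3)

feats = rng.standard_normal((50, AUDIO_FEAT_DIM))  # 50 audio frames
landmarks = audio_to_landmarks(feats)
print(landmarks.shape)  # (50, 20, 3)
```

The key point the sketch captures is the shape contract: a (frames, features) audio tensor in, a (frames, landmarks, 3) coordinate tensor out, with temporal filtering to keep consecutive frames coherent.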
Date of Conference: 25-29 March 2023
Date Added to IEEE Xplore: 01 May 2023
Conference Location: Shanghai, China
