Abstract:
In this paper, we propose Ventriloquist-Net: a talking-head generation model that uses only a speech segment and a single source face image. It places emphasis on emotive expressions, with cues for generating these expressions inferred implicitly from the speech clip alone. We formulate our framework to comprise independently trained modules to expedite convergence. This not only allows extension to datasets in a semi-supervised manner but also facilitates handling in-the-wild source images. Quantitative and qualitative evaluations of the generated videos demonstrate state-of-the-art performance, even on unseen input data. Implementation and supplementary videos are available at https://github.com/dipnds/VentriloquistNet.
Date of Conference: 16-19 October 2022
Date Added to IEEE Xplore: 18 October 2022