Conferences >2017 IEEE International Confe...

Word level lyrics-audio synchronization using separated vocals

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

The massive amount of digital music data available necessitates automated methods for processing, classifying and organizing large volumes of songs. As music discovery an...Show More

Metadata

Abstract:

The massive amount of digital music data available necessitates automated methods for processing, classifying and organizing large volumes of songs. As music discovery and interactive music applications become commonplace, the ability to synchronize lyric text information with an audio recording has gained interest. This paper presents an approach for lyric-audio alignment by comparing synthesized speech with a vocal track removed from an instrument mixture using source separation. We take a hierarchical approach to solve the problem, assuming a set of paragraph-music segment pairs is given and focus on within-segment lyric alignment at the word level. A synthesized speech signal is generated to reflect the properties of the music signal by controlling the speech rate and gender. Dynamic time warping finds the shortest path between the synthesized speech and separated vocal. The resulting path is used to calculate the timestamps of words in the original signal. The system results in approximately half a second of misalignment error on average. Finally, we discuss the challenges and suggest improvements to the method.

Published in: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Date of Conference: 05-09 March 2017

Date Added to IEEE Xplore: 19 June 2017

ISBN Information:

Electronic ISSN: 2379-190X

DOI: 10.1109/ICASSP.2017.7952235

Conference Location: New Orleans, LA, USA

Contents

References is not available for this document.

Word level lyrics-audio synchronization using separated vocals

Abstract:

Metadata

Abstract:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Word level lyrics-audio synchronization using separated vocals

Alerts

Abstract:

Metadata

Abstract:

References

IEEE Account

Purchase Details

Profile Information

Need Help?