Abstract:
Speech synthesis technology is evolving from the concept of converting texts into voices to directly learning and imitating user voices. In the past, the voice was genera...Show MoreMetadata
Abstract:
Speech synthesis technology is evolving from the concept of converting texts into voices to directly learning and imitating user voices. In the past, the voice was generated through several stages of recording frequently used sentences and synthesizing text into sounds that wanted to be converted. However, due to the limitations of naturalness and accuracy, research has been conducted to overcome this in various fields such as phonetics, linguistics, and statistics, and now the development of big data, AI technology, and parallel processing technology has increased to reflect human tone and timbre. If the voice information, which is one of the biometric information that can be used as personally identifiable information, can be made so natural that it is difficult for a machine to distinguish it from a human, personal information may be invaded by a security threat that may exist at any time. Therefore, in this paper, we explain representative speech synthesis models and introduce voice generation models, which are applications, to find out how to convert and synthesize Korean into other languages, and to look at what may become a security issue in the future.
Date of Conference: 17-20 January 2021
Date Added to IEEE Xplore: 10 March 2021
ISBN Information: