Abstract:
Articulatory features (AFs) provide language-independent attributes by exploiting speech production knowledge. This paper proposes a cross-lingual automatic speech recognition (ASR) approach based on AFs. Various neural network (NN) architectures are explored to extract cross-lingual AFs, and their performance is studied. The architectures include the multilayer perceptron (MLP), the convolutional NN (CNN), and the long short-term memory recurrent NN (LSTM). In our cross-lingual setup, only the source language (English, representing a well-resourced language) is used to train the AF extractors. AFs are then generated for the target language (Mandarin, representing an under-resourced language) using the trained extractors. The frame-classification accuracy indicates that the LSTM can transfer knowledge from the well-resourced to the under-resourced language through robust cross-lingual AFs. The final ASR system is built using traditional approaches (e.g. hybrid models), combining AFs with conventional MFCCs. The results demonstrate that the cross-lingual AFs improve performance on the under-resourced ASR task even though the source and target languages come from different language families. Overall, the proposed cross-lingual ASR approach provides a slight improvement over the monolingual LF-MMI and cross-lingual (acoustic-model-adaptation-based) ASR systems.
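The abstract states that AFs are combined with conventional MFCCs before being fed to the hybrid acoustic model. A common way to realize such a combination is frame-level feature concatenation; the sketch below illustrates that idea only. The dimensions (13 MFCCs, 8 AF posteriors) and the function name are illustrative assumptions, not taken from the paper.

```python
def combine_features(mfcc, af_posteriors):
    """Concatenate per-frame MFCC vectors with AF posterior vectors.

    mfcc          : list of frames, each a list of MFCC coefficients
    af_posteriors : list of frames, each a list of AF posterior values
    Both inputs must have the same number of frames (time alignment is
    assumed to be handled upstream by the feature extraction pipeline).
    """
    assert len(mfcc) == len(af_posteriors), "frame counts must match"
    return [m + a for m, a in zip(mfcc, af_posteriors)]

# Illustrative dimensions: 13-dim MFCCs and 8 AF posteriors per frame.
num_frames = 100
mfcc = [[0.0] * 13 for _ in range(num_frames)]
afs = [[0.0] * 8 for _ in range(num_frames)]

feats = combine_features(mfcc, afs)
print(len(feats), len(feats[0]))  # 100 21
```

In practice the AF posteriors would come from the trained LSTM extractor and the concatenated features would feed the hybrid model's input layer, but those components are outside the scope of this sketch.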
Published in: 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)
Date of Conference: 18-21 November 2019
Date Added to IEEE Xplore: 05 March 2020