Conferences >2019 22nd Conference of the O...

Recognition and translation of code-switching speech utterances

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

Code-switching (CS), a hallmark of worldwide bilingual communities, refers to a strategy adopted by bilinguals (or multilinguals) who mix two or more languages in a disco...Show More

Notes: This article was mistakenly omitted from the original submission to IEEE Xplore. It is now included as part of the conference record.

Metadata

Abstract:

Code-switching (CS), a hallmark of worldwide bilingual communities, refers to a strategy adopted by bilinguals (or multilinguals) who mix two or more languages in a discourse often with little change of interlocutor or topic. The units and the locations of the switches may vary widely from single-word switches to whole phrases (beyond the length of the loanword units). Such phenomena pose challenges for spoken language technologies, i.e., automatic speech recognition (ASR), since the systems need to be able to handle the input in a multilingual setting. Several works constructed a CS ASR on many different language pairs. But the common aim of developing a CS ASR is merely for transcribing CS-speech utterances into CS-text sentences within a single individual. In contrast, in this study, we address the situational context that happens during dialogs between CS and non-CS (monolingual) speakers and support monolingual speakers who want to understand CS speakers. We construct a system that recognizes and translates from codeswitching speech to monolingual text. We investigated several approaches, including a cascade of ASR and a neural machine translation (NMT), a cascade of ASR and a deep bidirectional language model (BERT), an ASR that directly outputs monolingual transcriptions from CS speech, and multi-task learning. Finally, we evaluate and discuss these four ways on a Japanese- English CS to English monolingual task.

Notes: This article was mistakenly omitted from the original submission to IEEE Xplore. It is now included as part of the conference record.

Published in: 2019 22nd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA)

Date of Conference: 25-27 October 2019

Date Added to IEEE Xplore: 23 April 2020

ISBN Information:

ISSN Information:

DOI: 10.1109/O-COCOSDA46868.2019.9060847

Conference Location: Cebu, Philippines

Recognition and translation of code-switching speech utterances

Abstract:

Metadata

Abstract:

ISSN Information:

IEEE Account

Purchase Details

Profile Information

Need Help?

Recognition and translation of code-switching speech utterances

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

Authors

Citations

Keywords

Metrics

IEEE Account

Purchase Details

Profile Information

Need Help?