Conferences >2021 IEEE Automatic Speech Re...

A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

Non-autoregressive (NAR) models simultaneously generate multiple outputs in a sequence, which significantly reduces the inference speed at the cost of accuracy drop compa...Show More

Metadata

Abstract:

Non-autoregressive (NAR) models simultaneously generate multiple outputs in a sequence, which significantly reduces the inference speed at the cost of accuracy drop compared to autoregressive baselines. Showing great potential for real-time applications, an increasing number of NAR models have been explored in different fields to mitigate the performance gap against AR models. In this work, we conduct a comparative study of various NAR modeling methods for end-to-end automatic speech recognition (ASR). Experiments are performed in the state-of-the-art setting using ESPnet. The results on various tasks provide interesting findings for developing an understanding of NAR ASR, such as the accuracy-speed trade-off and robustness against long-form utterances. We also show that the techniques can be combined for further improvement and applied to NAR end-to-end speech translation. All the implementations are publicly available to encourage further research in NAR speech processing.

Published in: 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)

Date of Conference: 13-17 December 2021

Date Added to IEEE Xplore: 03 February 2022

ISBN Information:

DOI: 10.1109/ASRU51503.2021.9688157

Conference Location: Cartagena, Colombia

Funding Agency:

Contents

References is not available for this document.

A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation

Abstract:

Metadata

Abstract:

Funding Agency:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation

Alerts

Abstract:

Metadata

Abstract:

Funding Agency:

Authors

Figures

References

Citations

Keywords

Metrics

Footnotes

References

IEEE Account

Purchase Details

Profile Information

Need Help?