Uncovering Bias in ASR Systems: Evaluating Wav2vec2 and Whisper for Dutch speakers | IEEE Conference Publication | IEEE Xplore

Uncovering Bias in ASR Systems: Evaluating Wav2vec2 and Whisper for Dutch speakers


Abstract:

It is crucial that ASR systems can handle the wide range of variations in speech of speakers from different demographic groups, with different speaking styles, and of spe...Show More

Abstract:

It is crucial that ASR systems can handle the wide range of variations in speech of speakers from different demographic groups, with different speaking styles, and of speakers with (dis)abilities. A potential quality-of-service harm arises when ASR systems do not perform equally well for everyone. ASR systems may exhibit bias against certain types of speech, such as non-native accents, different age groups and gender. In this study, we evaluate two widely-used neural network-based architectures: Wav2vec2 and Whisper on potential biases for Dutch speakers. We used the Dutch speech corpus JASMIN as a test set containing read and conversational speech in a human-machine interaction setting. The results reveal a significant bias against non-natives, children and elderly and some regional dialects. The ASR systems generally perform slightly better for women than for men.
Date of Conference: 25-27 October 2023
Date Added to IEEE Xplore: 15 November 2023
ISBN Information:

ISSN Information:

Conference Location: Bucharest, Romania

Contact IEEE to Subscribe

References

References is not available for this document.