Abstract:
The robustness of automatic speech recognition systems can be improved by exploiting further information sources such as additional acoustic channels or modalities. Since the arising problem of information fusion exhibits striking parallels to problems in digital communications, where the turbo principle [1] was a groundbreaking innovation, Shivappa et al. showed that a similar iterative scheme can be applied to multimodal speech recognition [2]. We provide new interpretations and propose significant modifications of their approach: First, we show that no modification of the forward-backward recognition algorithm is required; second, we dispense with their proposed heuristic model; third, we deliver our own interpretation and formulation of the extrinsic information passed between the recognizers. Our proposed method is successfully applied to a synthetic unimodal two-channel speech recognition task.
Published in: Speech Communication; 10. ITG Symposium
Date of Conference: 26-28 September 2012
Date Added to IEEE Xplore: 25 September 2012
Print ISBN: 978-3-8007-3455-9
Conference Location: Braunschweig, Germany