Abstract:
State-of-the-art speech recognisers employ neural networks in various configurations. A standard (hybrid) speech recogniser computes the likelihood for one time frame and...Show MoreMetadata
Abstract:
State-of-the-art speech recognisers employ neural networks in various configurations. A standard (hybrid) speech recogniser computes the likelihood for one time frame and state, using only one out of thousands of possible neural-network outputs. However, the whole output vector carries information. In this paper, features from state-of-the-art speech recognisers are collected per phone given a particular context, and input to a discriminative log-linear model. The log-linear model is trained with conditional maximum likelihood or a large-margin criterion. A key element is the prior on the parameters of the log-linear model. The mean of the prior is set to the point where the performance of the original systems is attained. The log-linear model then provides an additional increase over the state-of-the-art performance of the individual systems.
Date of Conference: 13-17 December 2015
Date Added to IEEE Xplore: 11 February 2016
ISBN Information:
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Neural Network ,
- Model Discrimination ,
- Deep Neural Network Features ,
- Maximum Likelihood ,
- Log-linear ,
- Speech Recognition ,
- Conditional Likelihood ,
- Conditional Maximum Likelihood ,
- Support Vector Machine ,
- Feature Space ,
- Mixture Model ,
- Hidden Markov Model ,
- Conditional Distribution ,
- Parameter Vector ,
- Graphical Model ,
- Hybrid System ,
- Dot Product ,
- Language Model ,
- Acoustic Features ,
- Types Of Noise ,
- Tandem System ,
- Sequence Of Words ,
- Word Error Rate ,
- Conditional Random Field ,
- Audio Segments ,
- Training Criterion ,
- Joint System ,
- Segmentation Feature ,
- Decoding ,
- Gradient Descent
- Author Keywords
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Neural Network ,
- Model Discrimination ,
- Deep Neural Network Features ,
- Maximum Likelihood ,
- Log-linear ,
- Speech Recognition ,
- Conditional Likelihood ,
- Conditional Maximum Likelihood ,
- Support Vector Machine ,
- Feature Space ,
- Mixture Model ,
- Hidden Markov Model ,
- Conditional Distribution ,
- Parameter Vector ,
- Graphical Model ,
- Hybrid System ,
- Dot Product ,
- Language Model ,
- Acoustic Features ,
- Types Of Noise ,
- Tandem System ,
- Sequence Of Words ,
- Word Error Rate ,
- Conditional Random Field ,
- Audio Segments ,
- Training Criterion ,
- Joint System ,
- Segmentation Feature ,
- Decoding ,
- Gradient Descent
- Author Keywords