Abstract:
Speaker recognition has been developed over many years and it comes with many different methods. MFCC is one of more the successful methods due to it being generally mode...Show MoreMetadata
Abstract:
Speaker recognition has been developed over many years and it comes with many different methods. MFCC is one of more the successful methods due to it being generally modeled on the human auditory system. It represents high success rate of recognition and strong robustness against noise in the lower frequency regions. However, in the higher frequency regions, it captures speaker characteristics information less effectively. In recent years, Artificial Neural Networks have become popular. This paper presents a speaker recognition method based on MFCC and Back-Propagation Neural Networks. Experimental studies have proven that the recognition rate is successful when the number of questionable speakers is not very larger. When the number of speakers increases, the rate of recognition decreases. The potential problems and solutions are discussed, the number of training samples must be larger than the number of network model weights, 2–10 times. When the number of speakers increases, the number of training samples required also increases significantly.
Published in: 2017 28th Irish Signals and Systems Conference (ISSC)
Date of Conference: 20-21 June 2017
Date Added to IEEE Xplore: 20 July 2017
ISBN Information:
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Neural Network ,
- Back Propagation Neural Network ,
- Speaker Recognition ,
- Mel-frequency Cepstral Coefficients ,
- Artificial Neural Network ,
- Human System ,
- Frequency Region ,
- Auditory System ,
- Number Of Weights ,
- Low-frequency Region ,
- Recognition Rate ,
- Strong Robustness ,
- High Frequency Region ,
- Noise Region ,
- Human Auditory System ,
- Output Layer ,
- Hidden Layer ,
- Fast Fourier Transform ,
- Input Layer ,
- Hidden Markov Model ,
- Dynamic Time Warping ,
- Discrete Cosine Transform ,
- Nodes In Layer ,
- Pitch Perception ,
- Linear Scale ,
- Vector Quantization ,
- Cepstral Coefficients ,
- Hidden Nodes ,
- Hidden Layer Nodes ,
- Low-frequency Part
- Author Keywords
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Neural Network ,
- Back Propagation Neural Network ,
- Speaker Recognition ,
- Mel-frequency Cepstral Coefficients ,
- Artificial Neural Network ,
- Human System ,
- Frequency Region ,
- Auditory System ,
- Number Of Weights ,
- Low-frequency Region ,
- Recognition Rate ,
- Strong Robustness ,
- High Frequency Region ,
- Noise Region ,
- Human Auditory System ,
- Output Layer ,
- Hidden Layer ,
- Fast Fourier Transform ,
- Input Layer ,
- Hidden Markov Model ,
- Dynamic Time Warping ,
- Discrete Cosine Transform ,
- Nodes In Layer ,
- Pitch Perception ,
- Linear Scale ,
- Vector Quantization ,
- Cepstral Coefficients ,
- Hidden Nodes ,
- Hidden Layer Nodes ,
- Low-frequency Part
- Author Keywords