Abstract:
This letter investigates the use of mel-frequency cepstral coefficients (MFCCs) and Gaussian mixture models (GMMs) for 1) improving the state of the art in speaker height estimation, and 2) rapid estimation of subglottal resonances (SGRs) without relying on formant and pitch tracking (unlike our previous algorithm in [1]). The proposed system comprises a set of height-dependent GMMs modeling static and dynamic MFCC features, where each GMM is associated with a height value. Furthermore, since SGRs and height are correlated, each GMM is also associated with a set of SGR values (known a priori). Given a speech sample, speaker height and SGRs are estimated as weighted combinations of the values corresponding to the N most-likely GMMs. We assess the importance of using dynamic MFCC features and the weighted decision rule, and demonstrate the efficacy of our approach via experiments on height estimation (using TIMIT) and SGR estimation (using the Tracheal Resonance database).
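The weighted decision rule described above can be illustrated with a minimal sketch. The abstract does not state how the weights are computed, so the example below assumes they are a softmax over each model's average per-frame log-likelihood; that choice, the diagonal-covariance GMM form, and the function names `gmm_avg_loglik` and `estimate_from_top_n` are all assumptions made for illustration, not the letter's actual method.

```python
import numpy as np


def gmm_avg_loglik(X, weights, means, variances):
    """Average per-frame log-likelihood of frames X under a diagonal GMM.

    X: (T, D) feature frames (e.g. static + dynamic MFCCs).
    weights: (K,), means: (K, D), variances: (K, D).
    The diagonal-covariance form is an assumption of this sketch.
    """
    diff = X[:, None, :] - means[None, :, :]                      # (T, K, D)
    log_comp = -0.5 * (
        np.sum(diff ** 2 / variances, axis=2)                     # (T, K)
        + np.sum(np.log(2.0 * np.pi * variances), axis=1)         # (K,)
    )
    log_mix = log_comp + np.log(weights)                          # (T, K)
    # Log-sum-exp over mixture components, for numerical stability.
    m = log_mix.max(axis=1, keepdims=True)
    frame_ll = m[:, 0] + np.log(np.exp(log_mix - m).sum(axis=1))
    return frame_ll.mean()


def estimate_from_top_n(scores, values, n):
    """Weighted combination of the values tied to the n best-scoring models.

    scores: (M,) one log-likelihood score per height-dependent GMM.
    values: (M,) heights, or (M, R) rows of SGR values, one per GMM.
    Weights are a softmax over the top-n scores (an assumed weighting).
    """
    scores = np.asarray(scores, dtype=float)
    values = np.asarray(values, dtype=float)
    top = np.argsort(scores)[::-1][:n]            # indices of the n most likely models
    w = np.exp(scores[top] - scores[top].max())   # subtract max before exponentiating
    w /= w.sum()
    return w @ values[top]
```

In use, one would score a speech sample against every height-dependent model with `gmm_avg_loglik`, then pass the resulting scores and the per-model heights (or SGR rows) to `estimate_from_top_n`. With `n=1` the rule reduces to picking the single most likely model, which is the baseline that a weighted rule would be compared against.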
Published in: IEEE Signal Processing Letters (Volume: 21, Issue: 2, February 2014)