Production features for detection of shouted speech

2 Author(s)
Vinay Kumar Mittal (Speech and Vision Laboratory, International Institute of Information Technology, Gachibowli, Hyderabad 500032, India); B. Yegnanarayana

Shouted speech or screaming signals have been studied mostly through spectral representations such as mel-cepstral coefficients. Intuitive evidence that the characteristics of the excitation source may vary in shouted speech has so far drawn little attention. In this paper, we examine how the characteristics of both components of the speech production mechanism, especially the glottal excitation source, are modified during the production of shout signals. Shouted and normal speech signals are examined along with the corresponding electroglottograph (EGG) signals. Distinguishing features such as the dominant frequency and the strength of excitation are explored, along with the instantaneous fundamental frequency. These features are computed using linear prediction analysis and zero-frequency filtering of the speech signal. The efficacy of these features in discriminating between shouted and normal speech is tested in five different vowel contexts.
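As a rough illustration of the zero-frequency filtering step named in the abstract, the sketch below follows the commonly described recipe: difference the signal, pass it through a cascade of two ideal zero-frequency resonators (equivalent to four cumulative sums), and remove the resulting trend by repeatedly subtracting a local mean. It is not the authors' implementation. The function names, the 1.5-pitch-period window, the default 10 ms average pitch period, and the three trend-removal passes are all assumptions chosen for illustration. Epochs are taken here as negative-to-positive zero crossings, the strength of excitation as the slope at each crossing, and the instantaneous fundamental frequency as the reciprocal of the interval between successive epochs.

```python
import numpy as np


def zero_frequency_filter(speech, fs, avg_pitch_period_s=0.010, passes=3):
    """Return a zero-frequency-filtered signal and an edge margin in samples.

    avg_pitch_period_s and passes are illustrative assumptions, not values
    taken from the paper.
    """
    # Difference the signal to remove any slowly varying bias.
    x = np.diff(np.asarray(speech, dtype=np.float64), prepend=0.0)

    # Two cascaded resonators y[n] = 2y[n-1] - y[n-2] + x[n] have the
    # transfer function 1 / (1 - z^-1)^4, i.e. four cumulative sums.
    y = x
    for _ in range(4):
        y = np.cumsum(y)

    # Remove the polynomial trend by subtracting a local mean computed over
    # roughly 1.5 average pitch periods (odd window length).
    win = int(round(1.5 * avg_pitch_period_s * fs)) | 1
    kernel = np.ones(win) / win
    for _ in range(passes):
        y = y - np.convolve(y, kernel, mode="same")

    # Samples this close to either end are affected by zero padding.
    margin = passes * win
    return y, margin


def excitation_features(zff, fs, margin):
    """Epoch locations, strength of excitation, and instantaneous F0 (Hz)."""
    # Negative-to-positive zero crossings are taken as epoch candidates.
    idx = np.flatnonzero((zff[:-1] < 0) & (zff[1:] >= 0))
    idx = idx[(idx >= margin) & (idx < len(zff) - margin)]

    # Slope of the filtered signal at each epoch.
    strength = zff[idx + 1] - zff[idx]

    # Reciprocal of the interval between successive epochs.
    f0 = fs / np.diff(idx)
    return idx, strength, f0
```

Calling `zero_frequency_filter` on a mono signal and passing its two return values to `excitation_features` yields per-epoch features that could then be compared between shouted and normal utterances. The trend-removal window should be adapted to the speaker's actual average pitch period, which is likely to be shorter for shouted speech.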

Published in:

2013 IEEE 10th Consumer Communications and Networking Conference (CCNC)

Date of Conference:

11-14 Jan. 2013