Speech detection using Haar-like filtering is proposed as a new, very low computational cost method for sensornet applications. Simple Haar-like filters with variable filter width and shift width are trained on training samples to learn the filter parameters appropriate for detecting speech. To further reduce the computational cost, the use of an intermediate signal representation called the "integral signal" is proposed. Our method yielded a speech/nonspeech classification accuracy of 97.44% for an input length of 0.1 s. Compared with the high-performance feature extraction method MFCC (mel-frequency cepstrum coefficient), the proposed Haar-like filtering can be approximately 93.71% efficient in terms of the total number of add and multiply calculations, while achieving an error rate of only 2.56% relative to MFCC.
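The abstract does not define the integral signal or the exact filter shape, so the following is only a minimal sketch under stated assumptions. It assumes the integral signal is a running sum of the samples (by analogy with the integral image used for Haar-like features in vision), that a filter weights the first half of its window by +1 and the second half by -1, and that the feature is the mean absolute response over all shifted positions. The function names `integral_signal`, `haar_like_response`, and `haar_like_feature` are hypothetical and are not taken from the paper.

```python
from typing import List, Sequence


def integral_signal(x: Sequence[float]) -> List[float]:
    """Running sum with a leading zero: ii[k] == sum(x[:k]). (Assumed definition.)"""
    ii = [0.0]
    for v in x:
        ii.append(ii[-1] + v)
    return ii


def haar_like_response(ii: Sequence[float], start: int, width: int) -> float:
    """Response of an assumed two-segment filter (+1 first half, -1 second half).

    Each half-window sum costs one subtraction on the integral signal,
    regardless of the filter width.
    """
    half = width // 2
    mid = start + half
    end = start + 2 * half
    return (ii[mid] - ii[start]) - (ii[end] - ii[mid])


def haar_like_feature(x: Sequence[float], width: int, shift: int) -> float:
    """Mean absolute filter response over positions stepped by `shift`.

    The averaging step is an assumption made for illustration.
    """
    if width < 2 or shift < 1 or len(x) < width:
        raise ValueError("need width >= 2, shift >= 1, and len(x) >= width")
    ii = integral_signal(x)
    positions = range(0, len(x) - width + 1, shift)
    responses = [abs(haar_like_response(ii, p, width)) for p in positions]
    return sum(responses) / len(responses)
```

Under these assumptions, each filter response needs only a few additions and no multiplications once the integral signal has been computed, which is consistent with the low calculation cost the abstract claims. In a trained detector, `width` and `shift` would be the parameters selected from the training samples.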