This paper describes the architecture of a door phone embedded system with interactive voice response. Because speech technology is not 100% reliable, the emphasis was on the parts that have the greatest impact on overall performance: audio capture, speech recognition and verification, and power consumption. Using an embedded microphone array increases speech recognition effectiveness in very noisy environments. To increase speech recognition performance, a null grammar with confidence measure support was used. The speaker verification module was also optimized for noisy environments, using the cepstral mean normalization technique and a universal background model.
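As an illustration of the cepstral mean normalization step mentioned above, the following is a minimal sketch and not the implementation used in the described system. It assumes the utterance is already available as a list of per-frame cepstral coefficient vectors; the function name `cepstral_mean_normalize` and the input layout are hypothetical. Subtracting the per-coefficient mean over the utterance removes any constant offset in the cepstral domain, which is how a stationary channel or microphone coloration typically appears.

```python
from typing import List


def cepstral_mean_normalize(frames: List[List[float]]) -> List[List[float]]:
    """Subtract the per-coefficient mean computed over all frames of one utterance.

    `frames` is assumed to hold one cepstral vector per frame, all of equal
    length (hypothetical layout chosen for this sketch).
    """
    if not frames:
        return []
    n_frames = len(frames)
    n_coeffs = len(frames[0])
    # Mean of each cepstral coefficient across the whole utterance.
    means = [sum(frame[k] for frame in frames) / n_frames for k in range(n_coeffs)]
    # Remove the mean so a constant cepstral offset cancels out.
    return [[frame[k] - means[k] for k in range(n_coeffs)] for frame in frames]


if __name__ == "__main__":
    clean = [[1.0, 2.0], [3.0, 6.0]]
    # Same utterance with a constant offset added to every frame.
    offset = [[1.5, 1.0], [3.5, 5.0]]
    print(cepstral_mean_normalize(clean))
    print(cepstral_mean_normalize(offset))
```

Both calls produce the same normalized features, which shows why the technique helps when the enrollment and verification recordings differ by a fixed channel effect. Per-utterance normalization is only one option; a sliding-window mean is a common alternative for streaming input.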