Abstract:
We developed a system for detecting the speech activity intervals of multiple speakers by combining multiple microphone arrays and human tracking technologies. We also pr...Show MoreMetadata
Abstract:
We developed a system for detecting the speech activity intervals of multiple speakers by combining multiple microphone arrays and human tracking technologies. We also proposed a method for estimating the face orientation of the detected speakers. The developed system was evaluated in two steps: individual utterances in different positions and orientations; and simultaneous dialogues by multiple speakers. Evaluation results revealed that the proposed system could detect speech activity intervals with more than 90% of accuracy, and face orientations with standard deviations within 30 degrees, in situations excluding the cases where all arrays are in the opposite direction to the speaker's face orientation.
Date of Conference: 28 September 2015 - 02 October 2015
Date Added to IEEE Xplore: 17 December 2015
ISBN Information:
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Multiple Locations ,
- Multiple Arrays ,
- Orientation Estimation ,
- Face Orientation ,
- Microphone Array ,
- Multiple Speakers ,
- Minimum Distance ,
- Air Conditioning ,
- Search Space ,
- Real-time Performance ,
- 3D Space ,
- Noise Sources ,
- Azimuth Angle ,
- Direct Estimates ,
- Position Estimation ,
- Elevation Angle ,
- GHz CPU ,
- Sound Source ,
- Array Technology ,
- Angle Of Resolution ,
- Laser Ranging ,
- Range Of Degrees ,
- Sound Localization ,
- Position Estimation Error ,
- P2 Position ,
- Source Position ,
- Direct Line ,
- Resolution Estimation ,
- Computational Cost
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Multiple Locations ,
- Multiple Arrays ,
- Orientation Estimation ,
- Face Orientation ,
- Microphone Array ,
- Multiple Speakers ,
- Minimum Distance ,
- Air Conditioning ,
- Search Space ,
- Real-time Performance ,
- 3D Space ,
- Noise Sources ,
- Azimuth Angle ,
- Direct Estimates ,
- Position Estimation ,
- Elevation Angle ,
- GHz CPU ,
- Sound Source ,
- Array Technology ,
- Angle Of Resolution ,
- Laser Ranging ,
- Range Of Degrees ,
- Sound Localization ,
- Position Estimation Error ,
- P2 Position ,
- Source Position ,
- Direct Line ,
- Resolution Estimation ,
- Computational Cost