Loading [MathJax]/extensions/MathMenu.js
Continuous Birdsong Recognition Using Gaussian Mixture Modeling of Image Shape Features | IEEE Journals & Magazine | IEEE Xplore

Continuous Birdsong Recognition Using Gaussian Mixture Modeling of Image Shape Features


Abstract:

Traditional birdsong recognition approaches used acoustic features based on the acoustic model of speech production or the perceptual model of the human auditory system t...Show More

Abstract:

Traditional birdsong recognition approaches used acoustic features based on the acoustic model of speech production or the perceptual model of the human auditory system to identify the associated bird species. In this paper, a new feature descriptor that uses image shape features is proposed to identify bird species based on the recognition of fixed-duration birdsong segments where their corresponding spectrograms are viewed as gray-level images. The MPEG-7 angular radial transform (ART) descriptor, which can compactly and efficiently describe the gray-level variations within an image region in both angular and radial directions, will be employed to extract the shape features from the spectrogram image. To effectively capture both frequency and temporal variations within a birdsong segment using ART, a sector expansion algorithm is proposed to transform its spectrogram image into a corresponding sector image such that the frequency and temporal axes of the spectrogram image will align with the radial and angular directions of the ART basis functions, respectively. For the classification of 28 bird species using Gaussian mixture models (GMM), the best classification accuracy is 86.30% and 94.62% for 3-second and 5-second birdsong segments using the proposed ART descriptor, which is better than traditional descriptors such as LPCC, MFCC, and TDMFCC.
Published in: IEEE Transactions on Multimedia ( Volume: 15, Issue: 2, February 2013)
Page(s): 454 - 464
Date of Publication: 27 November 2012

ISSN Information:

Author image of Chang-Hsing Lee
Department of Computer Science and Information Engineering, Chung Hua University, Hsinchu 300, Taiwan
Chang-Hsing Lee (M'11) received the B.S. and Ph.D. degrees from Computer and Information Science, National Chiao Tung University, Hsinchu, Taiwan in 1991 and 1995, respectively. He is currently an Associate Professor with the Department of Computer Science and Information Engineering, Chung Hua University, Hsinchu, Taiwan. His main research interests include audio/sound classification, multimedia information retrieval, an...Show More
Chang-Hsing Lee (M'11) received the B.S. and Ph.D. degrees from Computer and Information Science, National Chiao Tung University, Hsinchu, Taiwan in 1991 and 1995, respectively. He is currently an Associate Professor with the Department of Computer Science and Information Engineering, Chung Hua University, Hsinchu, Taiwan. His main research interests include audio/sound classification, multimedia information retrieval, an...View more
Author image of Sheng-Bin Hsu
Department of Computer Science and Information Engineering, Chung Hua University, Hsinchu 300, Taiwan
Sheng-Bin Hsu received the B.S. degree from Computer Science and Information Engineering, Ming Dao University, Changhua, Taiwan in 2006 and the M.S. degree from Computer Science and Information Engineering, Chung Hua University, Hsinchu, Taiwan in 2009. He is currently pursuing the Ph.D. degree in Computer Science, National Central University, Taoyuan, Taiwan. His main research interests include audio/sound classification...Show More
Sheng-Bin Hsu received the B.S. degree from Computer Science and Information Engineering, Ming Dao University, Changhua, Taiwan in 2006 and the M.S. degree from Computer Science and Information Engineering, Chung Hua University, Hsinchu, Taiwan in 2009. He is currently pursuing the Ph.D. degree in Computer Science, National Central University, Taoyuan, Taiwan. His main research interests include audio/sound classification...View more
Author image of Jau-Ling Shih
Department of Computer Science and Information Engineering, Chung Hua University, Hsinchu 300, Taiwan
Jau-Ling Shih received the B.S. degree from Electrical Engineering, National Sun Yat-Sen University, Kaohsiung, Taiwan in 1992, the M.S. degree from Electrical Engineering, National Cheng Kung University, Tainan, Taiwan in 1994, and the Ph.D. degree from Computer and Information Science, National Chiao Tung University, Hsinchu, Taiwan in 2002. She is currently a Professor with the Department of Computer Science and Inform...Show More
Jau-Ling Shih received the B.S. degree from Electrical Engineering, National Sun Yat-Sen University, Kaohsiung, Taiwan in 1992, the M.S. degree from Electrical Engineering, National Cheng Kung University, Tainan, Taiwan in 1994, and the Ph.D. degree from Computer and Information Science, National Chiao Tung University, Hsinchu, Taiwan in 2002. She is currently a Professor with the Department of Computer Science and Inform...View more
Author image of Chih-Hsun Chou
Department of Computer Science and Information Engineering, Chung Hua University, Hsinchu 300, Taiwan
Chih-Hsun Chou received the B.S. degree from Electronic Engineering, Tamkang University, Taipei, Taiwan in 1985, and the Ph.D. degree from Electrical Engineering, Ta-Tung Institute of Technology, Taipei, Taiwan in 1994. He is currently an Associate Professor with the Department of Computer Science and Information Engineering, Chung-Hua University, Hsinchu, Taiwan. His current research interests include artificial intellig...Show More
Chih-Hsun Chou received the B.S. degree from Electronic Engineering, Tamkang University, Taipei, Taiwan in 1985, and the Ph.D. degree from Electrical Engineering, Ta-Tung Institute of Technology, Taipei, Taiwan in 1994. He is currently an Associate Professor with the Department of Computer Science and Information Engineering, Chung-Hua University, Hsinchu, Taiwan. His current research interests include artificial intellig...View more

Author image of Chang-Hsing Lee
Department of Computer Science and Information Engineering, Chung Hua University, Hsinchu 300, Taiwan
Chang-Hsing Lee (M'11) received the B.S. and Ph.D. degrees from Computer and Information Science, National Chiao Tung University, Hsinchu, Taiwan in 1991 and 1995, respectively. He is currently an Associate Professor with the Department of Computer Science and Information Engineering, Chung Hua University, Hsinchu, Taiwan. His main research interests include audio/sound classification, multimedia information retrieval, and image processing.
Chang-Hsing Lee (M'11) received the B.S. and Ph.D. degrees from Computer and Information Science, National Chiao Tung University, Hsinchu, Taiwan in 1991 and 1995, respectively. He is currently an Associate Professor with the Department of Computer Science and Information Engineering, Chung Hua University, Hsinchu, Taiwan. His main research interests include audio/sound classification, multimedia information retrieval, and image processing.View more
Author image of Sheng-Bin Hsu
Department of Computer Science and Information Engineering, Chung Hua University, Hsinchu 300, Taiwan
Sheng-Bin Hsu received the B.S. degree from Computer Science and Information Engineering, Ming Dao University, Changhua, Taiwan in 2006 and the M.S. degree from Computer Science and Information Engineering, Chung Hua University, Hsinchu, Taiwan in 2009. He is currently pursuing the Ph.D. degree in Computer Science, National Central University, Taoyuan, Taiwan. His main research interests include audio/sound classification and image processing.
Sheng-Bin Hsu received the B.S. degree from Computer Science and Information Engineering, Ming Dao University, Changhua, Taiwan in 2006 and the M.S. degree from Computer Science and Information Engineering, Chung Hua University, Hsinchu, Taiwan in 2009. He is currently pursuing the Ph.D. degree in Computer Science, National Central University, Taoyuan, Taiwan. His main research interests include audio/sound classification and image processing.View more
Author image of Jau-Ling Shih
Department of Computer Science and Information Engineering, Chung Hua University, Hsinchu 300, Taiwan
Jau-Ling Shih received the B.S. degree from Electrical Engineering, National Sun Yat-Sen University, Kaohsiung, Taiwan in 1992, the M.S. degree from Electrical Engineering, National Cheng Kung University, Tainan, Taiwan in 1994, and the Ph.D. degree from Computer and Information Science, National Chiao Tung University, Hsinchu, Taiwan in 2002. She is currently a Professor with the Department of Computer Science and Information Engineering, Chung Hua University, Hsinchu, Taiwan. Her main research interests include image processing, image retrieval, and audio processing.
Jau-Ling Shih received the B.S. degree from Electrical Engineering, National Sun Yat-Sen University, Kaohsiung, Taiwan in 1992, the M.S. degree from Electrical Engineering, National Cheng Kung University, Tainan, Taiwan in 1994, and the Ph.D. degree from Computer and Information Science, National Chiao Tung University, Hsinchu, Taiwan in 2002. She is currently a Professor with the Department of Computer Science and Information Engineering, Chung Hua University, Hsinchu, Taiwan. Her main research interests include image processing, image retrieval, and audio processing.View more
Author image of Chih-Hsun Chou
Department of Computer Science and Information Engineering, Chung Hua University, Hsinchu 300, Taiwan
Chih-Hsun Chou received the B.S. degree from Electronic Engineering, Tamkang University, Taipei, Taiwan in 1985, and the Ph.D. degree from Electrical Engineering, Ta-Tung Institute of Technology, Taipei, Taiwan in 1994. He is currently an Associate Professor with the Department of Computer Science and Information Engineering, Chung-Hua University, Hsinchu, Taiwan. His current research interests include artificial intelligence, audio signal processing, intelligent control and data mining.
Chih-Hsun Chou received the B.S. degree from Electronic Engineering, Tamkang University, Taipei, Taiwan in 1985, and the Ph.D. degree from Electrical Engineering, Ta-Tung Institute of Technology, Taipei, Taiwan in 1994. He is currently an Associate Professor with the Department of Computer Science and Information Engineering, Chung-Hua University, Hsinchu, Taiwan. His current research interests include artificial intelligence, audio signal processing, intelligent control and data mining.View more

Contact IEEE to Subscribe

References

References is not available for this document.