AUTOMATED BIRD DETECTION IN AUDIO RECORDINGS BY A SIGNAL PROCESSING PERSPECTIVE
DOI:
https://doi.org/10.29284/ijasis.7.2.2021.11-20Keywords:
Bird detection, wavelet transform, dual tree m-band wavelets, Gaussian mixture model.Abstract
In this study, an effective automated technique for the detection of bird sounds is presented in a signal processing perspective. The detection of bird sound by examining the sound patterns is the basic step for wildlife monitoring. An Automated Bird Detection (ABD) system based on Dual-tree M-band Wavelet transform (DMWT) is designed. The more intrinsic content of the audio is extracted as features by DMWT and this is the crucial stage as the extracted features directly influence the efficiency of the ABD system. It classifies the given audio signals into two classes; birds are present or not present. The sounds in the audio signals are modeled by Gaussian Mixture Model (GMM) with the help of DMWT features. The ABD system is analyzed by changing the DMWT decomposition level, and Gaussian components used to model each fault. Results show that the ABD system achieves 97.82% accuracy by 3rd level DMWT features when modeled by 16 Gaussian components.
Downloads
References
A.L. McIlraith and H.C. Card, “Birdsong recognition using backpropagation and multivariate statistics,” IEEE Transaction on Signal Processing, Vol. 45, No. 11, 1997, 2740–2748.
A. Selin, J. Turunen, and J.T. Tanttu, “Wavelets in recognition of bird sounds,” EURASIP Journal of Advances in Signal Processing, Vol. 2007, 2007, pp.1-9.
F. Briggs, B. Lakshminarayanan, L. Neal, X. Z. Fern, R. Raich, S. J. Hadley, A.S. Hadley, and M.G. Betts, “Acoustic classification of multiple simultaneous bird species: A multi-instance multi-label approach,” J The Journal of the Acoustical Society of America, Vol. 131, No. 6, 2012, 4640–4650.
M. Ramashini, P. E. Abas, U. Grafe and L.C. De Silva, "Bird Sounds Classification Using Linear Discriminant Analysis," 4th International Conference and Workshops on Recent Advances and Innovations in Engineering, 2019, pp. 1-6.
M.M.M. Sukri, U. Fadlilah, S. Saon, A.K. Mahamad, M.M. Som and A. Sidek, "Bird Sound Identification based on Artificial Neural Network," 2020 IEEE Student Conference on Research and Development, 2020, pp. 342-345.
B.N. Suhas et al., "Automatic bird sound detection in long range field recordings using Wavelets & Mel filter bank features," 2nd International Conference on Cognitive Machine Intelligence, 2020, pp. 218-226.
Y. Jadhav, V. Patil and D. Parasar, "Machine Learning Approach to Classify Birds on the Basis of Their Sound," International Conference on Inventive Computation Technologies , 2020, pp. 69-73.
J. Xie, K. Hu, M. Zhu, J. Yu and Q. Zhu, "Investigation of Different CNN-Based Models for Improved Bird Sound Classification," IEEE Access, Vol. 7, 2019, pp. 175353-175361.
B. Chandu, A. Munikoti, K. S. Murthy, G. Murthy V. and C. Nagaraj, "Automated Bird Species Identification using Audio Signal Processing and Neural Networks," International Conference on Artificial Intelligence and Signal Processing, 2020, pp. 1-5.
P. Jancovic and M. Kokuer, "Bird Species Recognition Using Unsupervised Modeling of Individual Vocalization Elements," in IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol. 27, No. 5, 2019, pp. 932-947.
Á. Incze, H. Jancsó, Z. Szilágyi, A. Farkas and C. Sulyok, "Bird Sound Recognition Using a Convolutional Neural Network," 16th International Symposium on Intelligent Systems and Informatics, 2018, pp. 000295-000300.
C. Chaux, L. Duval, and J.C. Pesquet, “Hilbert pairs of M-band orthonormal wavelet bases,” 12th European Signal and Image Processing Conference, 2004, pp. 1187–1190.
I. Bayram, and I.W. Selesnick, “A simple construction for the m-band dual-tree complex wavelet transform,” 12th Signal Processing Education Workshop, 2006, pp. 596-601.
D.A. Reynolds and R.C. Rose, "Robust text-independent speaker identification using gaussian mixture speaker models," IEEE Transactions on Speech and Audio Processing, Vol. 3, No. 1, 1995, pp. 72–83.
C.M Bishop, “Pattern recognition and machine learning”, Springer, Chapter 9, Vol. 1, 2006, pp.43-50.
D. Stowell and M.D. Plumbley, "An open dataset for research on audio field recording archives: freefield1010," 2013, pp.1-10.
Downloads
Published
Issue
Section
License
This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.