International Journal of Applied Science and Engineering
Published by Chaoyang University of Technology

Camargo Luis 1*, Gasca Maira 2, Linero Rafa 1

1 Faculty of Engineering, Universidad del Magdalena, Santa Marta, Colombia

2 School of Basic Sciences, Technology and Engineering, Universidad Nacional Abierta y a Distancia, Santa Marta, Colombia


 

Download Citation: |
Download PDF


ABSTRACT


Avitourism depends on the understanding of birds and the intention of birdwatchers to see or hear a specific species. Increase of this activity strengthens the economy of communities and helps finance biodiversity conservation projects. Technological products that incorporate traditional and modern tools for signal and image processing facilitate the tracking, classification and observation of birds. This article has two approaches. The first one proposes and evaluates a lightweight classification model that uses feature vector extracted from the bird's song spectrum and is based on comparison of Euclidean distance between sample features and a set vector by species. The second approach adapts and evaluates convolutional neural network architectures for bird classification using the spectrogram of the bird's song. The methodology applied in both approaches consists of: pre-processing, feature extraction, classification and evaluation metrics. The main results are the feasibility of the proposed lightweight classification model with an accuracy of 0.8 and a loss of 2.32, and the feasibility of using convolutional neural networks with an accuracy above 0.9 and a loss of less than 1, in the ResNet50, VGG19, and InceptionV3 architectures, this using as a minimum 30 spectrograms per species during training. It is concluded that the model that best meets the needs of fewer samples and less computational resources required for training is ResNet50. Additionally, it is discussed to combine the two approaches in a hierarchical and hybrid classification model that allows to introduce in the reduction and classification layers of the neural network features of another type and of other sources.


Keywords: CNN, ResNet50, VGG19, InceptionV3, DFT, spectrogram, Euclidean distance.


Share this article with your colleagues

 


REFERENCES


  1. Afanasiev, O. 2022. Birdwatching, ornitological tourism and avitourism: Terminological dispute and review of world and Russian practices. Anais Brasileiros de Estudos Turísticos, 12, 1–12.

  2. Ali, L., Alnajjar, F., Jassmi, H., Gocho, M., Khan, W., Serhani, M. 2021. Performance evaluation of deep CNN-based crack detection and localization techniques for concrete structures. Sensors, 21, 1688.

  3. Alzubaidi, L., Zhang, J., Humaidi, A., Al-Dujaili, A., Duan, Y., Al-Shamma, O., Santamaría, J., Fadhel, M., Al-Amidie, M., Farhan, L. 2021. Review of deep learning: Concepts, CNN architectures, challenges, applications, future directions. Journal of Big Data, 8, 1–74.

  4. Catchpole, C., Slater, P. 2003. Bird song: Biological themes and variations. Cambridge University Press. Cambridge. UK.

  5. Chápuli, R. 2018. ¿Hay música en el canto de las aves? Encuentros en la Biología, 11, 21–25.

  6. Chen, G., Xia, C., Zhang, Y. 2020. Individual identification of birds with complex songs: The case of green-backed flycatchers ficedula elisae. Behavioural processes, 174, 104063.

  7. Chen, S.-S., Li, Y. 2013. Automatic recognition of bird songs using time-frequency texture. Proceeding of 5th International Conference and Computational Intelligence and Communication Networks. Mathura, India. 262–266.

  8. Donegan, T., Verhelst, J., Ellery, T., Cortés-Herrera, O., Salaman, P. 2016. Revision of the status of bird species occurring or reported in Colombia 2016 and assessment of BirdLife International’s new parrot taxonomy. Conservación Colombiana, 27, 12–36.

  9. Fagerlund, S. 2007. Bird species recognition using support vector machines. EURASIP Journal on Advances in Signal Processing, 038637, 1–8.

  10. González, J., Padrón, J., Barbero, I., Custodio, L., Merchán, F. 2019. Reconocimiento de canto de aves basado en el análisis de componentes principales del espectrograma. Revista de Iniciación Científica, 5, 124–129.

  11. He, K., Zhang, X., Ren, S., Sun, J. 2016. Deep residual learning for image recognition. Proceedings of the IEEE conference on computer vision and pattern recognition. Las Vegas, USA. 770–778.

  12. Hsu, S., Lee, C., Chang, P., Han, C., Fan, K. 2018. Local wavelet acoustic pattern: A novel time–frequency descriptor for birdsong recognition. IEEE Transactions on Multimedia, 20, 3187–3199.

  13. Hvenegaard, G., Dearden, P. 1998. Ecotourism versus tourism in a Thai national park. Annals of Tourism Research, 25, 700–720.

  14. Institute Humboldt. 2020. Descarga en tu celular los sonidos de la Biodiversidad colombiana. Retrieved 2022-04-12 from http://www.humboldt.org.co/es/noticias/actualidad/item/107-descarga-en-tu-celular-los-sonidos-de-la-biodiversidad-colombiana

  15. Ketkar, N., Moolayil, J. 2021. Convolutional neural networks. In Deep Learning with Python. Berkeley: Apress. California, USA.

  16. Khanna, H., Gaunt, S., McCallum, D. 1997. Digital spectrographic cross-correlation: Tests of sensitivity. Bioacoustics, 7, 209–234.

  17. Kingma, D. P., Ba, J. 2014. Adam: A method for stochastic optimization. arXiv:1412.6980.

  18. Kogan, J., Margoliash, D. 1998. Automated recognition of bird song elements from continuous recordings using dynamic time warping and hidden Markov models: A comparative study. The Journal of the Acoustical Society of America, 103, 2185–2196.

  19. Lee, C., Han, C., Chuang, C. 2008. Automatic classification of bird species from their sounds using two-dimensional cepstral coefficients. IEEE Transactions on Audio, Speech, and Language Processing, 16, 1541–1550.

  20. Mathew, A., Amudha, P., Sivakumari, S. 2021. Deep learning techniques: An overview. Proceeding of International Conference on Advanced Machine Learning Technologies and Applications. Singapore. 1141.

  21. Nair, V., Hinton, G.E. 2010. Rectified linear units improve restricted boltzmann machines. Proceedings of the 27th international conference on machine learning (ICML-10). Haifa, Israel.

  22. Ocampo-Peñuela, N., Winton, S. 2017. Economic and conservation potential of bird-watching tourism in postconflict Colombia. Tropical Conservation Science, 10, 1–6.

  23. Quintana, V. 2017. El turismo de naturaleza: Un producto turístico sostenible. Arbor Ciencia, Pensamiento y Cultura, 193, 1–14.

  24. Simonyan, K., Zisserman, A. 2014. Very deep convolutional networks for large-scale image recognition. arXiv, 1409, 1–14.

  25. Steven, R., Morrison, C., Castley, J. 2015. Birdwatching and avitourism: A global review of research into its participant markets, distribution and impacts, highlighting future research priorities to inform sustainable avitourism management. Journal of Sustainable Tourism, 23, 1257–1276.

  26. Steven, R., Rakotopare, N., Newsome, D. 2021. Avitourism tribes: As diverse as the birds they watch. Singapore: Springer.

  27. Sukri, M., Fadlilah, U., Saon, S., Som, M., Sidek, A. 2020. Bird sound identification based on artificial neural network. Proceeding of 2020 IEEE Student Conference on Research and Development (SCOReD). 342–345.

  28. Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., Wojna, Z. 2016. Rethinking the inception architecture for computer vision. Proceedings of the IEEE conference on computer vision and pattern recognition. 2818–2826.

  29. Tanttu, J., Jari, T., Ojanen, M. 2003. Automatic classification of flight calls of Crossbill species (Loxia spp.). Proceedings of the 1st International Conference on Acoustic Communication by Animals. 239–240.

  30. Wang, H., Xu, Y., Yu, Y., Lin, Y., Ran, J. 2022. An efficient model for a vast number of bird species identification based on acoustic features. Animals, 12, 2434.

  31. Xeno-canto Foundation. 2021. Compartiendo cantos de aves de todo el mundo. Retrieved 2022-06-06 from https://www.xeno-canto.org/about/xeno-canto

  32. Zhang, X., Chen, A., Zhou, G., Zhang, Z., Huang, X., Qiang, X. 2019. Spectrogram-frame linear network and continuous frame sequence for bird sound classification. Ecological Informatics, 54, 101009.


ARTICLE INFORMATION


Received: 2023-06-14
Revised: 2023-09-08
Accepted: 2023-09-27
Available Online: 2024-01-02


Cite this article:

Luis, C., Maira, G., Rafa, L. 2024. Traditional and modern processing of digital signals and images for the classification of birds from singing. International Journal of Applied Science and Engineering, 21, 2023222. https://doi.org/10.6703/IJASE.202403_21(1).007

  Copyright The Author(s). This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are cited.