A Novel Deep Learning Approach for Classification of Bird Sound Using Mel Frequency Cepstral Coefficients

Authors

  • Aymen Saad Department of Information Technology, Technical College of Management, Kufa, Al Furat Al Awsat Technical University, Kufa 54003, Iraq

DOI:

https://doi.org/10.52866/ijcsm.2024.05.03.040

Keywords:

Bird sound, Deep Learning, Light Wight Convolutional Neural Networks, Classification, Mel Frequency Cepstral Coefficients

Abstract

 Monitoring animal populations is one important matter to better understand changes in their
population, behavior, and biodiversity. Bird sounds are the main tool to classify bird species acoustically. The
sounds of birds are an indicator for ecologists as it responds to changes in their environment. The recognition
among a variety of bird species to get important features is computationally expensive. With the unbalanced
classes and scarcity of training data, the performance accuracy is degrading. This paper aims to classify species of
birds using lightweight convolutional neural networks (LWCNNs) basis on using a spectrogram image of Brazilian
bird sounds as a dataset. For extracting spectrogram images, Mel Frequency Cepstral Coefficient (MFCC)
algorithm is used. To prove the high performance of the classifier, ten species of birds with 10,000 spectrogram
images are provided to the classifier. Our LWCNN model achieved a training and testing accuracy of 99.68 % and
92.80 % respectively in 10.54 min with 5 epochs.

Downloads

Download data is not yet available.

Downloads

Published

2024-08-18

How to Cite

[1]
A. Saad, “A Novel Deep Learning Approach for Classification of Bird Sound Using Mel Frequency Cepstral Coefficients”, Iraqi Journal For Computer Science and Mathematics, vol. 5, no. 3, Aug. 2024.
CITATION
DOI: 10.52866/ijcsm.2024.05.03.040
Published: 2024-08-18

Issue

Section

Articles