A Deep Learning Approach for Theautomatic Classification of Acoustic Events: A Case of Natural Disasters
Loading...
Date
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
University of Ghana
Abstract
Automatic classification of acoustic events is a signal processing activity that has recently gained research interest, especially in the machine learning community. This is due to its cost-effectiveness in the long-term monitoring of larger areas and the collection of large amounts of data in real-time. A plethora of techniques have been proposed and adopted for the classification of acoustic events such as respiratory sound, animal calls/vocalizations, baby cry, speech disorders, and environmental sound. This study was aimed at developing a natural disaster sound classification model that will enable automatic classification of natural disasters. Accordingly, deep learning techniques including Convolutional Neural Network (CNN) and a Long short-term memory based-Recurrent Neural Network (RNN-LSTM) were used to develop classification models. The adopted algorithms and sound features used in this study were motivated by methodologies used in the area of speech/voice recognition. To ensure a relevant and rigorous research, this study adopted the design science research methodology which consisted of a five-phase cycle; awareness of the problem, suggestion, development, evaluation, and conclusion. Furthermore, to also ensure the real-time classification of natural disaster sounds, the detection-by-classification approach was adopted instead of detection-and-classification. The dataset used for this study consisted of five classes of natural disasters sound that was extracted from the Freesound database. The sound files were preprocessed at 16000Hz to extract 13 Mel Frequency Cepstral Coefficient (MFCC). An arbitrary time frame of 0.1s was adopted. In the end, the performance of both models was validated using the classification metrics and cross-validation. Results indicated that although CNN performed slightly better than RNN-LSTM, both models were effective at automatically discerning one disaster sound from the other in real-time. Best results of 99.95% in classification accuracy, and 0.999 in the area under the curve (AUC) score were obtained from CNN.
Description
Mphil. Computer Science