DEW: A wavelet approach of rare sound event detection

Gul, Sania and Khan, Muhammad Salman and Ur-Rehman, Ata and Naseer, Noman (2024) DEW: A wavelet approach of rare sound event detection. PLOS ONE, 19 (3). e0300444. ISSN 1932-6203


Abstract

This paper presents a novel sound event detection (SED) system for rare events occurring in an open environment. Wavelet multiresolution analysis (MRA) is used to decompose a 30-second input audio clip into five levels. Wavelet denoising is then applied to the third and fifth levels of the MRA to filter out the background. Significant transitions, which may represent the onset of a rare event, are then estimated in these two levels by combining a peak-finding algorithm with the K-medoids clustering algorithm. Small portions of one-second duration, called ‘chunks’, are cropped from the input audio signal at the estimated locations of the significant transitions. Features are extracted from these chunks by a wavelet scattering network (WSN) and given as input to a support vector machine (SVM) classifier, which classifies them. The proposed SED framework produces an error rate comparable to that of SED systems based on convolutional neural network (CNN) architectures. The proposed algorithm is also computationally efficient and lightweight compared with deep learning models, as it has no learnable parameters. It requires only a single epoch of training, which is 5, 10, 200, and 600 times fewer than the models based on CNNs and deep neural networks (DNNs), a CNN with a long short-term memory (LSTM) network, a convolutional recurrent neural network (CRNN), and a CNN, respectively. The proposed model requires neither concatenation with previous frames for anomaly detection nor the additional training-data creation needed by the deep learning models used for comparison. It needs to check almost 360 times fewer chunks for the presence of rare events than the baseline systems used for comparison in this paper. All these characteristics make the proposed system suitable for real-time applications on resource-limited devices.
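
The front end of the pipeline described in the abstract (five-level decomposition, denoising of levels 3 and 5, onset estimation, one-second chunk cropping) can be sketched roughly as follows. This is only an illustrative sketch under stated assumptions, not the authors' implementation. The abstract does not name the wavelet or the denoising rule, so a Haar wavelet and universal soft thresholding are assumed here. A plain "surviving coefficient plus minimum gap" rule stands in for the paper's peak-finding and K-medoids step. The WSN feature extraction and SVM classification stages are omitted. All function names, the 256 Hz sample rate, and the synthetic clip are hypothetical.

```python
import math
import random


def haar_step(x):
    """One level of the Haar DWT: returns (approximation, detail)."""
    s = math.sqrt(2.0)
    approx = [(x[i] + x[i + 1]) / s for i in range(0, len(x) - 1, 2)]
    detail = [(x[i] - x[i + 1]) / s for i in range(0, len(x) - 1, 2)]
    return approx, detail


def haar_mra(x, levels=5):
    """Detail coefficients for levels 1..levels (list index 0 is level 1)."""
    details = []
    approx = list(x)
    for _ in range(levels):
        approx, d = haar_step(approx)
        details.append(d)
    return details


def soft_threshold(coeffs):
    """Soft shrinkage with the universal threshold (assumed denoising rule)."""
    n = len(coeffs)
    median_abs = sorted(abs(c) for c in coeffs)[n // 2]
    sigma = median_abs / 0.6745            # robust noise estimate
    t = sigma * math.sqrt(2.0 * math.log(n))
    return [math.copysign(max(abs(c) - t, 0.0), c) for c in coeffs]


def onset_candidates(coeffs, level, min_gap):
    """Map surviving coefficients to sample indices, keeping one per min_gap.

    Stand-in for the paper's peak-finding + K-medoids step (assumption).
    """
    hits = [i * (2 ** level) for i, c in enumerate(coeffs) if c != 0.0]
    kept = []
    for h in hits:
        if not kept or h - kept[-1] >= min_gap:
            kept.append(h)
    return kept


def crop_chunk(x, start, length):
    """Crop `length` samples starting at `start`, clamped to the clip."""
    start = max(0, min(start, len(x) - length))
    return x[start:start + length]


# Synthetic 30-second clip: quiet background plus a loud 1-second burst.
random.seed(0)
FS = 256                                   # hypothetical sample rate (Hz)
clip = [random.gauss(0.0, 0.01) for _ in range(30 * FS)]
EVENT_START = 12 * FS
for i in range(EVENT_START, EVENT_START + FS):
    clip[i] += random.gauss(0.0, 1.0)

details = haar_mra(clip, levels=5)
onsets = []
for level in (3, 5):                       # the two levels named in the abstract
    denoised = soft_threshold(details[level - 1])
    onsets += onset_candidates(denoised, level, min_gap=FS)
onsets = sorted(set(onsets))

# One-second chunks that a feature extractor and classifier would consume.
chunks = [crop_chunk(clip, s, FS) for s in onsets]
print(len(chunks), "candidate chunk(s) starting at samples", onsets)
```

Only the chunks around estimated transitions are passed on, which illustrates why such a front end inspects far fewer segments than a system that classifies every frame of the clip.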

Item Type: Article
Subjects: Pustakas > Biological Science
Depositing User: Unnamed user with email support@pustakas.com
Date Deposited: 05 Apr 2024 11:16
Last Modified: 05 Apr 2024 11:16
URI: http://archive.pcbmb.org/id/eprint/1938
