Classification of environmental sounds using 1D convolutional Neural network on Urbansound8k dataset
This is an implementation of the paper https://arxiv.org/abs/1904.08990 It can deal with audio signals of any length as it splits the signal into overlapped frames using a sliding window, hence no data augmentation is required. Different architectures considering several input sizes are evaluated, including the initialization of the first convolutional layer with a Gammatone filterbank that models the human auditory filter response in the cochlea.
https://www.kaggle.com/chrisfilo/urbansound8k Place the files into a folder named 'dataset' in the same working directory
- NumPy v1.18
- Tensorflow v2.2.0
- Keras