😎 Awesome lists about Speech Emotion Recognition
-
Updated
Dec 24, 2024
😎 Awesome lists about Speech Emotion Recognition
An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning on the RAVDESS dataset.
Transformer-based model for Speech Emotion Recognition(SER) - implemented by Pytorch
MFCC features + SVM for speech emotion classification
In this work is proposed a speech emotion recognition model based on the extraction of four different features got from RAVDESS sound files and stacking the resulting matrices in a one-dimensional array by taking the mean values along the time axis. Then this array is fed into a 1-D CNN model as input.
Exploration of different audio features and CNN-based architectures for building an effective Speech Emotion Recognition (SER) system. The goal is to improve the accuracy of detecting emotions embedded in speech signals. The repository contains code, notebook, and detailed explanations of the experiments conducted.
Hybrid Data Augmentation and Attention-based Dilated Convolutional-Recurrent Neural Networks for Speech Emotion Recognition
Speech Emotion Recognition (SER) using Deep neural networks CNN and RNN
Application for Disruptive Situations Detection in public transports through Speech Emotion Recognition.
Repository for PhD activity (dissertation S3) with focus on speech emotion recognition, JAIST 2017~2021
This repository contains the source code for my final year project for my undergraduate degree in MTU.
Classify emotions like happy, fear, neutral, etc. using CNN applying bootstrap aggregation approach
Speech Emotion Recognition on a subset of the RAVDESS dataset.
Domain-shift Aware Meta-Learning for Domain Generalization in SER
Speech Emotion Recognition using PyTorch sponsored by AIS and VISTEC-DEPA AIResearch Institute Thailand.
his is a Speech Emotion Recognition system that classifies emotions from speech samples using deep learning models. The project uses four datasets: CREMAD, RAVDESS, SAVEE, and TESS. The model achieves an accuracy of 96% by combining CNN, LSTM, and CLSTM architectures, along with data augmentation techniques and feature extraction methods.
A Convolutional Neural Network that distinguishes between the speakers emotions. Comes with multiple preprocessors to improve the models performance.
An attempt at the speech emotion recognition (SER) task on the CREMA-D dataset using TensorFlow 1D & 2D RCNN models.
Add a description, image, and links to the speech-emotion-classification topic page so that developers can more easily learn about it.
To associate your repository with the speech-emotion-classification topic, visit your repo's landing page and select "manage topics."