This repository provides training and evaluation code for the paper Speech Emotion Recognition using Semantic Information
(ICASSP 2021). If you use this codebase in your experiments please cite:
Tzirakis, P., Nguyen, A., Zafeiriou, S., & Schuller, B. W. (2021). Speech Emotion Recognition using Semantic Information. arXiv preprint arXiv:2103.02993.
This repository provides the following:
- Word2Vec embeddings trained on the German SWC corpus.
- Speech2Vec embeddings trained on the SEWA DB.
- Code to train/evaluate word2vec and our unified model, i. e., paralinguistic and semantic feature extrators with a LSTM cell on top.
Below are listed the required modules to run the code.
- aeneas
- librosa
- nltk
- numpy
- stop-words
- tensorflow
- torch
- Create the speech2vec segmentation by running
speech2word_mapping.py
in speech2vec folder. - Run
data_generator.py
to create tfrecords. - Run
train.py
to train the models, andeval.py
to evaluate.