Application of Machine Learning Algorithms in Speech Emotion Recognition

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

Speech emotion recognition has been widely used in recent years and has become a heated topic for research. Focused on the convolutional neural network model using spectrograms as input, the CNN-LSTM model based on feature vectors, original speech signal and Log-mel spectrograms, the performance of different models is compared as well as analyzed. The study found that there are some common problems existing in the classification performance of the model. The features and algorithms currently used can effectively distinguish emotions with varied 'arousal', but it is difficult to identify the feelings with similar arousal, among the models. The CNN-LSTM model with Log-mel spectrograms as input achieved the highest accuracy.

Original languageEnglish
Title of host publicationProceedings - 2021 International Conference on Signal Processing and Machine Learning, CONF-SPML 2021
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages113-116
Number of pages4
ISBN (Electronic)9781665417341
DOIs
StatePublished - 2021
Event2021 International Conference on Signal Processing and Machine Learning, CONF-SPML 2021 - Stanford, United States
Duration: 14 Nov 2021 → …

Publication series

NameProceedings - 2021 International Conference on Signal Processing and Machine Learning, CONF-SPML 2021

Conference

Conference2021 International Conference on Signal Processing and Machine Learning, CONF-SPML 2021
Country/TerritoryUnited States
CityStanford
Period14/11/21 → …

Keywords

  • Feature Extraction
  • Machine Learning
  • Neural Networks
  • Speech emotion recognition

Fingerprint

Dive into the research topics of 'Application of Machine Learning Algorithms in Speech Emotion Recognition'. Together they form a unique fingerprint.

Cite this