COMPARISON BETWEEN GMM-SVM SEQUENCE KERNEL AND GMM: APPLICATION TO SPEECH EMOTION RECOGNITION
Speech emotion recognition aims at automatically identifying the emotional or physical state of a human being from his or her voice. The emotional state is an important factor in human communication, because it provides feedback information in many applications. This paper makes a comparison of two...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Taylor's University
2016-09-01
|
Series: | Journal of Engineering Science and Technology |
Subjects: | |
Online Access: | http://jestec.taylors.edu.my/Vol%2011%20issue%209%20September%202016/11_9_1.pdf |
id |
doaj-ad4f21776e524f6e8f9a812d68c91e14 |
---|---|
record_format |
Article |
spelling |
doaj-ad4f21776e524f6e8f9a812d68c91e142020-11-24T23:29:02ZengTaylor's UniversityJournal of Engineering Science and Technology1823-46902016-09-0111912211233COMPARISON BETWEEN GMM-SVM SEQUENCE KERNEL AND GMM: APPLICATION TO SPEECH EMOTION RECOGNITION I. TRABELSI0D. BEN AYED1N. ELLOUZE2Université Tunis El Manar, Ecole Nationale d'Ingénieurs de Tunis-ENIT Laboratoire Signal, Image et Technologies de l'Information-LRSITI, 1002, Tunis, TunisiaUniversité Tunis El Manar, Ecole Nationale d'Ingénieurs de Tunis-ENIT Laboratoire Signal, Image et Technologies de l'Information-LRSITI, 1002, Tunis, TunisiaUniversité Tunis El Manar, Ecole Nationale d'Ingénieurs de Tunis-ENIT Laboratoire Signal, Image et Technologies de l'Information-LRSITI, 1002, Tunis, TunisiaSpeech emotion recognition aims at automatically identifying the emotional or physical state of a human being from his or her voice. The emotional state is an important factor in human communication, because it provides feedback information in many applications. This paper makes a comparison of two standard methods used for speaker recognition and verification: Gaussian Mixture Models (GMM) and Support Vector Machines (SVM) for emotion recognition. An extensive comparison of two methods: GMM and GMM/SVM sequence kernel is conducted. The main goal here is to analyze and compare influence of initial setting of parameters such as number of mixture components, used number of iterations and volume of training data for these two methods. Experimental studies are performed over the Berlin Emotional Database, expressing different emotions, in German language. The emotions used in this study are anger, fear, joy, boredom, neutral, disgust, and sadness. Experimental results show the effectiveness of the combination of GMM and SVM in order to classify sound data sequences when compared to systems based on GMM.http://jestec.taylors.edu.my/Vol%2011%20issue%209%20September%202016/11_9_1.pdfSpeechEmotionsSVMGMMKernelSequence |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
I. TRABELSI D. BEN AYED N. ELLOUZE |
spellingShingle |
I. TRABELSI D. BEN AYED N. ELLOUZE COMPARISON BETWEEN GMM-SVM SEQUENCE KERNEL AND GMM: APPLICATION TO SPEECH EMOTION RECOGNITION Journal of Engineering Science and Technology Speech Emotions SVM GMM Kernel Sequence |
author_facet |
I. TRABELSI D. BEN AYED N. ELLOUZE |
author_sort |
I. TRABELSI |
title |
COMPARISON BETWEEN GMM-SVM SEQUENCE KERNEL AND GMM: APPLICATION TO SPEECH EMOTION RECOGNITION |
title_short |
COMPARISON BETWEEN GMM-SVM SEQUENCE KERNEL AND GMM: APPLICATION TO SPEECH EMOTION RECOGNITION |
title_full |
COMPARISON BETWEEN GMM-SVM SEQUENCE KERNEL AND GMM: APPLICATION TO SPEECH EMOTION RECOGNITION |
title_fullStr |
COMPARISON BETWEEN GMM-SVM SEQUENCE KERNEL AND GMM: APPLICATION TO SPEECH EMOTION RECOGNITION |
title_full_unstemmed |
COMPARISON BETWEEN GMM-SVM SEQUENCE KERNEL AND GMM: APPLICATION TO SPEECH EMOTION RECOGNITION |
title_sort |
comparison between gmm-svm sequence kernel and gmm: application to speech emotion recognition |
publisher |
Taylor's University |
series |
Journal of Engineering Science and Technology |
issn |
1823-4690 |
publishDate |
2016-09-01 |
description |
Speech emotion recognition aims at automatically identifying the emotional or physical state of a human being from his or her voice. The emotional state is an important factor in human communication, because it provides feedback information in many applications. This paper makes a comparison of two standard methods used for speaker recognition and verification: Gaussian
Mixture Models (GMM) and Support Vector Machines (SVM) for emotion recognition. An extensive comparison of two methods: GMM and GMM/SVM sequence kernel is conducted. The main goal here is to analyze and compare influence of initial setting of parameters such as number of mixture components, used number of iterations and volume of training data for these two methods. Experimental studies are performed over the Berlin Emotional
Database, expressing different emotions, in German language. The emotions used in this study are anger, fear, joy, boredom, neutral, disgust, and sadness. Experimental results show the effectiveness of the combination of GMM and SVM in order to classify sound data sequences when compared to systems based on GMM. |
topic |
Speech Emotions SVM GMM Kernel Sequence |
url |
http://jestec.taylors.edu.my/Vol%2011%20issue%209%20September%202016/11_9_1.pdf |
work_keys_str_mv |
AT itrabelsi comparisonbetweengmmsvmsequencekernelandgmmapplicationtospeechemotionrecognition AT dbenayed comparisonbetweengmmsvmsequencekernelandgmmapplicationtospeechemotionrecognition AT nellouze comparisonbetweengmmsvmsequencekernelandgmmapplicationtospeechemotionrecognition |
_version_ |
1725546808493473792 |