COMPARISON BETWEEN GMM-SVM SEQUENCE KERNEL AND GMM: APPLICATION TO SPEECH EMOTION RECOGNITION

Speech emotion recognition aims at automatically identifying the emotional or physical state of a human being from his or her voice. The emotional state is an important factor in human communication, because it provides feedback information in many applications. This paper makes a comparison of two...

Full description

Bibliographic Details
Main Authors: I. TRABELSI, D. BEN AYED, N. ELLOUZE
Format: Article
Language:English
Published: Taylor's University 2016-09-01
Series:Journal of Engineering Science and Technology
Subjects:
SVM
GMM
Online Access:http://jestec.taylors.edu.my/Vol%2011%20issue%209%20September%202016/11_9_1.pdf
id doaj-ad4f21776e524f6e8f9a812d68c91e14
record_format Article
spelling doaj-ad4f21776e524f6e8f9a812d68c91e142020-11-24T23:29:02ZengTaylor's UniversityJournal of Engineering Science and Technology1823-46902016-09-0111912211233COMPARISON BETWEEN GMM-SVM SEQUENCE KERNEL AND GMM: APPLICATION TO SPEECH EMOTION RECOGNITION I. TRABELSI0D. BEN AYED1N. ELLOUZE2Université Tunis El Manar, Ecole Nationale d'Ingénieurs de Tunis-ENIT Laboratoire Signal, Image et Technologies de l'Information-LRSITI, 1002, Tunis, TunisiaUniversité Tunis El Manar, Ecole Nationale d'Ingénieurs de Tunis-ENIT Laboratoire Signal, Image et Technologies de l'Information-LRSITI, 1002, Tunis, TunisiaUniversité Tunis El Manar, Ecole Nationale d'Ingénieurs de Tunis-ENIT Laboratoire Signal, Image et Technologies de l'Information-LRSITI, 1002, Tunis, TunisiaSpeech emotion recognition aims at automatically identifying the emotional or physical state of a human being from his or her voice. The emotional state is an important factor in human communication, because it provides feedback information in many applications. This paper makes a comparison of two standard methods used for speaker recognition and verification: Gaussian Mixture Models (GMM) and Support Vector Machines (SVM) for emotion recognition. An extensive comparison of two methods: GMM and GMM/SVM sequence kernel is conducted. The main goal here is to analyze and compare influence of initial setting of parameters such as number of mixture components, used number of iterations and volume of training data for these two methods. Experimental studies are performed over the Berlin Emotional Database, expressing different emotions, in German language. The emotions used in this study are anger, fear, joy, boredom, neutral, disgust, and sadness. Experimental results show the effectiveness of the combination of GMM and SVM in order to classify sound data sequences when compared to systems based on GMM.http://jestec.taylors.edu.my/Vol%2011%20issue%209%20September%202016/11_9_1.pdfSpeechEmotionsSVMGMMKernelSequence
collection DOAJ
language English
format Article
sources DOAJ
author I. TRABELSI
D. BEN AYED
N. ELLOUZE
spellingShingle I. TRABELSI
D. BEN AYED
N. ELLOUZE
COMPARISON BETWEEN GMM-SVM SEQUENCE KERNEL AND GMM: APPLICATION TO SPEECH EMOTION RECOGNITION
Journal of Engineering Science and Technology
Speech
Emotions
SVM
GMM
Kernel
Sequence
author_facet I. TRABELSI
D. BEN AYED
N. ELLOUZE
author_sort I. TRABELSI
title COMPARISON BETWEEN GMM-SVM SEQUENCE KERNEL AND GMM: APPLICATION TO SPEECH EMOTION RECOGNITION
title_short COMPARISON BETWEEN GMM-SVM SEQUENCE KERNEL AND GMM: APPLICATION TO SPEECH EMOTION RECOGNITION
title_full COMPARISON BETWEEN GMM-SVM SEQUENCE KERNEL AND GMM: APPLICATION TO SPEECH EMOTION RECOGNITION
title_fullStr COMPARISON BETWEEN GMM-SVM SEQUENCE KERNEL AND GMM: APPLICATION TO SPEECH EMOTION RECOGNITION
title_full_unstemmed COMPARISON BETWEEN GMM-SVM SEQUENCE KERNEL AND GMM: APPLICATION TO SPEECH EMOTION RECOGNITION
title_sort comparison between gmm-svm sequence kernel and gmm: application to speech emotion recognition
publisher Taylor's University
series Journal of Engineering Science and Technology
issn 1823-4690
publishDate 2016-09-01
description Speech emotion recognition aims at automatically identifying the emotional or physical state of a human being from his or her voice. The emotional state is an important factor in human communication, because it provides feedback information in many applications. This paper makes a comparison of two standard methods used for speaker recognition and verification: Gaussian Mixture Models (GMM) and Support Vector Machines (SVM) for emotion recognition. An extensive comparison of two methods: GMM and GMM/SVM sequence kernel is conducted. The main goal here is to analyze and compare influence of initial setting of parameters such as number of mixture components, used number of iterations and volume of training data for these two methods. Experimental studies are performed over the Berlin Emotional Database, expressing different emotions, in German language. The emotions used in this study are anger, fear, joy, boredom, neutral, disgust, and sadness. Experimental results show the effectiveness of the combination of GMM and SVM in order to classify sound data sequences when compared to systems based on GMM.
topic Speech
Emotions
SVM
GMM
Kernel
Sequence
url http://jestec.taylors.edu.my/Vol%2011%20issue%209%20September%202016/11_9_1.pdf
work_keys_str_mv AT itrabelsi comparisonbetweengmmsvmsequencekernelandgmmapplicationtospeechemotionrecognition
AT dbenayed comparisonbetweengmmsvmsequencekernelandgmmapplicationtospeechemotionrecognition
AT nellouze comparisonbetweengmmsvmsequencekernelandgmmapplicationtospeechemotionrecognition
_version_ 1725546808493473792