Multi-Input Speech Emotion Recognition Model Using Mel Spectrogram and GeMAPS

The existing research on emotion recognition commonly uses mel spectrogram (MelSpec) and Geneva minimalistic acoustic parameter set (GeMAPS) as acoustic parameters to learn the audio features. MelSpec can represent the time-series variations of each frequency but cannot manage multiple types of audi...

詳細記述

書誌詳細
出版年:Sensors
主要な著者: Itsuki Toyoshima, Yoshifumi Okada, Momoko Ishimaru, Ryunosuke Uchiyama, Mayu Tada
フォーマット: 論文
言語:英語
出版事項: MDPI AG 2023-02-01
主題:
オンライン・アクセス:https://www.mdpi.com/1424-8220/23/3/1743