A Preliminary Study on Speaker Diarization for Automatic Transcription of Broadcast Radio Speech
碩士 === 國立臺北科技大學 === 電子工程系 === 106 === We use Time-delay Neural Network for Speaker Diarization. The average DER is 27.74%, which is better than 31.08% of GMM. We use trained automatic speaker diarization system to classify information of unmarked speakers in the NER-210 corpus, retrain the ASR by ma...
Main Authors: | Wu-Hua Hsu, 許吳華 |
---|---|
Other Authors: | 廖元甫 |
Format: | Others |
Language: | zh-TW |
Published: |
2018
|
Online Access: | http://ndltd.ncl.edu.tw/handle/a3z9vr |
Similar Items
-
Studies on Acoustic Features for Automatic Speech Recognition and Speaker Diarization in Real Environments
by: Ishizuka, Kentaro
Published: (2010) -
Detection and handling of overlapping speech for speaker diarization
by: Zelenák, Martin
Published: (2012) -
Speech Enhancement for Multimodal Speaker Diarization System
by: Rehan Ahmad, et al.
Published: (2020-01-01) -
Speaker diarization and speech recognition in the semi-automatization of audio description: An exploratory study on future possibilities?
by: Héctor Delgado, et al.
Published: (2015-06-01) -
Speaker Diarization Based on Speech Signal Approximation by Step Function
by: Rustam Latypov, et al.
Published: (2021-01-01)