A Preliminary Study on Speaker Diarization for Automatic Transcription of Broadcast Radio Speech

Master's thesis === National Taipei University of Technology === Department of Electronic Engineering === 106 === We use a Time-Delay Neural Network (TDNN) for speaker diarization. The average diarization error rate (DER) is 27.74%, which is better than the 31.08% obtained with a GMM. We use the trained automatic speaker diarization system to classify the speaker information of unlabeled speakers in the NER-210 corpus, retrain the ASR by ma...
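
For readers unfamiliar with the metric, the following is a minimal sketch of how a diarization error rate is commonly computed and how the two reported figures compare. It assumes the standard definition (missed speech plus false alarm plus speaker confusion, divided by total reference speech time); the record does not state which scoring tool or settings the thesis used. The function names and the example durations are hypothetical and chosen only for illustration.

```python
def diarization_error_rate(missed, false_alarm, confusion, total_reference):
    """Return DER as a fraction, assuming the standard definition.

    All arguments are durations in seconds. The names are illustrative,
    not taken from the thesis.
    """
    if total_reference <= 0:
        raise ValueError("total_reference must be positive")
    return (missed + false_alarm + confusion) / total_reference


def relative_reduction(baseline, system):
    """Relative error reduction of `system` with respect to `baseline`."""
    return (baseline - system) / baseline


# Hypothetical durations, only to show the arithmetic of the definition.
example_der = diarization_error_rate(
    missed=30.0, false_alarm=20.0, confusion=50.0, total_reference=400.0
)

# The two averages reported in the abstract (in percent).
absolute_gain = 31.08 - 27.74
relative_gain = relative_reduction(31.08, 27.74)

print(f"example DER: {example_der:.2%}")
print(f"absolute improvement: {absolute_gain:.2f} points")
print(f"relative improvement: {relative_gain:.1%}")
```

Under these assumptions, the reported figures correspond to an absolute improvement of 3.34 percentage points, or roughly an 11% relative reduction in DER over the GMM baseline.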

Bibliographic Details
Main Authors: Wu-Hua Hsu, 許吳華
Other Authors: 廖元甫
Format: Others
Language: zh-TW
Published: 2018
Online Access: http://ndltd.ncl.edu.tw/handle/a3z9vr