A Preliminary Study on Speaker Diarization for Automatic Transcription of Broadcast Radio Speech
碩士 === 國立臺北科技大學 === 電子工程系 === 106 === We use Time-delay Neural Network for Speaker Diarization. The average DER is 27.74%, which is better than 31.08% of GMM. We use trained automatic speaker diarization system to classify information of unmarked speakers in the NER-210 corpus, retrain the ASR by ma...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2018
|
Online Access: | http://ndltd.ncl.edu.tw/handle/a3z9vr |