Tone Recognition in Continuous Mandarin Speech
博士 === 國立交通大學 === 電子研究所 === 83 === The characteristics of the tones of Mandarin speech is in general not soley determined by its lexical tonality, but also affected by other factors. In this dissertation, three tone recognizer approaches are studied. Firs...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
1995
|
Online Access: | http://ndltd.ncl.edu.tw/handle/80549709559690772088 |
id |
ndltd-TW-083NCTU0430026 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-083NCTU04300262015-10-13T12:53:37Z http://ndltd.ncl.edu.tw/handle/80549709559690772088 Tone Recognition in Continuous Mandarin Speech 中文連續語音之聲調辨認 Yih-Ru Wang 王逸如 博士 國立交通大學 電子研究所 83 The characteristics of the tones of Mandarin speech is in general not soley determined by its lexical tonality, but also affected by other factors. In this dissertation, three tone recognizer approaches are studied. First, the HMM-based approach is discussed. Serveral schemes to consider the effects of the coarticulation and the intonation of utterence on tone recogni- tion are proposed. Effectiveness of these schemes were examined by simulation on a multi-speaker database. A recognition rate of 86.62% was archieved. Second, the neural net based approaches are studied. There also consider the coarticulation effect by including the features from neighboring syllables. And, the hidden control neural net(HCNN) and hidden state multi-layer perceptron(HSMLP) are used for compensating the declination effect. Experiment shows 86.72% recognition rate was achieved for a larger database. Last, the approach using a prosodic model to assist tone recognition is studied. A simple recurrent neural net(SRNN) is emploed to model the prosody of an utterance. By using the outputs of the hidden nodes of SRNN to assist an MLP tone recognizer, an improvement on recognition rate from 91.38% to 93.10% was achieved for a single male speaker database. Sin-Horng Chen 陳信宏 1995 學位論文 ; thesis 96 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
博士 === 國立交通大學 === 電子研究所 === 83 === The characteristics of the tones of Mandarin speech is in
general not soley determined by its lexical tonality, but also
affected by other factors. In this dissertation, three tone
recognizer approaches are studied. First, the HMM-based
approach is discussed. Serveral schemes to consider the effects
of the coarticulation and the intonation of utterence on tone
recogni- tion are proposed. Effectiveness of these schemes were
examined by simulation on a multi-speaker database. A
recognition rate of 86.62% was archieved. Second, the neural
net based approaches are studied. There also consider the
coarticulation effect by including the features from
neighboring syllables. And, the hidden control neural net(HCNN)
and hidden state multi-layer perceptron(HSMLP) are used for
compensating the declination effect. Experiment shows 86.72%
recognition rate was achieved for a larger database. Last, the
approach using a prosodic model to assist tone recognition is
studied. A simple recurrent neural net(SRNN) is emploed to
model the prosody of an utterance. By using the outputs of the
hidden nodes of SRNN to assist an MLP tone recognizer, an
improvement on recognition rate from 91.38% to 93.10% was
achieved for a single male speaker database.
|
author2 |
Sin-Horng Chen |
author_facet |
Sin-Horng Chen Yih-Ru Wang 王逸如 |
author |
Yih-Ru Wang 王逸如 |
spellingShingle |
Yih-Ru Wang 王逸如 Tone Recognition in Continuous Mandarin Speech |
author_sort |
Yih-Ru Wang |
title |
Tone Recognition in Continuous Mandarin Speech |
title_short |
Tone Recognition in Continuous Mandarin Speech |
title_full |
Tone Recognition in Continuous Mandarin Speech |
title_fullStr |
Tone Recognition in Continuous Mandarin Speech |
title_full_unstemmed |
Tone Recognition in Continuous Mandarin Speech |
title_sort |
tone recognition in continuous mandarin speech |
publishDate |
1995 |
url |
http://ndltd.ncl.edu.tw/handle/80549709559690772088 |
work_keys_str_mv |
AT yihruwang tonerecognitionincontinuousmandarinspeech AT wángyìrú tonerecognitionincontinuousmandarinspeech AT yihruwang zhōngwénliánxùyǔyīnzhīshēngdiàobiànrèn AT wángyìrú zhōngwénliánxùyǔyīnzhīshēngdiàobiànrèn |
_version_ |
1716868736360644608 |