Tone Recognition in Continuous Mandarin Speech

博士 === 國立交通大學 === 電子研究所 === 83 === The characteristics of the tones of Mandarin speech is in general not soley determined by its lexical tonality, but also affected by other factors. In this dissertation, three tone recognizer approaches are studied. Firs...

Full description

Bibliographic Details
Main Authors: Yih-Ru Wang, 王逸如
Other Authors: Sin-Horng Chen
Format: Others
Language:en_US
Published: 1995
Online Access:http://ndltd.ncl.edu.tw/handle/80549709559690772088
id ndltd-TW-083NCTU0430026
record_format oai_dc
spelling ndltd-TW-083NCTU04300262015-10-13T12:53:37Z http://ndltd.ncl.edu.tw/handle/80549709559690772088 Tone Recognition in Continuous Mandarin Speech 中文連續語音之聲調辨認 Yih-Ru Wang 王逸如 博士 國立交通大學 電子研究所 83 The characteristics of the tones of Mandarin speech is in general not soley determined by its lexical tonality, but also affected by other factors. In this dissertation, three tone recognizer approaches are studied. First, the HMM-based approach is discussed. Serveral schemes to consider the effects of the coarticulation and the intonation of utterence on tone recogni- tion are proposed. Effectiveness of these schemes were examined by simulation on a multi-speaker database. A recognition rate of 86.62% was archieved. Second, the neural net based approaches are studied. There also consider the coarticulation effect by including the features from neighboring syllables. And, the hidden control neural net(HCNN) and hidden state multi-layer perceptron(HSMLP) are used for compensating the declination effect. Experiment shows 86.72% recognition rate was achieved for a larger database. Last, the approach using a prosodic model to assist tone recognition is studied. A simple recurrent neural net(SRNN) is emploed to model the prosody of an utterance. By using the outputs of the hidden nodes of SRNN to assist an MLP tone recognizer, an improvement on recognition rate from 91.38% to 93.10% was achieved for a single male speaker database. Sin-Horng Chen 陳信宏 1995 學位論文 ; thesis 96 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 博士 === 國立交通大學 === 電子研究所 === 83 === The characteristics of the tones of Mandarin speech is in general not soley determined by its lexical tonality, but also affected by other factors. In this dissertation, three tone recognizer approaches are studied. First, the HMM-based approach is discussed. Serveral schemes to consider the effects of the coarticulation and the intonation of utterence on tone recogni- tion are proposed. Effectiveness of these schemes were examined by simulation on a multi-speaker database. A recognition rate of 86.62% was archieved. Second, the neural net based approaches are studied. There also consider the coarticulation effect by including the features from neighboring syllables. And, the hidden control neural net(HCNN) and hidden state multi-layer perceptron(HSMLP) are used for compensating the declination effect. Experiment shows 86.72% recognition rate was achieved for a larger database. Last, the approach using a prosodic model to assist tone recognition is studied. A simple recurrent neural net(SRNN) is emploed to model the prosody of an utterance. By using the outputs of the hidden nodes of SRNN to assist an MLP tone recognizer, an improvement on recognition rate from 91.38% to 93.10% was achieved for a single male speaker database.
author2 Sin-Horng Chen
author_facet Sin-Horng Chen
Yih-Ru Wang
王逸如
author Yih-Ru Wang
王逸如
spellingShingle Yih-Ru Wang
王逸如
Tone Recognition in Continuous Mandarin Speech
author_sort Yih-Ru Wang
title Tone Recognition in Continuous Mandarin Speech
title_short Tone Recognition in Continuous Mandarin Speech
title_full Tone Recognition in Continuous Mandarin Speech
title_fullStr Tone Recognition in Continuous Mandarin Speech
title_full_unstemmed Tone Recognition in Continuous Mandarin Speech
title_sort tone recognition in continuous mandarin speech
publishDate 1995
url http://ndltd.ncl.edu.tw/handle/80549709559690772088
work_keys_str_mv AT yihruwang tonerecognitionincontinuousmandarinspeech
AT wángyìrú tonerecognitionincontinuousmandarinspeech
AT yihruwang zhōngwénliánxùyǔyīnzhīshēngdiàobiànrèn
AT wángyìrú zhōngwénliánxùyǔyīnzhīshēngdiàobiànrèn
_version_ 1716868736360644608