Speech rhythm guided syllable nuclei detection

In this paper, we present a novel speech-rhythm-guided syllable-nuclei location detection algorithm. As a departure from conventional methods, we introduce an instantaneous speech rhythm estimator to predict possible regions where syllable nuclei can appear. Within a possible region, a simple slope...

Full description

Bibliographic Details
Main Authors: Glass, James R. (Contributor), Zhang, Yaodong, Ph. D. Massachusetts Institute of Technology (Author)
Other Authors: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory (Contributor), Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science (Contributor), Zhang, Yaodong (Contributor)
Format: Article
Language:English
Published: Institute of Electrical and Electronics Engineers, 2010-12-06T23:03:25Z.
Subjects:
Online Access:Get fulltext
LEADER 01569 am a22002173u 4500
001 60218
042 |a dc 
100 1 0 |a Glass, James R.  |e author 
100 1 0 |a Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory  |e contributor 
100 1 0 |a Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science  |e contributor 
100 1 0 |a Glass, James R.  |e contributor 
100 1 0 |a Glass, James R.  |e contributor 
100 1 0 |a Zhang, Yaodong  |e contributor 
700 1 0 |a Zhang, Yaodong, Ph. D. Massachusetts Institute of Technology  |e author 
245 0 0 |a Speech rhythm guided syllable nuclei detection 
260 |b Institute of Electrical and Electronics Engineers,   |c 2010-12-06T23:03:25Z. 
856 |z Get fulltext  |u http://hdl.handle.net/1721.1/60218 
520 |a In this paper, we present a novel speech-rhythm-guided syllable-nuclei location detection algorithm. As a departure from conventional methods, we introduce an instantaneous speech rhythm estimator to predict possible regions where syllable nuclei can appear. Within a possible region, a simple slope based peak counting algorithm is used to get the exact location of each syllable nucleus. We verify the correctness of our method by investigating the syllable nuclei interval distribution in TIMIT dataset, and evaluate the performance by comparing with a state-of-the-art syllable nuclei based speech rate detection approach. 
546 |a en_US 
655 7 |a Article 
773 |t IEEE International Conference on Acoustics, Speech and Signal Processing