Real‐time implementation and performance evaluation of speech classifiers in speech analysis‐synthesis

In this work, six voiced/unvoiced speech classifiers based on the autocorrelation function (ACF), average magnitude difference function (AMDF), cepstrum, weighted ACF (WACF), zero crossing rate and energy of the signal (ZCR‐E), and neural networks (NNs) have been simulated and implemented in real ti...

Full description

Bibliographic Details
Main Author:	Sandeep Kumar
Format:	Article
Language:	English
Published:	Electronics and Telecommunications Research Institute (ETRI) 2020-07-01
Series:	ETRI Journal
Subjects:	acf amdf cepstrum neural network real‐time system speech classification wacf zcr‐e
Online Access:	https://doi.org/10.4218/etrij.2019-0364

id	doaj-f6d8266c24fa4cb0993a612853131602
record_format	Article
spelling	doaj-f6d8266c24fa4cb0993a6128531316022021-02-25T03:49:35ZengElectronics and Telecommunications Research Institute (ETRI)ETRI Journal1225-64632020-07-01431829410.4218/etrij.2019-036410.4218/etrij.2019-0364Real‐time implementation and performance evaluation of speech classifiers in speech analysis‐synthesisSandeep KumarIn this work, six voiced/unvoiced speech classifiers based on the autocorrelation function (ACF), average magnitude difference function (AMDF), cepstrum, weighted ACF (WACF), zero crossing rate and energy of the signal (ZCR‐E), and neural networks (NNs) have been simulated and implemented in real time using the TMS320C6713 DSP starter kit. These speech classifiers have been integrated into a linear‐predictive‐coding‐based speech analysis‐synthesis system and their performance has been compared in terms of the percentage of the voiced/unvoiced classification accuracy, speech quality, and computation time. The results of the percentage of the voiced/unvoiced classification accuracy and speech quality show that the NN‐based speech classifier performs better than the ACF‐, AMDF‐, cepstrum‐, WACF‐ and ZCR‐E‐based speech classifiers for both clean and noisy environments. The computation time results show that the AMDF‐based speech classifier is computationally simple, and thus its computation time is less than that of other speech classifiers, while that of the NN‐based speech classifier is greater compared with other classifiers.https://doi.org/10.4218/etrij.2019-0364acfamdfcepstrumneural networkreal‐time systemspeech classificationwacfzcr‐e
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Sandeep Kumar
spellingShingle	Sandeep Kumar Real‐time implementation and performance evaluation of speech classifiers in speech analysis‐synthesis ETRI Journal acf amdf cepstrum neural network real‐time system speech classification wacf zcr‐e
author_facet	Sandeep Kumar
author_sort	Sandeep Kumar
title	Real‐time implementation and performance evaluation of speech classifiers in speech analysis‐synthesis
title_short	Real‐time implementation and performance evaluation of speech classifiers in speech analysis‐synthesis
title_full	Real‐time implementation and performance evaluation of speech classifiers in speech analysis‐synthesis
title_fullStr	Real‐time implementation and performance evaluation of speech classifiers in speech analysis‐synthesis
title_full_unstemmed	Real‐time implementation and performance evaluation of speech classifiers in speech analysis‐synthesis
title_sort	real‐time implementation and performance evaluation of speech classifiers in speech analysis‐synthesis
publisher	Electronics and Telecommunications Research Institute (ETRI)
series	ETRI Journal
issn	1225-6463
publishDate	2020-07-01
description	In this work, six voiced/unvoiced speech classifiers based on the autocorrelation function (ACF), average magnitude difference function (AMDF), cepstrum, weighted ACF (WACF), zero crossing rate and energy of the signal (ZCR‐E), and neural networks (NNs) have been simulated and implemented in real time using the TMS320C6713 DSP starter kit. These speech classifiers have been integrated into a linear‐predictive‐coding‐based speech analysis‐synthesis system and their performance has been compared in terms of the percentage of the voiced/unvoiced classification accuracy, speech quality, and computation time. The results of the percentage of the voiced/unvoiced classification accuracy and speech quality show that the NN‐based speech classifier performs better than the ACF‐, AMDF‐, cepstrum‐, WACF‐ and ZCR‐E‐based speech classifiers for both clean and noisy environments. The computation time results show that the AMDF‐based speech classifier is computationally simple, and thus its computation time is less than that of other speech classifiers, while that of the NN‐based speech classifier is greater compared with other classifiers.
topic	acf amdf cepstrum neural network real‐time system speech classification wacf zcr‐e
url	https://doi.org/10.4218/etrij.2019-0364
work_keys_str_mv	AT sandeepkumar realtimeimplementationandperformanceevaluationofspeechclassifiersinspeechanalysissynthesis
_version_	1724252246184558592

Real‐time implementation and performance evaluation of speech classifiers in speech analysis‐synthesis

Similar Items