Real‐time implementation and performance evaluation of speech classifiers in speech analysis‐synthesis

In this work, six voiced/unvoiced speech classifiers based on the autocorrelation function (ACF), average magnitude difference function (AMDF), cepstrum, weighted ACF (WACF), zero crossing rate and energy of the signal (ZCR‐E), and neural networks (NNs) have been simulated and implemented in real ti...

Full description

Bibliographic Details
Main Author: Sandeep Kumar
Format: Article
Language:English
Published: Electronics and Telecommunications Research Institute (ETRI) 2020-07-01
Series:ETRI Journal
Subjects:
acf
Online Access:https://doi.org/10.4218/etrij.2019-0364
id doaj-f6d8266c24fa4cb0993a612853131602
record_format Article
spelling doaj-f6d8266c24fa4cb0993a6128531316022021-02-25T03:49:35ZengElectronics and Telecommunications Research Institute (ETRI)ETRI Journal1225-64632020-07-01431829410.4218/etrij.2019-036410.4218/etrij.2019-0364Real‐time implementation and performance evaluation of speech classifiers in speech analysis‐synthesisSandeep KumarIn this work, six voiced/unvoiced speech classifiers based on the autocorrelation function (ACF), average magnitude difference function (AMDF), cepstrum, weighted ACF (WACF), zero crossing rate and energy of the signal (ZCR‐E), and neural networks (NNs) have been simulated and implemented in real time using the TMS320C6713 DSP starter kit. These speech classifiers have been integrated into a linear‐predictive‐coding‐based speech analysis‐synthesis system and their performance has been compared in terms of the percentage of the voiced/unvoiced classification accuracy, speech quality, and computation time. The results of the percentage of the voiced/unvoiced classification accuracy and speech quality show that the NN‐based speech classifier performs better than the ACF‐, AMDF‐, cepstrum‐, WACF‐ and ZCR‐E‐based speech classifiers for both clean and noisy environments. The computation time results show that the AMDF‐based speech classifier is computationally simple, and thus its computation time is less than that of other speech classifiers, while that of the NN‐based speech classifier is greater compared with other classifiers.https://doi.org/10.4218/etrij.2019-0364acfamdfcepstrumneural networkreal‐time systemspeech classificationwacfzcr‐e
collection DOAJ
language English
format Article
sources DOAJ
author Sandeep Kumar
spellingShingle Sandeep Kumar
Real‐time implementation and performance evaluation of speech classifiers in speech analysis‐synthesis
ETRI Journal
acf
amdf
cepstrum
neural network
real‐time system
speech classification
wacf
zcr‐e
author_facet Sandeep Kumar
author_sort Sandeep Kumar
title Real‐time implementation and performance evaluation of speech classifiers in speech analysis‐synthesis
title_short Real‐time implementation and performance evaluation of speech classifiers in speech analysis‐synthesis
title_full Real‐time implementation and performance evaluation of speech classifiers in speech analysis‐synthesis
title_fullStr Real‐time implementation and performance evaluation of speech classifiers in speech analysis‐synthesis
title_full_unstemmed Real‐time implementation and performance evaluation of speech classifiers in speech analysis‐synthesis
title_sort real‐time implementation and performance evaluation of speech classifiers in speech analysis‐synthesis
publisher Electronics and Telecommunications Research Institute (ETRI)
series ETRI Journal
issn 1225-6463
publishDate 2020-07-01
description In this work, six voiced/unvoiced speech classifiers based on the autocorrelation function (ACF), average magnitude difference function (AMDF), cepstrum, weighted ACF (WACF), zero crossing rate and energy of the signal (ZCR‐E), and neural networks (NNs) have been simulated and implemented in real time using the TMS320C6713 DSP starter kit. These speech classifiers have been integrated into a linear‐predictive‐coding‐based speech analysis‐synthesis system and their performance has been compared in terms of the percentage of the voiced/unvoiced classification accuracy, speech quality, and computation time. The results of the percentage of the voiced/unvoiced classification accuracy and speech quality show that the NN‐based speech classifier performs better than the ACF‐, AMDF‐, cepstrum‐, WACF‐ and ZCR‐E‐based speech classifiers for both clean and noisy environments. The computation time results show that the AMDF‐based speech classifier is computationally simple, and thus its computation time is less than that of other speech classifiers, while that of the NN‐based speech classifier is greater compared with other classifiers.
topic acf
amdf
cepstrum
neural network
real‐time system
speech classification
wacf
zcr‐e
url https://doi.org/10.4218/etrij.2019-0364
work_keys_str_mv AT sandeepkumar realtimeimplementationandperformanceevaluationofspeechclassifiersinspeechanalysissynthesis
_version_ 1724252246184558592