On the Use of Evolutionary Algorithms to Improve the Robustness of Continuous Speech Recognition Systems in Adverse Conditions

Limiting the decrease in performance due to acoustic environment changes remains a major challenge for continuous speech recognition (CSR) systems. We propose a novel approach which combines the Karhunen-LoÃƒÂ¨ve transform (KLT) in the mel-frequency domain with a genetic algorithm (GA) to enhance th...

Full description

Bibliographic Details
Main Authors:	Sid-Ahmed Selouani, Douglas O'Shaughnessy
Format:	Article
Language:	English
Published:	SpringerOpen 2003-07-01
Series:	EURASIP Journal on Advances in Signal Processing
Subjects:	speech recognition genetic algorithms Karhunen-Loève transform hidden Markov models robustness.
Online Access:	http://dx.doi.org/10.1155/S1110865703302070

id	doaj-1624c8ee493a4883867d49ba38f6798e
record_format	Article
spelling	doaj-1624c8ee493a4883867d49ba38f6798e2020-11-25T01:41:36ZengSpringerOpenEURASIP Journal on Advances in Signal Processing1687-61721687-61802003-07-012003881482310.1155/S1687617203302070On the Use of Evolutionary Algorithms to Improve the Robustness of Continuous Speech Recognition Systems in Adverse ConditionsSid-Ahmed SelouaniDouglas O'ShaughnessyLimiting the decrease in performance due to acoustic environment changes remains a major challenge for continuous speech recognition (CSR) systems. We propose a novel approach which combines the Karhunen-LoÃƒÂ¨ve transform (KLT) in the mel-frequency domain with a genetic algorithm (GA) to enhance the data representing corrupted speech. The idea consists of projecting noisy speech parameters onto the space generated by the genetically optimized principal axis issued from the KLT. The enhanced parameters increase the recognition rate for highly interfering noise environments. The proposed hybrid technique, when included in the front-end of an HTK-based CSR system, outperforms that of the conventional recognition process in severe interfering car noise environments for a wide range of signal-to-noise ratios (SNRs) varying from 16 dB to Ã¢ÂˆÂ’4 dB. We also showed the effectiveness of the KLT-GA method in recognizing speech subject to telephone channel degradations.http://dx.doi.org/10.1155/S1110865703302070speech recognitiongenetic algorithmsKarhunen-Loève transformhidden Markov modelsrobustness.
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Sid-Ahmed Selouani Douglas O'Shaughnessy
spellingShingle	Sid-Ahmed Selouani Douglas O'Shaughnessy On the Use of Evolutionary Algorithms to Improve the Robustness of Continuous Speech Recognition Systems in Adverse Conditions EURASIP Journal on Advances in Signal Processing speech recognition genetic algorithms Karhunen-Loève transform hidden Markov models robustness.
author_facet	Sid-Ahmed Selouani Douglas O'Shaughnessy
author_sort	Sid-Ahmed Selouani
title	On the Use of Evolutionary Algorithms to Improve the Robustness of Continuous Speech Recognition Systems in Adverse Conditions
title_short	On the Use of Evolutionary Algorithms to Improve the Robustness of Continuous Speech Recognition Systems in Adverse Conditions
title_full	On the Use of Evolutionary Algorithms to Improve the Robustness of Continuous Speech Recognition Systems in Adverse Conditions
title_fullStr	On the Use of Evolutionary Algorithms to Improve the Robustness of Continuous Speech Recognition Systems in Adverse Conditions
title_full_unstemmed	On the Use of Evolutionary Algorithms to Improve the Robustness of Continuous Speech Recognition Systems in Adverse Conditions
title_sort	on the use of evolutionary algorithms to improve the robustness of continuous speech recognition systems in adverse conditions
publisher	SpringerOpen
series	EURASIP Journal on Advances in Signal Processing
issn	1687-6172 1687-6180
publishDate	2003-07-01
description	Limiting the decrease in performance due to acoustic environment changes remains a major challenge for continuous speech recognition (CSR) systems. We propose a novel approach which combines the Karhunen-LoÃƒÂ¨ve transform (KLT) in the mel-frequency domain with a genetic algorithm (GA) to enhance the data representing corrupted speech. The idea consists of projecting noisy speech parameters onto the space generated by the genetically optimized principal axis issued from the KLT. The enhanced parameters increase the recognition rate for highly interfering noise environments. The proposed hybrid technique, when included in the front-end of an HTK-based CSR system, outperforms that of the conventional recognition process in severe interfering car noise environments for a wide range of signal-to-noise ratios (SNRs) varying from 16 dB to Ã¢ÂˆÂ’4 dB. We also showed the effectiveness of the KLT-GA method in recognizing speech subject to telephone channel degradations.
topic	speech recognition genetic algorithms Karhunen-Loève transform hidden Markov models robustness.
url	http://dx.doi.org/10.1155/S1110865703302070
work_keys_str_mv	AT sidahmedselouani ontheuseofevolutionaryalgorithmstoimprovetherobustnessofcontinuousspeechrecognitionsystemsinadverseconditions AT douglasoshaughnessy ontheuseofevolutionaryalgorithmstoimprovetherobustnessofcontinuousspeechrecognitionsystemsinadverseconditions
_version_	1725040769552613376

On the Use of Evolutionary Algorithms to Improve the Robustness of Continuous Speech Recognition Systems in Adverse Conditions

Similar Items