Unsupervised Clustering and Automatic Language Model Generation for ASR

The goal of an automatic speech recognition system is to enable the computer in understanding human speech and act accordingly. In order to realize this goal, language modeling plays an important role. It works as a knowledge source through mimicking human comprehension mechanism in understandin...

Full description

Bibliographic Details
Main Author:	Podder, Sushil
Format:	Others
Language:	en
Published:	University of Waterloo 2006
Subjects:	Systems Design ASR language model generation WordNet
Online Access:	http://hdl.handle.net/10012/933

id	ndltd-LACETR-oai-collectionscanada.gc.ca-OWTU.10012-933
record_format	oai_dc
spelling	ndltd-LACETR-oai-collectionscanada.gc.ca-OWTU.10012-9332013-10-04T04:07:18ZPodder, Sushil2006-08-22T14:02:42Z2006-08-22T14:02:42Z20042004http://hdl.handle.net/10012/933The goal of an automatic speech recognition system is to enable the computer in understanding human speech and act accordingly. In order to realize this goal, language modeling plays an important role. It works as a knowledge source through mimicking human comprehension mechanism in understanding the language. Among many other approaches, statistical language modeling technique is widely used in automatic speech recognition systems. However, the generation of reliable and robust statistical model is very difficult task, especially for a large vocabulary system. For a large vocabulary system, the performance of such a language model degrades as the vocabulary size increases. Hence, the performance of the speech recognition system also degrades due to the increased complexity and mutual confusion among the candidate words in the language model. In order to solve these problems, reduction of language model size as well as minimization of mutual confusion between words are required. In our work, we have employed clustering techniques, using self-organizing map, to build topical language models. Moreover, in order to capture the inherent semantics of sentences, a lexical dictionary, WordNet has been used in the clustering process. This thesis work focuses on various aspects of clustering, language model generation, extraction of task dependent acoustic parameters, and their implementations under the framework of the CMU Sphinx3 speech engine decoder. The preliminary results, presented in this thesis show the effectiveness of the topical language models.application/pdf1353777 bytesapplication/pdfenUniversity of WaterlooCopyright: 2004, Podder, Sushil. All rights reserved.Systems DesignASRlanguage model generationWordNetUnsupervised Clustering and Automatic Language Model Generation for ASRThesis or DissertationSystems Design EngineeringMaster of Applied Science
collection	NDLTD
language	en
format	Others
sources	NDLTD
topic	Systems Design ASR language model generation WordNet
spellingShingle	Systems Design ASR language model generation WordNet Podder, Sushil Unsupervised Clustering and Automatic Language Model Generation for ASR
description	The goal of an automatic speech recognition system is to enable the computer in understanding human speech and act accordingly. In order to realize this goal, language modeling plays an important role. It works as a knowledge source through mimicking human comprehension mechanism in understanding the language. Among many other approaches, statistical language modeling technique is widely used in automatic speech recognition systems. However, the generation of reliable and robust statistical model is very difficult task, especially for a large vocabulary system. For a large vocabulary system, the performance of such a language model degrades as the vocabulary size increases. Hence, the performance of the speech recognition system also degrades due to the increased complexity and mutual confusion among the candidate words in the language model. In order to solve these problems, reduction of language model size as well as minimization of mutual confusion between words are required. In our work, we have employed clustering techniques, using self-organizing map, to build topical language models. Moreover, in order to capture the inherent semantics of sentences, a lexical dictionary, WordNet has been used in the clustering process. This thesis work focuses on various aspects of clustering, language model generation, extraction of task dependent acoustic parameters, and their implementations under the framework of the CMU Sphinx3 speech engine decoder. The preliminary results, presented in this thesis show the effectiveness of the topical language models.
author	Podder, Sushil
author_facet	Podder, Sushil
author_sort	Podder, Sushil
title	Unsupervised Clustering and Automatic Language Model Generation for ASR
title_short	Unsupervised Clustering and Automatic Language Model Generation for ASR
title_full	Unsupervised Clustering and Automatic Language Model Generation for ASR
title_fullStr	Unsupervised Clustering and Automatic Language Model Generation for ASR
title_full_unstemmed	Unsupervised Clustering and Automatic Language Model Generation for ASR
title_sort	unsupervised clustering and automatic language model generation for asr
publisher	University of Waterloo
publishDate	2006
url	http://hdl.handle.net/10012/933
work_keys_str_mv	AT poddersushil unsupervisedclusteringandautomaticlanguagemodelgenerationforasr
_version_	1716599790366621696

Unsupervised Clustering and Automatic Language Model Generation for ASR

Similar Items