Speech/Text Information Retrieval with Speech Queries for Mandarin Chinese

博士 === 國立臺灣大學 === 電機工程學系 === 86 === Intelligent and efficient information retrieval techniques allowing easy access to huge amount and various types of information become highly desired and have been extensively studied in recent years. Thi...

Full description

Bibliographic Details
Main Authors: Bai, Bo-Ren, 白博仁
Other Authors: Lin-Shan Lee
Format: Others
Language:en_US
Published: 1998
Online Access:http://ndltd.ncl.edu.tw/handle/77177297346059554563
id ndltd-TW-086NTU00442109
record_format oai_dc
spelling ndltd-TW-086NTU004421092016-06-29T04:13:46Z http://ndltd.ncl.edu.tw/handle/77177297346059554563 Speech/Text Information Retrieval with Speech Queries for Mandarin Chinese 中文語音及文字資料庫之國語語音檢索 Bai, Bo-Ren 白博仁 博士 國立臺灣大學 電機工程學系 86 Intelligent and efficient information retrieval techniques allowing easy access to huge amount and various types of information become highly desired and have been extensively studied in recent years. This dissertation deals with the problem of Chinese text and Mandarin speech information retrieval with Mandarin speech queries. To achieve this purpose, key technologies including text keyword extraction and speech keyword spotting are first developed, then approaches integrating the syllable and keyword information for text and speech information retrieval using speech queries are proposed considering relevant technologies such as text processing, speech recognition, information retrieval, and so on. Finally, a prototype system with an interface supporting some user-friendly functions is successfully developed to demonstrate the feasibility of the technologies developed in this dissertation. An efficient text keyword extraction approach is first developed based on the PAT tree data structure to handle dynamic text collection. In the approach, all patterns in a text collection are first taken as possible keyword candidates, then for each keyword candidate, both the association relationships of sub-patterns inside the keyword candidate and the contextual dependency of the keyword candidate with its adjacent patterns on both sides will be examined, thus very good performance can be achieved. On the other hand, a syllable-based speech keyword spotting approach is developed to handle keyword sets with large and flexible vocabularies. The approach is implemented by a multi-phase framework with score normalization and speedup techniques, thus it is very accuracy and fast. With the text keyword extraction and speech keyword spotting techniques developed successfully, by adopting conventional continuous speech recognition techniques for syllable recognition, approaches integrating the syllable and word information for text and speech information retrieval using speech queries are then developed. It has been found that by properly integrating different information considering relevant speech recognition and information retrieval technologies, better retrieving performance can be achieved. On the other hand, techniques such as relevance feedback are also applied to support interactive retrieving. All the above technologies are finally integrated into a prototype system for text and speech information retrieval using speech queries. The system includes an interface supporting some user-friendly functions to play the voice records, display and read the text records, extract keywords from retrieved text records for term suggestion, etc. Also, relevance feedback functions are supported, thus the user can select either voice records, text records, and/or their keywords and feed them back to the system for further retrieving both text and voice records. The prototype system not only verifies the feasibility of the technologies developed in this dissertation, but also provides a very good environment for further investigation on a variety of research topics on information retrieval. Lin-Shan Lee Lee-Feng Chien 李琳山 簡立峰 --- 1998 學位論文 ; thesis 164 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
author2 Lin-Shan Lee
author_facet Lin-Shan Lee
Bai, Bo-Ren
白博仁
author Bai, Bo-Ren
白博仁
spellingShingle Bai, Bo-Ren
白博仁
Speech/Text Information Retrieval with Speech Queries for Mandarin Chinese
author_sort Bai, Bo-Ren
title Speech/Text Information Retrieval with Speech Queries for Mandarin Chinese
title_short Speech/Text Information Retrieval with Speech Queries for Mandarin Chinese
title_full Speech/Text Information Retrieval with Speech Queries for Mandarin Chinese
title_fullStr Speech/Text Information Retrieval with Speech Queries for Mandarin Chinese
title_full_unstemmed Speech/Text Information Retrieval with Speech Queries for Mandarin Chinese
title_sort speech/text information retrieval with speech queries for mandarin chinese
publishDate 1998
url http://ndltd.ncl.edu.tw/handle/77177297346059554563
work_keys_str_mv AT baiboren speechtextinformationretrievalwithspeechqueriesformandarinchinese
AT báibórén speechtextinformationretrievalwithspeechqueriesformandarinchinese
AT baiboren zhōngwényǔyīnjíwénzìzīliàokùzhīguóyǔyǔyīnjiǎnsuǒ
AT báibórén zhōngwényǔyīnjíwénzìzīliàokùzhīguóyǔyǔyīnjiǎnsuǒ
_version_ 1718327780832182272
description 博士 === 國立臺灣大學 === 電機工程學系 === 86 === Intelligent and efficient information retrieval techniques allowing easy access to huge amount and various types of information become highly desired and have been extensively studied in recent years. This dissertation deals with the problem of Chinese text and Mandarin speech information retrieval with Mandarin speech queries. To achieve this purpose, key technologies including text keyword extraction and speech keyword spotting are first developed, then approaches integrating the syllable and keyword information for text and speech information retrieval using speech queries are proposed considering relevant technologies such as text processing, speech recognition, information retrieval, and so on. Finally, a prototype system with an interface supporting some user-friendly functions is successfully developed to demonstrate the feasibility of the technologies developed in this dissertation. An efficient text keyword extraction approach is first developed based on the PAT tree data structure to handle dynamic text collection. In the approach, all patterns in a text collection are first taken as possible keyword candidates, then for each keyword candidate, both the association relationships of sub-patterns inside the keyword candidate and the contextual dependency of the keyword candidate with its adjacent patterns on both sides will be examined, thus very good performance can be achieved. On the other hand, a syllable-based speech keyword spotting approach is developed to handle keyword sets with large and flexible vocabularies. The approach is implemented by a multi-phase framework with score normalization and speedup techniques, thus it is very accuracy and fast. With the text keyword extraction and speech keyword spotting techniques developed successfully, by adopting conventional continuous speech recognition techniques for syllable recognition, approaches integrating the syllable and word information for text and speech information retrieval using speech queries are then developed. It has been found that by properly integrating different information considering relevant speech recognition and information retrieval technologies, better retrieving performance can be achieved. On the other hand, techniques such as relevance feedback are also applied to support interactive retrieving. All the above technologies are finally integrated into a prototype system for text and speech information retrieval using speech queries. The system includes an interface supporting some user-friendly functions to play the voice records, display and read the text records, extract keywords from retrieved text records for term suggestion, etc. Also, relevance feedback functions are supported, thus the user can select either voice records, text records, and/or their keywords and feed them back to the system for further retrieving both text and voice records. The prototype system not only verifies the feasibility of the technologies developed in this dissertation, but also provides a very good environment for further investigation on a variety of research topics on information retrieval.