Automatic topic detection of multi-lingual news stories.

Wong Kam Lai. === Thesis (M.Phil.)--Chinese University of Hong Kong, 2000. === Includes bibliographical references (leaves 92-98). === Abstracts in English and Chinese. === Chapter 1 --- Introduction --- p.1 === Chapter 1.1 --- Our Contributions --- p.5 === Chapter 1.2 --- Organization of this The...

Full description

Bibliographic Details
Other Authors: Wong, Kam Lai.
Format: Others
Language:English
Chinese
Published: 2000
Subjects:
Online Access:http://library.cuhk.edu.hk/record=b5890341
http://repository.lib.cuhk.edu.hk/en/item/cuhk-323212
id ndltd-cuhk.edu.hk-oai-cuhk-dr-cuhk_323212
record_format oai_dc
collection NDLTD
language English
Chinese
format Others
sources NDLTD
topic Broadcast journalism--Data processing
Information retrieval
English language--Data processing
Chinese language--Data processing
spellingShingle Broadcast journalism--Data processing
Information retrieval
English language--Data processing
Chinese language--Data processing
Automatic topic detection of multi-lingual news stories.
description Wong Kam Lai. === Thesis (M.Phil.)--Chinese University of Hong Kong, 2000. === Includes bibliographical references (leaves 92-98). === Abstracts in English and Chinese. === Chapter 1 --- Introduction --- p.1 === Chapter 1.1 --- Our Contributions --- p.5 === Chapter 1.2 --- Organization of this Thesis --- p.5 === Chapter 2 --- Literature Review --- p.7 === Chapter 2.1 --- Dragon Systems --- p.7 === Chapter 2.2 --- Carnegie Mellon University (CMU) --- p.9 === Chapter 2.3 --- University of Massachusetts (UMass) --- p.10 === Chapter 2.4 --- IBM T.J. Watson Research Center --- p.11 === Chapter 2.5 --- BBN Technologies --- p.12 === Chapter 2.6 --- National Taiwan University (NTU) --- p.13 === Chapter 2.7 --- Drawbacks of Existing Approaches --- p.14 === Chapter 3 --- Overview of Proposed Approach --- p.15 === Chapter 3.1 --- News Source --- p.15 === Chapter 3.2 --- Story Preprocessing --- p.18 === Chapter 3.3 --- Concept Term Generation --- p.20 === Chapter 3.4 --- Named Entity Extraction --- p.21 === Chapter 3.5 --- Gross Translation of Chinese to English --- p.21 === Chapter 3.6 --- Topic Detection method --- p.22 === Chapter 3.6.1 --- Deferral Period --- p.22 === Chapter 3.6.2 --- Detection Approach --- p.23 === Chapter 4 --- Concept Term Model --- p.25 === Chapter 4.1 --- Background of Contextual Analysis --- p.25 === Chapter 4.2 --- Concept Term Generation --- p.28 === Chapter 4.2.1 --- Concept Generation Algorithm --- p.28 === Chapter 4.2.2 --- Concept Term Representation for Detection --- p.33 === Chapter 5 --- Topic Detection Model --- p.35 === Chapter 5.1 --- Text Representation and Term Weights --- p.35 === Chapter 5.1.1 --- Story Representation --- p.35 === Chapter 5.1.2 --- Topic Representation --- p.43 === Chapter 5.1.3 --- Similarity Score --- p.43 === Chapter 5.1.4 --- Time adjustment scheme --- p.46 === Chapter 5.2 --- Gross Translation Method --- p.48 === Chapter 5.3 --- The Detection System --- p.50 === Chapter 5.3.1 --- Detection Requirement --- p.50 === Chapter 5.3.2 --- The Top Level Model --- p.52 === Chapter 5.4 --- The Clustering Algorithm --- p.55 === Chapter 5.4.1 --- Similarity Calculation --- p.55 === Chapter 5.4.2 --- Grouping Related Elements --- p.56 === Chapter 5.4.3 --- Topic Identification --- p.60 === Chapter 6 --- Experimental Results and Analysis --- p.63 === Chapter 6.1 --- Evaluation Model --- p.63 === Chapter 6.1.1 --- Evaluation Methodology --- p.64 === Chapter 6.2 --- Experiments on the effects of tuning the parameter --- p.68 === Chapter 6.2.1 --- Experiment Setup --- p.68 === Chapter 6.2.2 --- Results and Analysis --- p.69 === Chapter 6.3 --- Experiments on the effects of named entities and concept terms --- p.74 === Chapter 6.3.1 --- Experiment Setup --- p.74 === Chapter 6.3.2 --- Results and Analysis --- p.75 === Chapter 6.4 --- Experiments on the effect of using time adjustment --- p.77 === Chapter 6.4.1 --- Experiment Setup --- p.77 === Chapter 6.4.2 --- Results and Analysis --- p.79 === Chapter 6.5 --- Experiments on mono-lingual detection --- p.80 === Chapter 6.5.1 --- Experiment Setup --- p.80 === Chapter 6.5.2 --- Results and Analysis --- p.80 === Chapter 7 --- Conclusions and Future Work --- p.83 === Chapter 7.1 --- Conclusions --- p.83 === Chapter 7.2 --- Future Work --- p.85 === Chapter A --- List of Topics annotated for TDT3 Corpus --- p.86 === Chapter B --- Matching evaluation topics to hypothesized topics --- p.90 === Bibliography --- p.92
author2 Wong, Kam Lai.
author_facet Wong, Kam Lai.
title Automatic topic detection of multi-lingual news stories.
title_short Automatic topic detection of multi-lingual news stories.
title_full Automatic topic detection of multi-lingual news stories.
title_fullStr Automatic topic detection of multi-lingual news stories.
title_full_unstemmed Automatic topic detection of multi-lingual news stories.
title_sort automatic topic detection of multi-lingual news stories.
publishDate 2000
url http://library.cuhk.edu.hk/record=b5890341
http://repository.lib.cuhk.edu.hk/en/item/cuhk-323212
_version_ 1718982876060450816
spelling ndltd-cuhk.edu.hk-oai-cuhk-dr-cuhk_3232122019-02-26T03:35:14Z Automatic topic detection of multi-lingual news stories. Broadcast journalism--Data processing Information retrieval English language--Data processing Chinese language--Data processing Wong Kam Lai. Thesis (M.Phil.)--Chinese University of Hong Kong, 2000. Includes bibliographical references (leaves 92-98). Abstracts in English and Chinese. Chapter 1 --- Introduction --- p.1 Chapter 1.1 --- Our Contributions --- p.5 Chapter 1.2 --- Organization of this Thesis --- p.5 Chapter 2 --- Literature Review --- p.7 Chapter 2.1 --- Dragon Systems --- p.7 Chapter 2.2 --- Carnegie Mellon University (CMU) --- p.9 Chapter 2.3 --- University of Massachusetts (UMass) --- p.10 Chapter 2.4 --- IBM T.J. Watson Research Center --- p.11 Chapter 2.5 --- BBN Technologies --- p.12 Chapter 2.6 --- National Taiwan University (NTU) --- p.13 Chapter 2.7 --- Drawbacks of Existing Approaches --- p.14 Chapter 3 --- Overview of Proposed Approach --- p.15 Chapter 3.1 --- News Source --- p.15 Chapter 3.2 --- Story Preprocessing --- p.18 Chapter 3.3 --- Concept Term Generation --- p.20 Chapter 3.4 --- Named Entity Extraction --- p.21 Chapter 3.5 --- Gross Translation of Chinese to English --- p.21 Chapter 3.6 --- Topic Detection method --- p.22 Chapter 3.6.1 --- Deferral Period --- p.22 Chapter 3.6.2 --- Detection Approach --- p.23 Chapter 4 --- Concept Term Model --- p.25 Chapter 4.1 --- Background of Contextual Analysis --- p.25 Chapter 4.2 --- Concept Term Generation --- p.28 Chapter 4.2.1 --- Concept Generation Algorithm --- p.28 Chapter 4.2.2 --- Concept Term Representation for Detection --- p.33 Chapter 5 --- Topic Detection Model --- p.35 Chapter 5.1 --- Text Representation and Term Weights --- p.35 Chapter 5.1.1 --- Story Representation --- p.35 Chapter 5.1.2 --- Topic Representation --- p.43 Chapter 5.1.3 --- Similarity Score --- p.43 Chapter 5.1.4 --- Time adjustment scheme --- p.46 Chapter 5.2 --- Gross Translation Method --- p.48 Chapter 5.3 --- The Detection System --- p.50 Chapter 5.3.1 --- Detection Requirement --- p.50 Chapter 5.3.2 --- The Top Level Model --- p.52 Chapter 5.4 --- The Clustering Algorithm --- p.55 Chapter 5.4.1 --- Similarity Calculation --- p.55 Chapter 5.4.2 --- Grouping Related Elements --- p.56 Chapter 5.4.3 --- Topic Identification --- p.60 Chapter 6 --- Experimental Results and Analysis --- p.63 Chapter 6.1 --- Evaluation Model --- p.63 Chapter 6.1.1 --- Evaluation Methodology --- p.64 Chapter 6.2 --- Experiments on the effects of tuning the parameter --- p.68 Chapter 6.2.1 --- Experiment Setup --- p.68 Chapter 6.2.2 --- Results and Analysis --- p.69 Chapter 6.3 --- Experiments on the effects of named entities and concept terms --- p.74 Chapter 6.3.1 --- Experiment Setup --- p.74 Chapter 6.3.2 --- Results and Analysis --- p.75 Chapter 6.4 --- Experiments on the effect of using time adjustment --- p.77 Chapter 6.4.1 --- Experiment Setup --- p.77 Chapter 6.4.2 --- Results and Analysis --- p.79 Chapter 6.5 --- Experiments on mono-lingual detection --- p.80 Chapter 6.5.1 --- Experiment Setup --- p.80 Chapter 6.5.2 --- Results and Analysis --- p.80 Chapter 7 --- Conclusions and Future Work --- p.83 Chapter 7.1 --- Conclusions --- p.83 Chapter 7.2 --- Future Work --- p.85 Chapter A --- List of Topics annotated for TDT3 Corpus --- p.86 Chapter B --- Matching evaluation topics to hypothesized topics --- p.90 Bibliography --- p.92 Wong, Kam Lai. Chinese University of Hong Kong Graduate School. Division of Systems Engineering and Engineering Management. 2000 Text bibliography print xiii, 98 leaves : ill. ; 30 cm. cuhk:323212 http://library.cuhk.edu.hk/record=b5890341 eng chi Use of this resource is governed by the terms and conditions of the Creative Commons “Attribution-NonCommercial-NoDerivatives 4.0 International” License (http://creativecommons.org/licenses/by-nc-nd/4.0/) http://repository.lib.cuhk.edu.hk/en/islandora/object/cuhk%3A323212/datastream/TN/view/Automatic%20topic%20detection%20of%20multi-lingual%20news%20stories.jpghttp://repository.lib.cuhk.edu.hk/en/item/cuhk-323212