Automatic topic detection of multi-lingual news stories.
Wong Kam Lai. === Thesis (M.Phil.)--Chinese University of Hong Kong, 2000. === Includes bibliographical references (leaves 92-98). === Abstracts in English and Chinese. === Chapter 1 --- Introduction --- p.1 === Chapter 1.1 --- Our Contributions --- p.5 === Chapter 1.2 --- Organization of this The...
Other Authors: | |
---|---|
Format: | Others |
Language: | English Chinese |
Published: |
2000
|
Subjects: | |
Online Access: | http://library.cuhk.edu.hk/record=b5890341 http://repository.lib.cuhk.edu.hk/en/item/cuhk-323212 |
id |
ndltd-cuhk.edu.hk-oai-cuhk-dr-cuhk_323212 |
---|---|
record_format |
oai_dc |
collection |
NDLTD |
language |
English Chinese |
format |
Others
|
sources |
NDLTD |
topic |
Broadcast journalism--Data processing Information retrieval English language--Data processing Chinese language--Data processing |
spellingShingle |
Broadcast journalism--Data processing Information retrieval English language--Data processing Chinese language--Data processing Automatic topic detection of multi-lingual news stories. |
description |
Wong Kam Lai. === Thesis (M.Phil.)--Chinese University of Hong Kong, 2000. === Includes bibliographical references (leaves 92-98). === Abstracts in English and Chinese. === Chapter 1 --- Introduction --- p.1 === Chapter 1.1 --- Our Contributions --- p.5 === Chapter 1.2 --- Organization of this Thesis --- p.5 === Chapter 2 --- Literature Review --- p.7 === Chapter 2.1 --- Dragon Systems --- p.7 === Chapter 2.2 --- Carnegie Mellon University (CMU) --- p.9 === Chapter 2.3 --- University of Massachusetts (UMass) --- p.10 === Chapter 2.4 --- IBM T.J. Watson Research Center --- p.11 === Chapter 2.5 --- BBN Technologies --- p.12 === Chapter 2.6 --- National Taiwan University (NTU) --- p.13 === Chapter 2.7 --- Drawbacks of Existing Approaches --- p.14 === Chapter 3 --- Overview of Proposed Approach --- p.15 === Chapter 3.1 --- News Source --- p.15 === Chapter 3.2 --- Story Preprocessing --- p.18 === Chapter 3.3 --- Concept Term Generation --- p.20 === Chapter 3.4 --- Named Entity Extraction --- p.21 === Chapter 3.5 --- Gross Translation of Chinese to English --- p.21 === Chapter 3.6 --- Topic Detection method --- p.22 === Chapter 3.6.1 --- Deferral Period --- p.22 === Chapter 3.6.2 --- Detection Approach --- p.23 === Chapter 4 --- Concept Term Model --- p.25 === Chapter 4.1 --- Background of Contextual Analysis --- p.25 === Chapter 4.2 --- Concept Term Generation --- p.28 === Chapter 4.2.1 --- Concept Generation Algorithm --- p.28 === Chapter 4.2.2 --- Concept Term Representation for Detection --- p.33 === Chapter 5 --- Topic Detection Model --- p.35 === Chapter 5.1 --- Text Representation and Term Weights --- p.35 === Chapter 5.1.1 --- Story Representation --- p.35 === Chapter 5.1.2 --- Topic Representation --- p.43 === Chapter 5.1.3 --- Similarity Score --- p.43 === Chapter 5.1.4 --- Time adjustment scheme --- p.46 === Chapter 5.2 --- Gross Translation Method --- p.48 === Chapter 5.3 --- The Detection System --- p.50 === Chapter 5.3.1 --- Detection Requirement --- p.50 === Chapter 5.3.2 --- The Top Level Model --- p.52 === Chapter 5.4 --- The Clustering Algorithm --- p.55 === Chapter 5.4.1 --- Similarity Calculation --- p.55 === Chapter 5.4.2 --- Grouping Related Elements --- p.56 === Chapter 5.4.3 --- Topic Identification --- p.60 === Chapter 6 --- Experimental Results and Analysis --- p.63 === Chapter 6.1 --- Evaluation Model --- p.63 === Chapter 6.1.1 --- Evaluation Methodology --- p.64 === Chapter 6.2 --- Experiments on the effects of tuning the parameter --- p.68 === Chapter 6.2.1 --- Experiment Setup --- p.68 === Chapter 6.2.2 --- Results and Analysis --- p.69 === Chapter 6.3 --- Experiments on the effects of named entities and concept terms --- p.74 === Chapter 6.3.1 --- Experiment Setup --- p.74 === Chapter 6.3.2 --- Results and Analysis --- p.75 === Chapter 6.4 --- Experiments on the effect of using time adjustment --- p.77 === Chapter 6.4.1 --- Experiment Setup --- p.77 === Chapter 6.4.2 --- Results and Analysis --- p.79 === Chapter 6.5 --- Experiments on mono-lingual detection --- p.80 === Chapter 6.5.1 --- Experiment Setup --- p.80 === Chapter 6.5.2 --- Results and Analysis --- p.80 === Chapter 7 --- Conclusions and Future Work --- p.83 === Chapter 7.1 --- Conclusions --- p.83 === Chapter 7.2 --- Future Work --- p.85 === Chapter A --- List of Topics annotated for TDT3 Corpus --- p.86 === Chapter B --- Matching evaluation topics to hypothesized topics --- p.90 === Bibliography --- p.92 |
author2 |
Wong, Kam Lai. |
author_facet |
Wong, Kam Lai. |
title |
Automatic topic detection of multi-lingual news stories. |
title_short |
Automatic topic detection of multi-lingual news stories. |
title_full |
Automatic topic detection of multi-lingual news stories. |
title_fullStr |
Automatic topic detection of multi-lingual news stories. |
title_full_unstemmed |
Automatic topic detection of multi-lingual news stories. |
title_sort |
automatic topic detection of multi-lingual news stories. |
publishDate |
2000 |
url |
http://library.cuhk.edu.hk/record=b5890341 http://repository.lib.cuhk.edu.hk/en/item/cuhk-323212 |
_version_ |
1718982876060450816 |
spelling |
ndltd-cuhk.edu.hk-oai-cuhk-dr-cuhk_3232122019-02-26T03:35:14Z Automatic topic detection of multi-lingual news stories. Broadcast journalism--Data processing Information retrieval English language--Data processing Chinese language--Data processing Wong Kam Lai. Thesis (M.Phil.)--Chinese University of Hong Kong, 2000. Includes bibliographical references (leaves 92-98). Abstracts in English and Chinese. Chapter 1 --- Introduction --- p.1 Chapter 1.1 --- Our Contributions --- p.5 Chapter 1.2 --- Organization of this Thesis --- p.5 Chapter 2 --- Literature Review --- p.7 Chapter 2.1 --- Dragon Systems --- p.7 Chapter 2.2 --- Carnegie Mellon University (CMU) --- p.9 Chapter 2.3 --- University of Massachusetts (UMass) --- p.10 Chapter 2.4 --- IBM T.J. Watson Research Center --- p.11 Chapter 2.5 --- BBN Technologies --- p.12 Chapter 2.6 --- National Taiwan University (NTU) --- p.13 Chapter 2.7 --- Drawbacks of Existing Approaches --- p.14 Chapter 3 --- Overview of Proposed Approach --- p.15 Chapter 3.1 --- News Source --- p.15 Chapter 3.2 --- Story Preprocessing --- p.18 Chapter 3.3 --- Concept Term Generation --- p.20 Chapter 3.4 --- Named Entity Extraction --- p.21 Chapter 3.5 --- Gross Translation of Chinese to English --- p.21 Chapter 3.6 --- Topic Detection method --- p.22 Chapter 3.6.1 --- Deferral Period --- p.22 Chapter 3.6.2 --- Detection Approach --- p.23 Chapter 4 --- Concept Term Model --- p.25 Chapter 4.1 --- Background of Contextual Analysis --- p.25 Chapter 4.2 --- Concept Term Generation --- p.28 Chapter 4.2.1 --- Concept Generation Algorithm --- p.28 Chapter 4.2.2 --- Concept Term Representation for Detection --- p.33 Chapter 5 --- Topic Detection Model --- p.35 Chapter 5.1 --- Text Representation and Term Weights --- p.35 Chapter 5.1.1 --- Story Representation --- p.35 Chapter 5.1.2 --- Topic Representation --- p.43 Chapter 5.1.3 --- Similarity Score --- p.43 Chapter 5.1.4 --- Time adjustment scheme --- p.46 Chapter 5.2 --- Gross Translation Method --- p.48 Chapter 5.3 --- The Detection System --- p.50 Chapter 5.3.1 --- Detection Requirement --- p.50 Chapter 5.3.2 --- The Top Level Model --- p.52 Chapter 5.4 --- The Clustering Algorithm --- p.55 Chapter 5.4.1 --- Similarity Calculation --- p.55 Chapter 5.4.2 --- Grouping Related Elements --- p.56 Chapter 5.4.3 --- Topic Identification --- p.60 Chapter 6 --- Experimental Results and Analysis --- p.63 Chapter 6.1 --- Evaluation Model --- p.63 Chapter 6.1.1 --- Evaluation Methodology --- p.64 Chapter 6.2 --- Experiments on the effects of tuning the parameter --- p.68 Chapter 6.2.1 --- Experiment Setup --- p.68 Chapter 6.2.2 --- Results and Analysis --- p.69 Chapter 6.3 --- Experiments on the effects of named entities and concept terms --- p.74 Chapter 6.3.1 --- Experiment Setup --- p.74 Chapter 6.3.2 --- Results and Analysis --- p.75 Chapter 6.4 --- Experiments on the effect of using time adjustment --- p.77 Chapter 6.4.1 --- Experiment Setup --- p.77 Chapter 6.4.2 --- Results and Analysis --- p.79 Chapter 6.5 --- Experiments on mono-lingual detection --- p.80 Chapter 6.5.1 --- Experiment Setup --- p.80 Chapter 6.5.2 --- Results and Analysis --- p.80 Chapter 7 --- Conclusions and Future Work --- p.83 Chapter 7.1 --- Conclusions --- p.83 Chapter 7.2 --- Future Work --- p.85 Chapter A --- List of Topics annotated for TDT3 Corpus --- p.86 Chapter B --- Matching evaluation topics to hypothesized topics --- p.90 Bibliography --- p.92 Wong, Kam Lai. Chinese University of Hong Kong Graduate School. Division of Systems Engineering and Engineering Management. 2000 Text bibliography print xiii, 98 leaves : ill. ; 30 cm. cuhk:323212 http://library.cuhk.edu.hk/record=b5890341 eng chi Use of this resource is governed by the terms and conditions of the Creative Commons “Attribution-NonCommercial-NoDerivatives 4.0 International” License (http://creativecommons.org/licenses/by-nc-nd/4.0/) http://repository.lib.cuhk.edu.hk/en/islandora/object/cuhk%3A323212/datastream/TN/view/Automatic%20topic%20detection%20of%20multi-lingual%20news%20stories.jpghttp://repository.lib.cuhk.edu.hk/en/item/cuhk-323212 |