Automatic Hyperlink Generation For Chinese Documents

碩士 === 大同工學院 === 資訊工程研究所 === 86 === With the rapid growth of digital documents, the need for a friendly and efficient browsing environment is getting obvious. For English documents, automatic hypertext conversion has stepped into application area. But for Chinese documents, there are still some issu...

Full description

Bibliographic Details
Main Authors: Huang Chen-Chu, 黃承渠
Other Authors: Joung Yuh-Jzer
Format: Others
Language:zh-TW
Published: 1998
Online Access:http://ndltd.ncl.edu.tw/handle/01790402579067243244
id ndltd-TW-086TTIT0392017
record_format oai_dc
spelling ndltd-TW-086TTIT03920172015-10-13T17:34:49Z http://ndltd.ncl.edu.tw/handle/01790402579067243244 Automatic Hyperlink Generation For Chinese Documents 中文文件自動建立鏈結 Huang Chen-Chu 黃承渠 碩士 大同工學院 資訊工程研究所 86 With the rapid growth of digital documents, the need for a friendly and efficient browsing environment is getting obvious. For English documents, automatic hypertext conversion has stepped into application area. But for Chinese documents, there are still some issues have to be solved before we can ahead the road. The thesis proposes a method to convert existing Chinese news documents into hypertext by add HTML anchor tag on keywords. When users browse hypertext version news, they can query related document simply click on the keyword. The system do a premier statistical approach word identification by looking up a general term dictionary. Then using memorized and recursive algorithm to discover new terms. At last, calculating term weight by combining some filter rule to emphasis prop nouns.For increasing system performance, we propose an efficient way to approximately calculate term weight. It uses an auxiliary table to store term distribution information and decrease the overhead of repeatedly calculating it each time a new document arrives.The experiment result shows that the proposed method does improve system performance and maintain hyperlink quality at the same time. Thus we conclude that it can be adapted for an incremental document collection. Joung Yuh-Jzer Yeh Ching-Long 莊裕澤 葉慶隆 1998 學位論文 ; thesis 0 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 大同工學院 === 資訊工程研究所 === 86 === With the rapid growth of digital documents, the need for a friendly and efficient browsing environment is getting obvious. For English documents, automatic hypertext conversion has stepped into application area. But for Chinese documents, there are still some issues have to be solved before we can ahead the road. The thesis proposes a method to convert existing Chinese news documents into hypertext by add HTML anchor tag on keywords. When users browse hypertext version news, they can query related document simply click on the keyword. The system do a premier statistical approach word identification by looking up a general term dictionary. Then using memorized and recursive algorithm to discover new terms. At last, calculating term weight by combining some filter rule to emphasis prop nouns.For increasing system performance, we propose an efficient way to approximately calculate term weight. It uses an auxiliary table to store term distribution information and decrease the overhead of repeatedly calculating it each time a new document arrives.The experiment result shows that the proposed method does improve system performance and maintain hyperlink quality at the same time. Thus we conclude that it can be adapted for an incremental document collection.
author2 Joung Yuh-Jzer
author_facet Joung Yuh-Jzer
Huang Chen-Chu
黃承渠
author Huang Chen-Chu
黃承渠
spellingShingle Huang Chen-Chu
黃承渠
Automatic Hyperlink Generation For Chinese Documents
author_sort Huang Chen-Chu
title Automatic Hyperlink Generation For Chinese Documents
title_short Automatic Hyperlink Generation For Chinese Documents
title_full Automatic Hyperlink Generation For Chinese Documents
title_fullStr Automatic Hyperlink Generation For Chinese Documents
title_full_unstemmed Automatic Hyperlink Generation For Chinese Documents
title_sort automatic hyperlink generation for chinese documents
publishDate 1998
url http://ndltd.ncl.edu.tw/handle/01790402579067243244
work_keys_str_mv AT huangchenchu automatichyperlinkgenerationforchinesedocuments
AT huángchéngqú automatichyperlinkgenerationforchinesedocuments
AT huangchenchu zhōngwénwénjiànzìdòngjiànlìliànjié
AT huángchéngqú zhōngwénwénjiànzìdòngjiànlìliànjié
_version_ 1717781434663436288