Automatic Hyperlink Generation For Chinese Documents
碩士 === 大同工學院 === 資訊工程研究所 === 86 === With the rapid growth of digital documents, the need for a friendly and efficient browsing environment is getting obvious. For English documents, automatic hypertext conversion has stepped into application area. But for Chinese documents, there are still some issu...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
1998
|
Online Access: | http://ndltd.ncl.edu.tw/handle/01790402579067243244 |
id |
ndltd-TW-086TTIT0392017 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-086TTIT03920172015-10-13T17:34:49Z http://ndltd.ncl.edu.tw/handle/01790402579067243244 Automatic Hyperlink Generation For Chinese Documents 中文文件自動建立鏈結 Huang Chen-Chu 黃承渠 碩士 大同工學院 資訊工程研究所 86 With the rapid growth of digital documents, the need for a friendly and efficient browsing environment is getting obvious. For English documents, automatic hypertext conversion has stepped into application area. But for Chinese documents, there are still some issues have to be solved before we can ahead the road. The thesis proposes a method to convert existing Chinese news documents into hypertext by add HTML anchor tag on keywords. When users browse hypertext version news, they can query related document simply click on the keyword. The system do a premier statistical approach word identification by looking up a general term dictionary. Then using memorized and recursive algorithm to discover new terms. At last, calculating term weight by combining some filter rule to emphasis prop nouns.For increasing system performance, we propose an efficient way to approximately calculate term weight. It uses an auxiliary table to store term distribution information and decrease the overhead of repeatedly calculating it each time a new document arrives.The experiment result shows that the proposed method does improve system performance and maintain hyperlink quality at the same time. Thus we conclude that it can be adapted for an incremental document collection. Joung Yuh-Jzer Yeh Ching-Long 莊裕澤 葉慶隆 1998 學位論文 ; thesis 0 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 大同工學院 === 資訊工程研究所 === 86 === With the rapid growth of digital documents, the need for a friendly and efficient browsing environment is getting obvious. For English documents, automatic hypertext conversion has stepped into application area. But for Chinese documents, there are still some issues have to be solved before we can ahead the road. The thesis proposes a method to convert existing Chinese news documents into hypertext by add HTML anchor tag on keywords. When users browse hypertext version news, they can query related document simply click on the keyword. The system do a premier statistical approach word identification by looking up a general term dictionary. Then using memorized and recursive algorithm to discover new terms. At last, calculating term weight by combining some filter rule to emphasis prop nouns.For increasing system performance, we propose an efficient way to approximately calculate term weight. It uses an auxiliary table to store term distribution information and decrease the overhead of repeatedly calculating it each time a new document arrives.The experiment result shows that the proposed method does improve system performance and maintain hyperlink quality at the same time. Thus we conclude that it can be adapted for an incremental document collection.
|
author2 |
Joung Yuh-Jzer |
author_facet |
Joung Yuh-Jzer Huang Chen-Chu 黃承渠 |
author |
Huang Chen-Chu 黃承渠 |
spellingShingle |
Huang Chen-Chu 黃承渠 Automatic Hyperlink Generation For Chinese Documents |
author_sort |
Huang Chen-Chu |
title |
Automatic Hyperlink Generation For Chinese Documents |
title_short |
Automatic Hyperlink Generation For Chinese Documents |
title_full |
Automatic Hyperlink Generation For Chinese Documents |
title_fullStr |
Automatic Hyperlink Generation For Chinese Documents |
title_full_unstemmed |
Automatic Hyperlink Generation For Chinese Documents |
title_sort |
automatic hyperlink generation for chinese documents |
publishDate |
1998 |
url |
http://ndltd.ncl.edu.tw/handle/01790402579067243244 |
work_keys_str_mv |
AT huangchenchu automatichyperlinkgenerationforchinesedocuments AT huángchéngqú automatichyperlinkgenerationforchinesedocuments AT huangchenchu zhōngwénwénjiànzìdòngjiànlìliànjié AT huángchéngqú zhōngwénwénjiànzìdòngjiànlìliànjié |
_version_ |
1717781434663436288 |