Chinese Document Classification-A Case Study of an IC Equipment Manufacturer
Master's thesis === Yuan Ze University === Department of Information Management === 95 === The IC equipment manufacturing industry, following the development of Taiwan's IC manufacturing industry, has progressed from low-end products to the current stage of high-end, high-precision products. Over the past 40 years, IC equipment makers have...
Main Authors: | Hung-Mou Lin, 林宏謀 |
---|---|
Other Authors: | 陸承志 |
Format: | Others |
Language: | zh-TW |
Published: | 2007 |
Online Access: | http://ndltd.ncl.edu.tw/handle/43839742444721194393 |
id |
ndltd-TW-095YZU05396073 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-095YZU05396073 2016-05-23T04:17:53Z http://ndltd.ncl.edu.tw/handle/43839742444721194393 Chinese Document Classification-A Case Study of an IC Equipment Manufacturer 中文文件分類研究-以IC設備業為例 Hung-Mou Lin 林宏謀 Master's thesis Yuan Ze University Department of Information Management 95 The IC equipment manufacturing industry, following the development of Taiwan's IC manufacturing industry, has progressed from low-end products to the current stage of high-end, high-precision products. Over the past 40 years, IC equipment makers have accumulated a large number of documents that are not well classified and are therefore difficult to search. Only recently has the e-business trend pushed IC equipment makers to digitize and manually classify these valuable documents. The manual classification process is slow and tedious. This study therefore proposes a method based on the vector space model to classify enterprise documents automatically. The proposed method combines several weighting factors, including term frequency, term uniformity, and document-specific features, to improve classification performance. The experimental results show that the vector space model (VSM) alone reaches an accuracy of 68.93%. Using term uniformity to adjust each term's class weight raises the accuracy to 76.42%. Finally, adding the document-specific features raises the accuracy to 86.62%. These results confirm that combining several weighting factors improves classification performance. 陸承志 2007 thesis 51 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others |
sources |
NDLTD |
description |
Master's thesis === Yuan Ze University === Department of Information Management === 95 === The IC equipment manufacturing industry, following the development of Taiwan's IC manufacturing industry, has progressed from low-end products to the current stage of high-end, high-precision products. Over the past 40 years, IC equipment makers have accumulated a large number of documents that are not well classified and are therefore difficult to search. Only recently has the e-business trend pushed IC equipment makers to digitize and manually classify these valuable documents.
The manual classification process is slow and tedious. This study therefore proposes a method based on the vector space model to classify enterprise documents automatically. The proposed method combines several weighting factors, including term frequency, term uniformity, and document-specific features, to improve classification performance.
The experimental results show that the vector space model (VSM) alone reaches an accuracy of 68.93%. Using term uniformity to adjust each term's class weight raises the accuracy to 76.42%. Finally, adding the document-specific features raises the accuracy to 86.62%. These results confirm that combining several weighting factors improves classification performance.
|
author2 |
陸承志 |
author_facet |
陸承志 Hung-Mou Lin 林宏謀 |
author |
Hung-Mou Lin 林宏謀 |
spellingShingle |
Hung-Mou Lin 林宏謀 Chinese Document Classification-A Case Study of an IC Equipment Manufacturer |
author_sort |
Hung-Mou Lin |
title |
Chinese Document Classification-A Case Study of an IC Equipment Manufacturer |
title_short |
Chinese Document Classification-A Case Study of an IC Equipment Manufacturer |
title_full |
Chinese Document Classification-A Case Study of an IC Equipment Manufacturer |
title_fullStr |
Chinese Document Classification-A Case Study of an IC Equipment Manufacturer |
title_full_unstemmed |
Chinese Document Classification-A Case Study of an IC Equipment Manufacturer |
title_sort |
chinese document classification-a case study of an ic equipment manufacturer |
publishDate |
2007 |
url |
http://ndltd.ncl.edu.tw/handle/43839742444721194393 |
work_keys_str_mv |
AT hungmoulin chinesedocumentclassificationacasestudyofanicequipmentmanufacturer AT línhóngmóu chinesedocumentclassificationacasestudyofanicequipmentmanufacturer AT hungmoulin zhōngwénwénjiànfēnlèiyánjiūyǐicshèbèiyèwèilì AT línhóngmóu zhōngwénwénjiànfēnlèiyánjiūyǐicshèbèiyèwèilì |
_version_ |
1718278857190014976 |
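
The vector space model baseline named in the abstract can be illustrated with a minimal sketch. This is not the thesis's implementation: the record does not define the term uniformity weight or the document-specific features, so only plain term-frequency vectors and cosine similarity are shown. The per-class centroid (summed term counts), the function names, and the toy English tokens are all assumptions made for illustration; real Chinese documents would first need word segmentation.

```python
import math
from collections import Counter


def tf_vector(tokens):
    """Return a raw term-frequency vector for a list of tokens."""
    return Counter(tokens)


def cosine(a, b):
    """Cosine similarity between two sparse term-weight vectors."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def train_centroids(labeled_docs):
    """Sum the term counts of each class into one centroid vector.

    Assumption: a class is represented by the summed term frequencies
    of its training documents. The thesis may represent classes differently.
    """
    centroids = {}
    for label, tokens in labeled_docs:
        centroids.setdefault(label, Counter()).update(tokens)
    return centroids


def classify(tokens, centroids):
    """Assign the class whose centroid is most similar to the document."""
    vec = tf_vector(tokens)
    return max(centroids, key=lambda label: cosine(vec, centroids[label]))


# Hypothetical, already-tokenized training documents.
training = [
    ("maintenance", ["pump", "repair", "schedule", "pump"]),
    ("maintenance", ["repair", "valve", "inspection"]),
    ("sales", ["quote", "price", "customer"]),
    ("sales", ["customer", "order", "price", "delivery"]),
]

centroids = train_centroids(training)
predicted = classify(["pump", "inspection", "repair"], centroids)
print(predicted)
```

Additional weighting factors, such as the term uniformity adjustment the abstract mentions, would plausibly enter as multipliers on the term weights before the cosine comparison, but their exact form is not given in this record.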