Ontology-Based Neural Network Electronic Document Categorization System
碩士 === 國立清華大學 === 工業工程與工程管理學系 === 93 === The development of modern computer science leads to information being generated speedily. More knowledge documents about patent are difficult to be classified consistently and promptly. In order to solve this problem, many specialists take document management...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2005
|
Online Access: | http://ndltd.ncl.edu.tw/handle/33371626264431583534 |
id |
ndltd-TW-093NTHU5031097 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-093NTHU50310972015-10-13T11:15:49Z http://ndltd.ncl.edu.tw/handle/33371626264431583534 Ontology-Based Neural Network Electronic Document Categorization System 以本體論為基之類神經網路電子文件自動分類管理系統 Chia-Hung Hsieh 謝佳宏 碩士 國立清華大學 工業工程與工程管理學系 93 The development of modern computer science leads to information being generated speedily. More knowledge documents about patent are difficult to be classified consistently and promptly. In order to solve this problem, many specialists take document management, especially for technical reports and patent documents, as a significant research issue combining the expertise of IT, IP, and domain experts. In traditional categorization for patent documents, domain experts classify documents based on their experiences after reviewing documental contents. The development of automatic document categorization becomes an important research because traditional methods often bring inconsistent classification results. In this thesis, a categorization method using artificial neural network (ANN) is developed to classify patent documents based on pre-constructed ontology. Firstly, this system extracts the features of a document by using morphological analysis and sentence analysis. Secondly, these features are matched with classes and relations of pre-constructed ontology, and transferred into the inputs of ANN using two weight-transferring functions proposed in this research. Thirdly, a well-trained ANN model is applied to calculate and infer the relationships between given document and categories. We take two domains, Chemical Mechanical Polishing (CMP) and business knowledge documents, as our case studies to demonstrate the proposed system at work. International Patent Classification (IPC) is constructed as the classification schema and hierarchy. Amy J.C. Trappey 張瑞芬 2005 學位論文 ; thesis 94 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立清華大學 === 工業工程與工程管理學系 === 93 === The development of modern computer science leads to information being generated speedily. More knowledge documents about patent are difficult to be classified consistently and promptly. In order to solve this problem, many specialists take document management, especially for technical reports and patent documents, as a significant research issue combining the expertise of IT, IP, and domain experts. In traditional categorization for patent documents, domain experts classify documents based on their experiences after reviewing documental contents. The development of automatic document categorization becomes an important research because traditional methods often bring inconsistent classification results. In this thesis, a categorization method using artificial neural network (ANN) is developed to classify patent documents based on pre-constructed ontology. Firstly, this system extracts the features of a document by using morphological analysis and sentence analysis. Secondly, these features are matched with classes and relations of pre-constructed ontology, and transferred into the inputs of ANN using two weight-transferring functions proposed in this research. Thirdly, a well-trained ANN model is applied to calculate and infer the relationships between given document and categories. We take two domains, Chemical Mechanical Polishing (CMP) and business knowledge documents, as our case studies to demonstrate the proposed system at work. International Patent Classification (IPC) is constructed as the classification schema and hierarchy.
|
author2 |
Amy J.C. Trappey |
author_facet |
Amy J.C. Trappey Chia-Hung Hsieh 謝佳宏 |
author |
Chia-Hung Hsieh 謝佳宏 |
spellingShingle |
Chia-Hung Hsieh 謝佳宏 Ontology-Based Neural Network Electronic Document Categorization System |
author_sort |
Chia-Hung Hsieh |
title |
Ontology-Based Neural Network Electronic Document Categorization System |
title_short |
Ontology-Based Neural Network Electronic Document Categorization System |
title_full |
Ontology-Based Neural Network Electronic Document Categorization System |
title_fullStr |
Ontology-Based Neural Network Electronic Document Categorization System |
title_full_unstemmed |
Ontology-Based Neural Network Electronic Document Categorization System |
title_sort |
ontology-based neural network electronic document categorization system |
publishDate |
2005 |
url |
http://ndltd.ncl.edu.tw/handle/33371626264431583534 |
work_keys_str_mv |
AT chiahunghsieh ontologybasedneuralnetworkelectronicdocumentcategorizationsystem AT xièjiāhóng ontologybasedneuralnetworkelectronicdocumentcategorizationsystem AT chiahunghsieh yǐběntǐlùnwèijīzhīlèishénjīngwǎnglùdiànziwénjiànzìdòngfēnlèiguǎnlǐxìtǒng AT xièjiāhóng yǐběntǐlùnwèijīzhīlèishénjīngwǎnglùdiànziwénjiànzìdòngfēnlèiguǎnlǐxìtǒng |
_version_ |
1716840536833261568 |