Word Sense Disambiguation using Aggregated Similarity based on WordNet Graph Representation
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: | Inforec Association, 2013-01-01 |
Series: | Informatică economică |
Subjects: | |
Online Access: | http://revistaie.ase.ro/content/67/15%20-%20Zurini.pdf |
Summary: | The term word sense disambiguation (WSD) is introduced in the context of text document processing. A knowledge-based approach is taken using the WordNet lexical ontology; its structure and the components used to identify the context-related sense of each polysemous word are described. The principal distance measures defined on the graph associated with WordNet are presented, together with an analysis of their advantages and disadvantages. A general model for aggregating distances and probabilities is proposed and implemented in an application that detects the contextual sense of each word. For words that do not exist in WordNet, a similarity measure based on co-occurrence probabilities is used. The WSD module is proposed for integration into the document-processing step of tasks such as supervised and unsupervised classification, in order to maximize the correctness of the classification. Future work is related to the implementation of different domain-oriented ontologies. |
---|---|
ISSN: | 1453-1305 1842-8088 |
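
The summary's idea of choosing a sense by aggregating graph-based similarities can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the article: the toy sense graph and its node names are hypothetical rather than real WordNet data, similarity is taken as 1 / (1 + shortest path length), and aggregation is a plain average over the context words. Context words missing from the toy inventory are skipped, where the article instead uses a co-occurrence-based measure.

```python
from collections import deque

# Hypothetical toy sense graph (undirected edges); NOT real WordNet data.
EDGES = [
    ("bank.finance", "financial_institution"),
    ("financial_institution", "institution"),
    ("institution", "entity"),
    ("deposit.money", "financial_institution"),
    ("money.currency", "deposit.money"),
    ("bank.river", "slope"),
    ("slope", "landform"),
    ("landform", "entity"),
    ("river.stream", "landform"),
]

# Hypothetical sense inventory: word -> candidate sense nodes.
SENSES = {
    "bank": ["bank.finance", "bank.river"],
    "money": ["money.currency"],
    "deposit": ["deposit.money"],
    "river": ["river.stream"],
}


def build_graph(edges):
    """Build an adjacency map from a list of undirected edges."""
    graph = {}
    for a, b in edges:
        graph.setdefault(a, set()).add(b)
        graph.setdefault(b, set()).add(a)
    return graph


def shortest_path_length(graph, start, goal):
    """Breadth-first search; returns the edge count or None if unreachable."""
    if start == goal:
        return 0
    seen = {start}
    queue = deque([(start, 0)])
    while queue:
        node, dist = queue.popleft()
        for nxt in graph.get(node, ()):
            if nxt == goal:
                return dist + 1
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, dist + 1))
    return None


def path_similarity(graph, a, b):
    """Assumed measure: 1 / (1 + path length); 0.0 when no path exists."""
    dist = shortest_path_length(graph, a, b)
    return 0.0 if dist is None else 1.0 / (1.0 + dist)


def disambiguate(graph, target, context):
    """Pick the sense of `target` with the highest average similarity
    to the closest sense of each known context word."""
    known = [w for w in context if w in SENSES and w != target]

    def score(sense):
        if not known:
            return 0.0
        best = [max(path_similarity(graph, sense, s) for s in SENSES[w])
                for w in known]
        return sum(best) / len(best)

    # Ties (including an empty context) resolve to the first listed sense.
    return max(SENSES[target], key=score)


GRAPH = build_graph(EDGES)
print(disambiguate(GRAPH, "bank", ["money", "deposit"]))
print(disambiguate(GRAPH, "bank", ["river"]))
```

With the real WordNet graph, the same structure would apply, with the toy edges replaced by WordNet relations and the averaging step replaced by whichever aggregation model is chosen.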