Ontology Driven Feature Engineering for Opinion Mining

In the process of knowledge discovery, the reliability of results depends upon the effectiveness of attributes selected for decision. The curse of dimensionality refers to the phenomenon in which the excessive number of dimensions affect the analysis. In order to eradicate the curse of dimensionalit...

Full description

Bibliographic Details
Main Authors: Shafaq Siddiqui, M. Abdul Rehman, Sher Muhammad Doudpota, Ahmad Waqas
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8721082/
Description
Summary:In the process of knowledge discovery, the reliability of results depends upon the effectiveness of attributes selected for decision. The curse of dimensionality refers to the phenomenon in which the excessive number of dimensions affect the analysis. In order to eradicate the curse of dimensionality in text analysis, we are proposing an ontology-based semantic measure for intelligent selection/reduction of features. Among the various text mining techniques, ontology-based mining has a significant contribution to the field. The ontology-based semantic measures, which are mathematical models used to find the similarity between various concepts in the ontology, have made a significant contribution to feature engineering. The proposed measure is an amalgamation of semantic similarity, relatedness, and distance. The measure allows performing an in-depth analysis of various semantic relationships between concepts of the English language. The performance of the measure was evaluated against benchmarked dimension reduction techniques such as PCA. The results show improvement by reducing the size of dimensions up to 35%. The results were further evaluated by training a classifier to validate that the features are not creating any underfit/overfit model.
ISSN:2169-3536