Tag Generalization For Facet-Based Search

Bibliographic Details
Main Author: Niewiarowski, Tomasz
Language: en_US
Published: 2013
Online Access: http://hdl.handle.net/10222/36235
id ndltd-LACETR-oai-collectionscanada.gc.ca-NSHD.ca#10222-36235
record_format oai_dc
collection NDLTD
language en_US
sources NDLTD
description In this project we address the over-specification of tags, a common problem in modern tag-based document management systems. In such systems, tags are essential for document retrieval, and retrieval accuracy depends mainly on the “human factor”, i.e., the quality of the tags that users assign. When tagging, users tend to pick only very specific tags that describe the content of a resource and to omit the general concepts that the resource represents. To deal with this problem, we propose an automatic tag generalization algorithm that assigns general tags to newly tagged resources. The objective of the algorithm is to provide a layer of tags consisting of general concepts and to provide good support for the system user. The method automatically tags resources with tags that are similar to, but more general than, the user-assigned tags. It is unsupervised and domain independent, and it consists of three stages: (1) the disambiguation and concept mapping stage maps specific tags to Wikipedia articles representing the same concept; (2) the link-based tag generalization stage finds similar and more general articles using the Wikipedia link structure; (3) the concept unification stage assigns tags based on the list of general articles. Evaluation on four real-life tag data sets demonstrates that the proposed method is domain independent and outperforms supervised tag recommendation systems for practical training data set sizes.
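The three stages named in the abstract can be illustrated with a minimal Python sketch. It is not the author's implementation: the tag-to-article table, the tiny link graph, the shared-link counting rule, and every function and parameter name below are hypothetical stand-ins chosen for illustration. In particular, a dictionary lookup replaces real disambiguation, and "linked from several tagged articles" replaces whatever link-based similarity measure the thesis actually uses.

```python
from collections import Counter

# Hypothetical toy data (assumption): a tag-to-article table and a small
# link graph standing in for Wikipedia's article link structure.
TAG_TO_ARTICLE = {
    "python": "Python (programming language)",
    "numpy": "NumPy",
    "pandas": "Pandas (software)",
}
LINKS = {
    "Python (programming language)": ["Programming language", "Computer science"],
    "NumPy": ["Python (programming language)", "Programming language", "Mathematics"],
    "Pandas (software)": ["Python (programming language)", "Programming language", "Statistics"],
}


def map_tags(tags):
    """Stage 1 (simplified): map each tag to an article title.

    A plain dictionary lookup stands in for disambiguation; unknown tags are dropped.
    """
    return [TAG_TO_ARTICLE[t.lower()] for t in tags if t.lower() in TAG_TO_ARTICLE]


def generalize(articles):
    """Stage 2 (simplified): count how many mapped articles link to each candidate."""
    counts = Counter()
    for article in articles:
        for target in set(LINKS.get(article, [])):
            if target not in articles:  # skip articles that are already tags
                counts[target] += 1
    return counts


def unify(counts, min_support=2, limit=3):
    """Stage 3 (simplified): keep well-supported candidates and emit them as tags."""
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [title.lower() for title, n in ranked if n >= min_support][:limit]


def general_tags(tags):
    """Run the three simplified stages in order."""
    return unify(generalize(map_tags(tags)))


print(general_tags(["NumPy", "pandas"]))
```

In this toy graph, both specific tags link to the same two broader articles, so those become the general tags; articles linked from only one tag (here "Mathematics" and "Statistics") fall below the assumed `min_support` threshold and are dropped.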
author Niewiarowski, Tomasz
title Tag Generalization For Facet-Based Search
publishDate 2013
url http://hdl.handle.net/10222/36235