A probabilistic approach to diversified query recommendation

The effectiveness of keyword-based search engines depends largely on the ability of a user to formulate proper queries that are both expressive and selective. However, web search queries issued by casual users are often short and with limited expressiveness. Query recommendation is a popular techni...

Full description

Bibliographic Details
Main Authors: Li, Ruirui., 李锐瑞.
Other Authors: Kao, CM
Language:English
Published: The University of Hong Kong (Pokfulam, Hong Kong) 2013
Subjects:
Online Access:http://hdl.handle.net/10722/181541
id ndltd-HKU-oai-hub.hku.hk-10722-181541
record_format oai_dc
spelling ndltd-HKU-oai-hub.hku.hk-10722-1815412015-07-29T04:02:04Z A probabilistic approach to diversified query recommendation Li, Ruirui. 李锐瑞. Kao, CM Querying (Computer science) Probabilities. The effectiveness of keyword-based search engines depends largely on the ability of a user to formulate proper queries that are both expressive and selective. However, web search queries issued by casual users are often short and with limited expressiveness. Query recommendation is a popular technique employed by search engines to help users refine their queries. Traditional similarity-based methods, however, often result in redundant and monotonic recommendations. We identify five basic requirements of a query recommendation system, namely relevancy, redundancy-free, diversity, ranking and efficiency. In particular, we focus on the requirements of redundancy-free and diversified recommendations. We propose the DQR framework, which mines a search log to achieve two goals: (1) It clusters search log queries to extract query concepts, based on which recommended queries are selected. Through query construction from the query concepts, we are able to avoid recommendation redundancy. (2) It employs a probabilistic model and a greedy heuristic algorithm to achieve recommendation diversification. Through a comprehensive user study we compare DQR against five other recommendation methods on real search log datasets. Our experiment shows that DQR outperforms the other methods in terms of relevancy, diversity, and ranking performance of the recommendations. At the same time, DQR also achieves high efficiency performance. published_or_final_version Computer Science Master Master of Philosophy 2013-03-03T03:21:12Z 2013-03-03T03:21:12Z 2013 2012 PG_Thesis 10.5353/th_b4979975 b4979975 http://hdl.handle.net/10722/181541 eng HKU Theses Online (HKUTO) The author retains all proprietary rights, (such as patent rights) and the right to use in future works. Creative Commons: Attribution 3.0 Hong Kong License The University of Hong Kong (Pokfulam, Hong Kong) http://hub.hku.hk/bib/B49799757
collection NDLTD
language English
sources NDLTD
topic Querying (Computer science)
Probabilities.
spellingShingle Querying (Computer science)
Probabilities.
Li, Ruirui.
李锐瑞.
A probabilistic approach to diversified query recommendation
description The effectiveness of keyword-based search engines depends largely on the ability of a user to formulate proper queries that are both expressive and selective. However, web search queries issued by casual users are often short and with limited expressiveness. Query recommendation is a popular technique employed by search engines to help users refine their queries. Traditional similarity-based methods, however, often result in redundant and monotonic recommendations. We identify five basic requirements of a query recommendation system, namely relevancy, redundancy-free, diversity, ranking and efficiency. In particular, we focus on the requirements of redundancy-free and diversified recommendations. We propose the DQR framework, which mines a search log to achieve two goals: (1) It clusters search log queries to extract query concepts, based on which recommended queries are selected. Through query construction from the query concepts, we are able to avoid recommendation redundancy. (2) It employs a probabilistic model and a greedy heuristic algorithm to achieve recommendation diversification. Through a comprehensive user study we compare DQR against five other recommendation methods on real search log datasets. Our experiment shows that DQR outperforms the other methods in terms of relevancy, diversity, and ranking performance of the recommendations. At the same time, DQR also achieves high efficiency performance. === published_or_final_version === Computer Science === Master === Master of Philosophy
author2 Kao, CM
author_facet Kao, CM
Li, Ruirui.
李锐瑞.
author Li, Ruirui.
李锐瑞.
author_sort Li, Ruirui.
title A probabilistic approach to diversified query recommendation
title_short A probabilistic approach to diversified query recommendation
title_full A probabilistic approach to diversified query recommendation
title_fullStr A probabilistic approach to diversified query recommendation
title_full_unstemmed A probabilistic approach to diversified query recommendation
title_sort probabilistic approach to diversified query recommendation
publisher The University of Hong Kong (Pokfulam, Hong Kong)
publishDate 2013
url http://hdl.handle.net/10722/181541
work_keys_str_mv AT liruirui aprobabilisticapproachtodiversifiedqueryrecommendation
AT lǐruìruì aprobabilisticapproachtodiversifiedqueryrecommendation
AT liruirui probabilisticapproachtodiversifiedqueryrecommendation
AT lǐruìruì probabilisticapproachtodiversifiedqueryrecommendation
_version_ 1716813683034685440