Predicting a Web Search Engine User''s Future Behavior using Query Log

碩士 === 國立臺灣大學 === 資訊工程學研究所 === 99 === In this study, we devise methods to predict the future search action (i.e., query or clicked URL) sequence of a search engine user. These predictions have the potential application of being integrated into a web search engine to facilitate a search engine user’s...

Full description

Bibliographic Details
Main Authors: Hsin-Yih Lin, 林欣毅
Other Authors: Hsin-Hsi Chen
Format: Others
Language:en_US
Published: 2011
Online Access:http://ndltd.ncl.edu.tw/handle/13452899274208826270
id ndltd-TW-099NTU05392014
record_format oai_dc
spelling ndltd-TW-099NTU053920142015-10-28T04:11:43Z http://ndltd.ncl.edu.tw/handle/13452899274208826270 Predicting a Web Search Engine User''s Future Behavior using Query Log 使用網路搜尋引擎歷史資料預測使用者未來行為 Hsin-Yih Lin 林欣毅 碩士 國立臺灣大學 資訊工程學研究所 99 In this study, we devise methods to predict the future search action (i.e., query or clicked URL) sequence of a search engine user. These predictions have the potential application of being integrated into a web search engine to facilitate a search engine user’s search process. The corpus used is the Microsoft query log dataset containing search sessions in 2006. The corpus is divided into a training dataset and a test dataset for our machine-learning methods. We propose four methods: WTAL, SRPF, SRPP and ACTF. WTAL is based on the concept of the co-occurrence relationship between past queries and clicked URLs. Both SRPF and SRPP incorporate information retrieval methodologies into their algorithms. ACTF is a graph-based method employing PageRank. We further merge several of our methods together. Experimental results show that WTAL has the best individual method performance. However, the combination of the methods together outperforms individual methods. Additional analysis reveals that using every query and URL that a user has already submitted or clicked on in a session does not necessarily produce a better prediction performance than using only the user’s most recent actions. It is also found that it is harder to predict future queries and clicked URLs for sessions with more queries than for sessions with fewer queries. Hsin-Hsi Chen 陳信希 2011 學位論文 ; thesis 67 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立臺灣大學 === 資訊工程學研究所 === 99 === In this study, we devise methods to predict the future search action (i.e., query or clicked URL) sequence of a search engine user. These predictions have the potential application of being integrated into a web search engine to facilitate a search engine user’s search process. The corpus used is the Microsoft query log dataset containing search sessions in 2006. The corpus is divided into a training dataset and a test dataset for our machine-learning methods. We propose four methods: WTAL, SRPF, SRPP and ACTF. WTAL is based on the concept of the co-occurrence relationship between past queries and clicked URLs. Both SRPF and SRPP incorporate information retrieval methodologies into their algorithms. ACTF is a graph-based method employing PageRank. We further merge several of our methods together. Experimental results show that WTAL has the best individual method performance. However, the combination of the methods together outperforms individual methods. Additional analysis reveals that using every query and URL that a user has already submitted or clicked on in a session does not necessarily produce a better prediction performance than using only the user’s most recent actions. It is also found that it is harder to predict future queries and clicked URLs for sessions with more queries than for sessions with fewer queries.
author2 Hsin-Hsi Chen
author_facet Hsin-Hsi Chen
Hsin-Yih Lin
林欣毅
author Hsin-Yih Lin
林欣毅
spellingShingle Hsin-Yih Lin
林欣毅
Predicting a Web Search Engine User''s Future Behavior using Query Log
author_sort Hsin-Yih Lin
title Predicting a Web Search Engine User''s Future Behavior using Query Log
title_short Predicting a Web Search Engine User''s Future Behavior using Query Log
title_full Predicting a Web Search Engine User''s Future Behavior using Query Log
title_fullStr Predicting a Web Search Engine User''s Future Behavior using Query Log
title_full_unstemmed Predicting a Web Search Engine User''s Future Behavior using Query Log
title_sort predicting a web search engine user''s future behavior using query log
publishDate 2011
url http://ndltd.ncl.edu.tw/handle/13452899274208826270
work_keys_str_mv AT hsinyihlin predictingawebsearchengineuseraposapossfuturebehaviorusingquerylog
AT línxīnyì predictingawebsearchengineuseraposapossfuturebehaviorusingquerylog
AT hsinyihlin shǐyòngwǎnglùsōuxúnyǐnqínglìshǐzīliàoyùcèshǐyòngzhěwèiláixíngwèi
AT línxīnyì shǐyòngwǎnglùsōuxúnyǐnqínglìshǐzīliàoyùcèshǐyòngzhěwèiláixíngwèi
_version_ 1718114143469305856