An Efficient Top-k Query Scheme Based on Multilayer Grouping

The top-k query is to find the k data that has the highest scores from a candidate dataset. Sorting is a common method to find out top-k results. However, most of existing methods are not efficient enough. To remove this issue, we propose an efficient top-k query scheme based on multilayer grouping....

Full description

Bibliographic Details
Main Authors: Zongmin Cui, Yu Gao, Caixue Zhou, Guangyong Gao, Zhuolin Mei, Zongda Wu
Format: Article
Language:English
Published: Faculty of Mechanical Engineering in Slavonski Brod, Faculty of Electrical Engineering in Osijek, Faculty of Civil Engineering in Osijek 2019-01-01
Series:Tehnički Vjesnik
Subjects:
Online Access:https://hrcak.srce.hr/file/329359
id doaj-581ccf9caf5444d89c9b484880ed0f58
record_format Article
spelling doaj-581ccf9caf5444d89c9b484880ed0f582020-11-25T01:05:58ZengFaculty of Mechanical Engineering in Slavonski Brod, Faculty of Electrical Engineering in Osijek, Faculty of Civil Engineering in Osijek Tehnički Vjesnik1330-36511848-63392019-01-0126513391345An Efficient Top-k Query Scheme Based on Multilayer GroupingZongmin Cui0Yu Gao1Caixue Zhou2Guangyong Gao3Zhuolin Mei4Zongda Wu5School of Information Science and Technology, Jiujiang University, No. 551, Qianjin East Road, Jiujiang, Jiangxi 332005, ChinaSchool of Information Science and Technology, Jiujiang University, No. 551, Qianjin East Road, Jiujiang, Jiangxi 332005, ChinaSchool of Information Science and Technology, Jiujiang University, No. 551, Qianjin East Road, Jiujiang, Jiangxi 332005, ChinaSchool of Computer and Software, Nanjing University of Information Science and Technology, No. 219, Ningliu Road, Nanjing, Jiangsu 210000, China / School of Information Science and Technology, Jiujiang University, No. 551, Qianjin East Road, Jiujiang, JiaSchool of Information Science and Technology, Jiujiang University, No. 551, Qianjin East Road, Jiujiang, Jiangxi 332005, ChinaOujiang College, Wenzhou University, Wenzhou 325035, Zhejiang, ChinaThe top-k query is to find the k data that has the highest scores from a candidate dataset. Sorting is a common method to find out top-k results. However, most of existing methods are not efficient enough. To remove this issue, we propose an efficient top-k query scheme based on multilayer grouping. First, we find the reference item by computing the average score of the candidate dataset. Second, we group the candidate dataset into three datasets: winner set, middle set and loser set based on the reference item. Third, we further group the winner set to the second-layer three datasets according to k value. And so on, until the data number of winner set is close to k value. Meanwhile, if k value is larger than the data number of winner set, we directly return the winner set to the user as a part of top-k results almost without sorting. In this case, we also return the top results with the highest scores from the middle set almost without sorting. Based on above innovations, we almost minimize the sorting. Experimental results show that our scheme significantly outperforms the current classical method on the performance of memory consumption and top-k query.https://hrcak.srce.hr/file/329359almost minimizing sortingmultilayer groupingtop-k query
collection DOAJ
language English
format Article
sources DOAJ
author Zongmin Cui
Yu Gao
Caixue Zhou
Guangyong Gao
Zhuolin Mei
Zongda Wu
spellingShingle Zongmin Cui
Yu Gao
Caixue Zhou
Guangyong Gao
Zhuolin Mei
Zongda Wu
An Efficient Top-k Query Scheme Based on Multilayer Grouping
Tehnički Vjesnik
almost minimizing sorting
multilayer grouping
top-k query
author_facet Zongmin Cui
Yu Gao
Caixue Zhou
Guangyong Gao
Zhuolin Mei
Zongda Wu
author_sort Zongmin Cui
title An Efficient Top-k Query Scheme Based on Multilayer Grouping
title_short An Efficient Top-k Query Scheme Based on Multilayer Grouping
title_full An Efficient Top-k Query Scheme Based on Multilayer Grouping
title_fullStr An Efficient Top-k Query Scheme Based on Multilayer Grouping
title_full_unstemmed An Efficient Top-k Query Scheme Based on Multilayer Grouping
title_sort efficient top-k query scheme based on multilayer grouping
publisher Faculty of Mechanical Engineering in Slavonski Brod, Faculty of Electrical Engineering in Osijek, Faculty of Civil Engineering in Osijek
series Tehnički Vjesnik
issn 1330-3651
1848-6339
publishDate 2019-01-01
description The top-k query is to find the k data that has the highest scores from a candidate dataset. Sorting is a common method to find out top-k results. However, most of existing methods are not efficient enough. To remove this issue, we propose an efficient top-k query scheme based on multilayer grouping. First, we find the reference item by computing the average score of the candidate dataset. Second, we group the candidate dataset into three datasets: winner set, middle set and loser set based on the reference item. Third, we further group the winner set to the second-layer three datasets according to k value. And so on, until the data number of winner set is close to k value. Meanwhile, if k value is larger than the data number of winner set, we directly return the winner set to the user as a part of top-k results almost without sorting. In this case, we also return the top results with the highest scores from the middle set almost without sorting. Based on above innovations, we almost minimize the sorting. Experimental results show that our scheme significantly outperforms the current classical method on the performance of memory consumption and top-k query.
topic almost minimizing sorting
multilayer grouping
top-k query
url https://hrcak.srce.hr/file/329359
work_keys_str_mv AT zongmincui anefficienttopkqueryschemebasedonmultilayergrouping
AT yugao anefficienttopkqueryschemebasedonmultilayergrouping
AT caixuezhou anefficienttopkqueryschemebasedonmultilayergrouping
AT guangyonggao anefficienttopkqueryschemebasedonmultilayergrouping
AT zhuolinmei anefficienttopkqueryschemebasedonmultilayergrouping
AT zongdawu anefficienttopkqueryschemebasedonmultilayergrouping
AT zongmincui efficienttopkqueryschemebasedonmultilayergrouping
AT yugao efficienttopkqueryschemebasedonmultilayergrouping
AT caixuezhou efficienttopkqueryschemebasedonmultilayergrouping
AT guangyonggao efficienttopkqueryschemebasedonmultilayergrouping
AT zhuolinmei efficienttopkqueryschemebasedonmultilayergrouping
AT zongdawu efficienttopkqueryschemebasedonmultilayergrouping
_version_ 1725192099865821184