Distributed Processing of Location-Based Aggregate Queries Using MapReduce

The <i>location-based aggregate queries</i>, consisting of the <i>shortest average distance query</i> (<i>SAvgDQ</i>), the <i>shortest minimal distance query</i> (<i>SMinDQ</i>), the <i>shortest maximal distance query</i> (<i...

Full description

Bibliographic Details
Main Author: Yuan-Ko Huang
Format: Article
Language:English
Published: MDPI AG 2019-08-01
Series:ISPRS International Journal of Geo-Information
Subjects:
Online Access:https://www.mdpi.com/2220-9964/8/9/370
id doaj-307973235865470a821be870b8927866
record_format Article
spelling doaj-307973235865470a821be870b89278662020-11-24T21:50:01ZengMDPI AGISPRS International Journal of Geo-Information2220-99642019-08-018937010.3390/ijgi8090370ijgi8090370Distributed Processing of Location-Based Aggregate Queries Using MapReduceYuan-Ko Huang0Department of Maritime Information and Technology, National Kaohsiung University of Science and Technology, 80543 Kaohsiung City, TaiwanThe <i>location-based aggregate queries</i>, consisting of the <i>shortest average distance query</i> (<i>SAvgDQ</i>), the <i>shortest minimal distance query</i> (<i>SMinDQ</i>), the <i>shortest maximal distance query</i> (<i>SMaxDQ</i>), and the <i>shortest sum distance query</i> (<i>SSumDQ</i>) are new types of location-based queries. Such queries can be used to provide the user with useful object information by considering both the spatial closeness of objects to the query object and the neighboring relationship between objects. Due to a large amount of location-based aggregate queries that need to be evaluated concurrently, the centralized processing system would suffer a heavy query load, leading eventually to poor performance. As a result, in this paper, we focus on developing the distributed processing technique to answer multiple location-based aggregate queries, based on the <i>MapReduce</i> platform. We first design a grid structure to manage information of objects by taking into account the storage balance, and then develop a distributed processing algorithm, namely the <i>MapReduce-based aggregate query algorithm</i> (<i>MRAggQ algorithm</i>), to efficiently process the location-based aggregate queries in a distributed manner. Extensive experiments using synthetic and real datasets are conducted to demonstrate the scalability and the efficiency of the proposed processing algorithm.https://www.mdpi.com/2220-9964/8/9/370location-based aggregate queriesdistributed processing techniqueMapReducegrid structureMapReduce-based aggregate query algorithm
collection DOAJ
language English
format Article
sources DOAJ
author Yuan-Ko Huang
spellingShingle Yuan-Ko Huang
Distributed Processing of Location-Based Aggregate Queries Using MapReduce
ISPRS International Journal of Geo-Information
location-based aggregate queries
distributed processing technique
MapReduce
grid structure
MapReduce-based aggregate query algorithm
author_facet Yuan-Ko Huang
author_sort Yuan-Ko Huang
title Distributed Processing of Location-Based Aggregate Queries Using MapReduce
title_short Distributed Processing of Location-Based Aggregate Queries Using MapReduce
title_full Distributed Processing of Location-Based Aggregate Queries Using MapReduce
title_fullStr Distributed Processing of Location-Based Aggregate Queries Using MapReduce
title_full_unstemmed Distributed Processing of Location-Based Aggregate Queries Using MapReduce
title_sort distributed processing of location-based aggregate queries using mapreduce
publisher MDPI AG
series ISPRS International Journal of Geo-Information
issn 2220-9964
publishDate 2019-08-01
description The <i>location-based aggregate queries</i>, consisting of the <i>shortest average distance query</i> (<i>SAvgDQ</i>), the <i>shortest minimal distance query</i> (<i>SMinDQ</i>), the <i>shortest maximal distance query</i> (<i>SMaxDQ</i>), and the <i>shortest sum distance query</i> (<i>SSumDQ</i>) are new types of location-based queries. Such queries can be used to provide the user with useful object information by considering both the spatial closeness of objects to the query object and the neighboring relationship between objects. Due to a large amount of location-based aggregate queries that need to be evaluated concurrently, the centralized processing system would suffer a heavy query load, leading eventually to poor performance. As a result, in this paper, we focus on developing the distributed processing technique to answer multiple location-based aggregate queries, based on the <i>MapReduce</i> platform. We first design a grid structure to manage information of objects by taking into account the storage balance, and then develop a distributed processing algorithm, namely the <i>MapReduce-based aggregate query algorithm</i> (<i>MRAggQ algorithm</i>), to efficiently process the location-based aggregate queries in a distributed manner. Extensive experiments using synthetic and real datasets are conducted to demonstrate the scalability and the efficiency of the proposed processing algorithm.
topic location-based aggregate queries
distributed processing technique
MapReduce
grid structure
MapReduce-based aggregate query algorithm
url https://www.mdpi.com/2220-9964/8/9/370
work_keys_str_mv AT yuankohuang distributedprocessingoflocationbasedaggregatequeriesusingmapreduce
_version_ 1725885807693135872