CPRQ: Cost Prediction for Range Queries in Moving Object Databases

Predicting query cost plays an important role in moving object databases. Accurate predictions help database administrators effectively schedule workloads and achieve optimal resource allocation strategies. There are some works focusing on query cost prediction, but most of them employ analytical me...

Full description

Bibliographic Details
Main Authors: Shengnan Guo, Jianqiu Xu
Format: Article
Language:English
Published: MDPI AG 2021-07-01
Series:ISPRS International Journal of Geo-Information
Subjects:
Online Access:https://www.mdpi.com/2220-9964/10/7/468
id doaj-54ce8691d4f84022aca2c0f6ff49cf8f
record_format Article
spelling doaj-54ce8691d4f84022aca2c0f6ff49cf8f2021-07-23T13:45:01ZengMDPI AGISPRS International Journal of Geo-Information2220-99642021-07-011046846810.3390/ijgi10070468CPRQ: Cost Prediction for Range Queries in Moving Object DatabasesShengnan Guo0Jianqiu Xu1College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, ChinaCollege of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, ChinaPredicting query cost plays an important role in moving object databases. Accurate predictions help database administrators effectively schedule workloads and achieve optimal resource allocation strategies. There are some works focusing on query cost prediction, but most of them employ analytical methods to obtain an index-based cost prediction model. The accuracy can be seriously challenged as the workload of the database management system becomes more and more complex. Differing from the previous work, this paper proposes a method called CPRQ (Cost Prediction of Range Query) which is based on machine-learning techniques. The proposed method contains four learning models: the polynomial regression model, the decision tree regression model, the random forest regression model, and the KNN (k-Nearest Neighbor) regression model. Using R-squared and MSE (Mean Squared Error) as measurements, we perform an extensive experimental evaluation. The results demonstrate that CPRQ achieves high accuracy and the random forest regression model obtains the best predictive performance (R-squared is 0.9695 and MSE is 0.154).https://www.mdpi.com/2220-9964/10/7/468cost predictionrange querymoving object databasemachine learning
collection DOAJ
language English
format Article
sources DOAJ
author Shengnan Guo
Jianqiu Xu
spellingShingle Shengnan Guo
Jianqiu Xu
CPRQ: Cost Prediction for Range Queries in Moving Object Databases
ISPRS International Journal of Geo-Information
cost prediction
range query
moving object database
machine learning
author_facet Shengnan Guo
Jianqiu Xu
author_sort Shengnan Guo
title CPRQ: Cost Prediction for Range Queries in Moving Object Databases
title_short CPRQ: Cost Prediction for Range Queries in Moving Object Databases
title_full CPRQ: Cost Prediction for Range Queries in Moving Object Databases
title_fullStr CPRQ: Cost Prediction for Range Queries in Moving Object Databases
title_full_unstemmed CPRQ: Cost Prediction for Range Queries in Moving Object Databases
title_sort cprq: cost prediction for range queries in moving object databases
publisher MDPI AG
series ISPRS International Journal of Geo-Information
issn 2220-9964
publishDate 2021-07-01
description Predicting query cost plays an important role in moving object databases. Accurate predictions help database administrators effectively schedule workloads and achieve optimal resource allocation strategies. There are some works focusing on query cost prediction, but most of them employ analytical methods to obtain an index-based cost prediction model. The accuracy can be seriously challenged as the workload of the database management system becomes more and more complex. Differing from the previous work, this paper proposes a method called CPRQ (Cost Prediction of Range Query) which is based on machine-learning techniques. The proposed method contains four learning models: the polynomial regression model, the decision tree regression model, the random forest regression model, and the KNN (k-Nearest Neighbor) regression model. Using R-squared and MSE (Mean Squared Error) as measurements, we perform an extensive experimental evaluation. The results demonstrate that CPRQ achieves high accuracy and the random forest regression model obtains the best predictive performance (R-squared is 0.9695 and MSE is 0.154).
topic cost prediction
range query
moving object database
machine learning
url https://www.mdpi.com/2220-9964/10/7/468
work_keys_str_mv AT shengnanguo cprqcostpredictionforrangequeriesinmovingobjectdatabases
AT jianqiuxu cprqcostpredictionforrangequeriesinmovingobjectdatabases
_version_ 1721288008501559296