CPRQ: Cost Prediction for Range Queries in Moving Object Databases
Predicting query cost plays an important role in moving object databases. Accurate predictions help database administrators effectively schedule workloads and achieve optimal resource allocation strategies. There are some works focusing on query cost prediction, but most of them employ analytical me...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2021-07-01
|
Series: | ISPRS International Journal of Geo-Information |
Subjects: | |
Online Access: | https://www.mdpi.com/2220-9964/10/7/468 |
id |
doaj-54ce8691d4f84022aca2c0f6ff49cf8f |
---|---|
record_format |
Article |
spelling |
doaj-54ce8691d4f84022aca2c0f6ff49cf8f2021-07-23T13:45:01ZengMDPI AGISPRS International Journal of Geo-Information2220-99642021-07-011046846810.3390/ijgi10070468CPRQ: Cost Prediction for Range Queries in Moving Object DatabasesShengnan Guo0Jianqiu Xu1College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, ChinaCollege of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, ChinaPredicting query cost plays an important role in moving object databases. Accurate predictions help database administrators effectively schedule workloads and achieve optimal resource allocation strategies. There are some works focusing on query cost prediction, but most of them employ analytical methods to obtain an index-based cost prediction model. The accuracy can be seriously challenged as the workload of the database management system becomes more and more complex. Differing from the previous work, this paper proposes a method called CPRQ (Cost Prediction of Range Query) which is based on machine-learning techniques. The proposed method contains four learning models: the polynomial regression model, the decision tree regression model, the random forest regression model, and the KNN (k-Nearest Neighbor) regression model. Using R-squared and MSE (Mean Squared Error) as measurements, we perform an extensive experimental evaluation. The results demonstrate that CPRQ achieves high accuracy and the random forest regression model obtains the best predictive performance (R-squared is 0.9695 and MSE is 0.154).https://www.mdpi.com/2220-9964/10/7/468cost predictionrange querymoving object databasemachine learning |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Shengnan Guo Jianqiu Xu |
spellingShingle |
Shengnan Guo Jianqiu Xu CPRQ: Cost Prediction for Range Queries in Moving Object Databases ISPRS International Journal of Geo-Information cost prediction range query moving object database machine learning |
author_facet |
Shengnan Guo Jianqiu Xu |
author_sort |
Shengnan Guo |
title |
CPRQ: Cost Prediction for Range Queries in Moving Object Databases |
title_short |
CPRQ: Cost Prediction for Range Queries in Moving Object Databases |
title_full |
CPRQ: Cost Prediction for Range Queries in Moving Object Databases |
title_fullStr |
CPRQ: Cost Prediction for Range Queries in Moving Object Databases |
title_full_unstemmed |
CPRQ: Cost Prediction for Range Queries in Moving Object Databases |
title_sort |
cprq: cost prediction for range queries in moving object databases |
publisher |
MDPI AG |
series |
ISPRS International Journal of Geo-Information |
issn |
2220-9964 |
publishDate |
2021-07-01 |
description |
Predicting query cost plays an important role in moving object databases. Accurate predictions help database administrators effectively schedule workloads and achieve optimal resource allocation strategies. There are some works focusing on query cost prediction, but most of them employ analytical methods to obtain an index-based cost prediction model. The accuracy can be seriously challenged as the workload of the database management system becomes more and more complex. Differing from the previous work, this paper proposes a method called CPRQ (Cost Prediction of Range Query) which is based on machine-learning techniques. The proposed method contains four learning models: the polynomial regression model, the decision tree regression model, the random forest regression model, and the KNN (k-Nearest Neighbor) regression model. Using R-squared and MSE (Mean Squared Error) as measurements, we perform an extensive experimental evaluation. The results demonstrate that CPRQ achieves high accuracy and the random forest regression model obtains the best predictive performance (R-squared is 0.9695 and MSE is 0.154). |
topic |
cost prediction range query moving object database machine learning |
url |
https://www.mdpi.com/2220-9964/10/7/468 |
work_keys_str_mv |
AT shengnanguo cprqcostpredictionforrangequeriesinmovingobjectdatabases AT jianqiuxu cprqcostpredictionforrangequeriesinmovingobjectdatabases |
_version_ |
1721288008501559296 |