Heterogeneous data fusion considering spatial correlations using graph convolutional networks and its application in air quality prediction

Models that predict the future state of certain observations are commonly developed by utilizing heterogeneous data. Most traditional prediction models tend to ignore inconsistencies and imperfections in heterogeneous data, and they are also limited in their ability to consider spatial correlations...

Full description

Bibliographic Details
Main Authors: Cuomo, S. (Author), Ma, Z. (Author), Mei, G. (Author), Piccialli, F. (Author)
Format: Article
Language:English
Published: King Saud bin Abdulaziz University 2022
Subjects:
Online Access:View Fulltext in Publisher
LEADER 02353nam a2200229Ia 4500
001 10.1016-j.jksuci.2022.04.003
008 220706s2022 CNT 000 0 und d
020 |a 13191578 (ISSN) 
245 1 0 |a Heterogeneous data fusion considering spatial correlations using graph convolutional networks and its application in air quality prediction 
260 0 |b King Saud bin Abdulaziz University  |c 2022 
856 |z View Fulltext in Publisher  |u https://doi.org/10.1016/j.jksuci.2022.04.003 
520 3 |a Models that predict the future state of certain observations are commonly developed by utilizing heterogeneous data. Most traditional prediction models tend to ignore inconsistencies and imperfections in heterogeneous data, and they are also limited in their ability to consider spatial correlations among monitoring points, as well as predict for the entire study area simultaneously. As a solution, this paper proposes a deep learning method to fuse heterogeneous data collected from multiple monitoring points by exploiting graph convolutional networks (GCNs), thus perform prediction for certain observations. The effectiveness of this approach was evaluated by applying it to a prediction scenario for air quality. As the fundamental idea behind the proposed method, (1) the collected heterogeneous data is fused according to the coordinates of the monitoring points considering its spatial correlations, and (2) the prediction considers global information as opposed to local information. According to an air quality prediction scenario, (1) the fused data obtained by the RBF-based fusion method are reasonable and reliable; (2) the fusion significantly increases predictive model performance; and (3) the STGCN model enhanced by the fusion achieves the highest performance. This approach can be similarly applied in scenarios involving continuous heterogeneous data collected from scattered multiple monitoring points in the study area. © 2022 The Author(s) 
650 0 4 |a Data fusion 
650 0 4 |a Deep learning 
650 0 4 |a Graph convolutional networks (GCN) 
650 0 4 |a Heterogeneous data sources 
650 0 4 |a Radial basis functions (RBF) 
700 1 |a Cuomo, S.  |e author 
700 1 |a Ma, Z.  |e author 
700 1 |a Mei, G.  |e author 
700 1 |a Piccialli, F.  |e author 
773 |t Journal of King Saud University - Computer and Information Sciences