CLASSIFICATION ACCURACY ASSESSMENT FOR REGIONAL VECTOR DATA PRODUCT BASED ON SPATIAL SAMPLING: A CASE STUDY OF JAPAN

Spatial vector data is a kind of data that represents real spatial information through points, lines and polygons. Spatial data quality is one of the basic theoretical research in geographic information science. Accurate and reliable data quality assessment is very important for its theoretical sign...

Full description

Bibliographic Details
Published in:The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
Main Authors: Y. Lu, J. Zhang, X. Tong, W. Han, H. Zhao
Format: Article
Language:English
Published: Copernicus Publications 2019-06-01
Online Access:https://www.int-arch-photogramm-remote-sens-spatial-inf-sci.net/XLII-2-W13/1243/2019/isprs-archives-XLII-2-W13-1243-2019.pdf
_version_ 1857021312499712000
author Y. Lu
Y. Lu
J. Zhang
X. Tong
W. Han
H. Zhao
author_facet Y. Lu
Y. Lu
J. Zhang
X. Tong
W. Han
H. Zhao
author_sort Y. Lu
collection DOAJ
container_title The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
description Spatial vector data is a kind of data that represents real spatial information through points, lines and polygons. Spatial data quality is one of the basic theoretical research in geographic information science. Accurate and reliable data quality assessment is very important for its theoretical significance and practical value. This paper proposes an improved method for the traditional classification accuracy evaluation of spatial vector data: (1) Quantitative estimation of sample size. According to the statistical principle of probability theory, the overall quantity is estimated by controlling the sampling error and the acceptance quality level. The sample quality is the unbiased estimate of the overall quality. (2) Stratification strategy: the overall objects are divided into three layers according to the three basic geometric structures -- points, lines and polygons. The difference within the layer is small and the difference between layers is large, which conforms to the basic principle of stratification. Then, the proportion of the total number of elements in each layer is taken as the weight to distribute layer by layer, and the sample size of each layer is obtained. (3) Allocation of samples. The spatial property of spatial sampling is mainly reflected in the allocation of samples. Considering the spatial correlation of elements in same layer, Local Moran's I index was used to calculate the correlation degree of a certain attribute between each spatial element and its neighbouring elements. After cluster analysis of elements in each layer, samples were screened by setting a reasonable threshold value. (4) Sample inspection. Each sample was examined against reference information, including images and data. The classification of each sample is judged by the principle of majority judgment. (5) Classification accuracy assessment. The classification accuracy information of samples was obtained by making the confusion matrix of the classification result of samples and the real results. The classification accuracy of experimental data is evaluated according to Kappa index. A case study of Global Core Vector Data of Japan shows the improved method in this paper and process of classification accuracy assessment for regional spatial vector data product. Global Core Vector Data are organized according to the country or region, including three categories of transportation, river system, place names, which are divided into 8 middle categories and 52 small categories. In this paper, 1405 samples of Global Core Vector Data in the experimental area of Japan are selected by spatial stratified sampling in 3 strata. The experimental results show that the proposed improved method is applicable to classification accuracy assessment of regional spatial vector data product and overcomes the disadvantages of type-based spatial stratified sampling that relies on the classification information of all elements. The Kappa coefficient is 0.831, which reflects the result of classification accuracy assessment in the experimental area is good. The proposed improved method provides a reference for the method of classification accuracy assessment classification of following global spatial vector data product.
format Article
id doaj-art-0b85fa4fca824f93a45fbfe0bb452f9d
institution Directory of Open Access Journals
issn 1682-1750
2194-9034
language English
publishDate 2019-06-01
publisher Copernicus Publications
record_format Article
spelling doaj-art-0b85fa4fca824f93a45fbfe0bb452f9d2025-08-19T19:43:27ZengCopernicus PublicationsThe International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences1682-17502194-90342019-06-01XLII-2-W131243124710.5194/isprs-archives-XLII-2-W13-1243-2019CLASSIFICATION ACCURACY ASSESSMENT FOR REGIONAL VECTOR DATA PRODUCT BASED ON SPATIAL SAMPLING: A CASE STUDY OF JAPANY. Lu0Y. Lu1J. Zhang2X. Tong3W. Han4H. Zhao5School of Surveying and Geo-Informatics, Tongji University, Shanghai, 200092, ChinaNational Quality Inspection and Testing Center For Surveying and Mapping Products, Beijing, 100830, ChinaNational Quality Inspection and Testing Center For Surveying and Mapping Products, Beijing, 100830, ChinaSchool of Surveying and Geo-Informatics, Tongji University, Shanghai, 200092, ChinaNational Quality Inspection and Testing Center For Surveying and Mapping Products, Beijing, 100830, ChinaNational Quality Inspection and Testing Center For Surveying and Mapping Products, Beijing, 100830, ChinaSpatial vector data is a kind of data that represents real spatial information through points, lines and polygons. Spatial data quality is one of the basic theoretical research in geographic information science. Accurate and reliable data quality assessment is very important for its theoretical significance and practical value. This paper proposes an improved method for the traditional classification accuracy evaluation of spatial vector data: (1) Quantitative estimation of sample size. According to the statistical principle of probability theory, the overall quantity is estimated by controlling the sampling error and the acceptance quality level. The sample quality is the unbiased estimate of the overall quality. (2) Stratification strategy: the overall objects are divided into three layers according to the three basic geometric structures -- points, lines and polygons. The difference within the layer is small and the difference between layers is large, which conforms to the basic principle of stratification. Then, the proportion of the total number of elements in each layer is taken as the weight to distribute layer by layer, and the sample size of each layer is obtained. (3) Allocation of samples. The spatial property of spatial sampling is mainly reflected in the allocation of samples. Considering the spatial correlation of elements in same layer, Local Moran's I index was used to calculate the correlation degree of a certain attribute between each spatial element and its neighbouring elements. After cluster analysis of elements in each layer, samples were screened by setting a reasonable threshold value. (4) Sample inspection. Each sample was examined against reference information, including images and data. The classification of each sample is judged by the principle of majority judgment. (5) Classification accuracy assessment. The classification accuracy information of samples was obtained by making the confusion matrix of the classification result of samples and the real results. The classification accuracy of experimental data is evaluated according to Kappa index. A case study of Global Core Vector Data of Japan shows the improved method in this paper and process of classification accuracy assessment for regional spatial vector data product. Global Core Vector Data are organized according to the country or region, including three categories of transportation, river system, place names, which are divided into 8 middle categories and 52 small categories. In this paper, 1405 samples of Global Core Vector Data in the experimental area of Japan are selected by spatial stratified sampling in 3 strata. The experimental results show that the proposed improved method is applicable to classification accuracy assessment of regional spatial vector data product and overcomes the disadvantages of type-based spatial stratified sampling that relies on the classification information of all elements. The Kappa coefficient is 0.831, which reflects the result of classification accuracy assessment in the experimental area is good. The proposed improved method provides a reference for the method of classification accuracy assessment classification of following global spatial vector data product.https://www.int-arch-photogramm-remote-sens-spatial-inf-sci.net/XLII-2-W13/1243/2019/isprs-archives-XLII-2-W13-1243-2019.pdf
spellingShingle Y. Lu
Y. Lu
J. Zhang
X. Tong
W. Han
H. Zhao
CLASSIFICATION ACCURACY ASSESSMENT FOR REGIONAL VECTOR DATA PRODUCT BASED ON SPATIAL SAMPLING: A CASE STUDY OF JAPAN
title CLASSIFICATION ACCURACY ASSESSMENT FOR REGIONAL VECTOR DATA PRODUCT BASED ON SPATIAL SAMPLING: A CASE STUDY OF JAPAN
title_full CLASSIFICATION ACCURACY ASSESSMENT FOR REGIONAL VECTOR DATA PRODUCT BASED ON SPATIAL SAMPLING: A CASE STUDY OF JAPAN
title_fullStr CLASSIFICATION ACCURACY ASSESSMENT FOR REGIONAL VECTOR DATA PRODUCT BASED ON SPATIAL SAMPLING: A CASE STUDY OF JAPAN
title_full_unstemmed CLASSIFICATION ACCURACY ASSESSMENT FOR REGIONAL VECTOR DATA PRODUCT BASED ON SPATIAL SAMPLING: A CASE STUDY OF JAPAN
title_short CLASSIFICATION ACCURACY ASSESSMENT FOR REGIONAL VECTOR DATA PRODUCT BASED ON SPATIAL SAMPLING: A CASE STUDY OF JAPAN
title_sort classification accuracy assessment for regional vector data product based on spatial sampling a case study of japan
url https://www.int-arch-photogramm-remote-sens-spatial-inf-sci.net/XLII-2-W13/1243/2019/isprs-archives-XLII-2-W13-1243-2019.pdf
work_keys_str_mv AT ylu classificationaccuracyassessmentforregionalvectordataproductbasedonspatialsamplingacasestudyofjapan
AT ylu classificationaccuracyassessmentforregionalvectordataproductbasedonspatialsamplingacasestudyofjapan
AT jzhang classificationaccuracyassessmentforregionalvectordataproductbasedonspatialsamplingacasestudyofjapan
AT xtong classificationaccuracyassessmentforregionalvectordataproductbasedonspatialsamplingacasestudyofjapan
AT whan classificationaccuracyassessmentforregionalvectordataproductbasedonspatialsamplingacasestudyofjapan
AT hzhao classificationaccuracyassessmentforregionalvectordataproductbasedonspatialsamplingacasestudyofjapan