Selection of the Maximum Spatial Cluster Size of the Spatial Scan Statistic by Using the Maximum Clustering Set-Proportion Statistic.

Spatial scan statistics are widely used in various fields. The performance of these statistics is influenced by parameters, such as maximum spatial cluster size, and can be improved by parameter selection using performance measures. Current performance measures are based on the presence of clusters...

Full description

Bibliographic Details
Main Authors: Yue Ma, Fei Yin, Tao Zhang, Xiaohua Andrew Zhou, Xiaosong Li
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2016-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC4731069?pdf=render
id doaj-809d854b178f4d8096411483e25a5470
record_format Article
spelling doaj-809d854b178f4d8096411483e25a54702020-11-24T22:06:49ZengPublic Library of Science (PLoS)PLoS ONE1932-62032016-01-01111e014791810.1371/journal.pone.0147918Selection of the Maximum Spatial Cluster Size of the Spatial Scan Statistic by Using the Maximum Clustering Set-Proportion Statistic.Yue MaFei YinTao ZhangXiaohua Andrew ZhouXiaosong LiSpatial scan statistics are widely used in various fields. The performance of these statistics is influenced by parameters, such as maximum spatial cluster size, and can be improved by parameter selection using performance measures. Current performance measures are based on the presence of clusters and are thus inapplicable to data sets without known clusters. In this work, we propose a novel overall performance measure called maximum clustering set-proportion (MCS-P), which is based on the likelihood of the union of detected clusters and the applied dataset. MCS-P was compared with existing performance measures in a simulation study to select the maximum spatial cluster size. Results of other performance measures, such as sensitivity and misclassification, suggest that the spatial scan statistic achieves accurate results in most scenarios with the maximum spatial cluster sizes selected using MCS-P. Given that previously known clusters are not required in the proposed strategy, selection of the optimal maximum cluster size with MCS-P can improve the performance of the scan statistic in applications without identified clusters.http://europepmc.org/articles/PMC4731069?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Yue Ma
Fei Yin
Tao Zhang
Xiaohua Andrew Zhou
Xiaosong Li
spellingShingle Yue Ma
Fei Yin
Tao Zhang
Xiaohua Andrew Zhou
Xiaosong Li
Selection of the Maximum Spatial Cluster Size of the Spatial Scan Statistic by Using the Maximum Clustering Set-Proportion Statistic.
PLoS ONE
author_facet Yue Ma
Fei Yin
Tao Zhang
Xiaohua Andrew Zhou
Xiaosong Li
author_sort Yue Ma
title Selection of the Maximum Spatial Cluster Size of the Spatial Scan Statistic by Using the Maximum Clustering Set-Proportion Statistic.
title_short Selection of the Maximum Spatial Cluster Size of the Spatial Scan Statistic by Using the Maximum Clustering Set-Proportion Statistic.
title_full Selection of the Maximum Spatial Cluster Size of the Spatial Scan Statistic by Using the Maximum Clustering Set-Proportion Statistic.
title_fullStr Selection of the Maximum Spatial Cluster Size of the Spatial Scan Statistic by Using the Maximum Clustering Set-Proportion Statistic.
title_full_unstemmed Selection of the Maximum Spatial Cluster Size of the Spatial Scan Statistic by Using the Maximum Clustering Set-Proportion Statistic.
title_sort selection of the maximum spatial cluster size of the spatial scan statistic by using the maximum clustering set-proportion statistic.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2016-01-01
description Spatial scan statistics are widely used in various fields. The performance of these statistics is influenced by parameters, such as maximum spatial cluster size, and can be improved by parameter selection using performance measures. Current performance measures are based on the presence of clusters and are thus inapplicable to data sets without known clusters. In this work, we propose a novel overall performance measure called maximum clustering set-proportion (MCS-P), which is based on the likelihood of the union of detected clusters and the applied dataset. MCS-P was compared with existing performance measures in a simulation study to select the maximum spatial cluster size. Results of other performance measures, such as sensitivity and misclassification, suggest that the spatial scan statistic achieves accurate results in most scenarios with the maximum spatial cluster sizes selected using MCS-P. Given that previously known clusters are not required in the proposed strategy, selection of the optimal maximum cluster size with MCS-P can improve the performance of the scan statistic in applications without identified clusters.
url http://europepmc.org/articles/PMC4731069?pdf=render
work_keys_str_mv AT yuema selectionofthemaximumspatialclustersizeofthespatialscanstatisticbyusingthemaximumclusteringsetproportionstatistic
AT feiyin selectionofthemaximumspatialclustersizeofthespatialscanstatisticbyusingthemaximumclusteringsetproportionstatistic
AT taozhang selectionofthemaximumspatialclustersizeofthespatialscanstatisticbyusingthemaximumclusteringsetproportionstatistic
AT xiaohuaandrewzhou selectionofthemaximumspatialclustersizeofthespatialscanstatisticbyusingthemaximumclusteringsetproportionstatistic
AT xiaosongli selectionofthemaximumspatialclustersizeofthespatialscanstatisticbyusingthemaximumclusteringsetproportionstatistic
_version_ 1725821669527781376