Selection of the Maximum Spatial Cluster Size of the Spatial Scan Statistic by Using the Maximum Clustering Set-Proportion Statistic.
Spatial scan statistics are widely used in various fields. The performance of these statistics is influenced by parameters, such as maximum spatial cluster size, and can be improved by parameter selection using performance measures. Current performance measures are based on the presence of clusters...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2016-01-01
|
Series: | PLoS ONE |
Online Access: | http://europepmc.org/articles/PMC4731069?pdf=render |
id |
doaj-809d854b178f4d8096411483e25a5470 |
---|---|
record_format |
Article |
spelling |
doaj-809d854b178f4d8096411483e25a54702020-11-24T22:06:49ZengPublic Library of Science (PLoS)PLoS ONE1932-62032016-01-01111e014791810.1371/journal.pone.0147918Selection of the Maximum Spatial Cluster Size of the Spatial Scan Statistic by Using the Maximum Clustering Set-Proportion Statistic.Yue MaFei YinTao ZhangXiaohua Andrew ZhouXiaosong LiSpatial scan statistics are widely used in various fields. The performance of these statistics is influenced by parameters, such as maximum spatial cluster size, and can be improved by parameter selection using performance measures. Current performance measures are based on the presence of clusters and are thus inapplicable to data sets without known clusters. In this work, we propose a novel overall performance measure called maximum clustering set-proportion (MCS-P), which is based on the likelihood of the union of detected clusters and the applied dataset. MCS-P was compared with existing performance measures in a simulation study to select the maximum spatial cluster size. Results of other performance measures, such as sensitivity and misclassification, suggest that the spatial scan statistic achieves accurate results in most scenarios with the maximum spatial cluster sizes selected using MCS-P. Given that previously known clusters are not required in the proposed strategy, selection of the optimal maximum cluster size with MCS-P can improve the performance of the scan statistic in applications without identified clusters.http://europepmc.org/articles/PMC4731069?pdf=render |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Yue Ma Fei Yin Tao Zhang Xiaohua Andrew Zhou Xiaosong Li |
spellingShingle |
Yue Ma Fei Yin Tao Zhang Xiaohua Andrew Zhou Xiaosong Li Selection of the Maximum Spatial Cluster Size of the Spatial Scan Statistic by Using the Maximum Clustering Set-Proportion Statistic. PLoS ONE |
author_facet |
Yue Ma Fei Yin Tao Zhang Xiaohua Andrew Zhou Xiaosong Li |
author_sort |
Yue Ma |
title |
Selection of the Maximum Spatial Cluster Size of the Spatial Scan Statistic by Using the Maximum Clustering Set-Proportion Statistic. |
title_short |
Selection of the Maximum Spatial Cluster Size of the Spatial Scan Statistic by Using the Maximum Clustering Set-Proportion Statistic. |
title_full |
Selection of the Maximum Spatial Cluster Size of the Spatial Scan Statistic by Using the Maximum Clustering Set-Proportion Statistic. |
title_fullStr |
Selection of the Maximum Spatial Cluster Size of the Spatial Scan Statistic by Using the Maximum Clustering Set-Proportion Statistic. |
title_full_unstemmed |
Selection of the Maximum Spatial Cluster Size of the Spatial Scan Statistic by Using the Maximum Clustering Set-Proportion Statistic. |
title_sort |
selection of the maximum spatial cluster size of the spatial scan statistic by using the maximum clustering set-proportion statistic. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2016-01-01 |
description |
Spatial scan statistics are widely used in various fields. The performance of these statistics is influenced by parameters, such as maximum spatial cluster size, and can be improved by parameter selection using performance measures. Current performance measures are based on the presence of clusters and are thus inapplicable to data sets without known clusters. In this work, we propose a novel overall performance measure called maximum clustering set-proportion (MCS-P), which is based on the likelihood of the union of detected clusters and the applied dataset. MCS-P was compared with existing performance measures in a simulation study to select the maximum spatial cluster size. Results of other performance measures, such as sensitivity and misclassification, suggest that the spatial scan statistic achieves accurate results in most scenarios with the maximum spatial cluster sizes selected using MCS-P. Given that previously known clusters are not required in the proposed strategy, selection of the optimal maximum cluster size with MCS-P can improve the performance of the scan statistic in applications without identified clusters. |
url |
http://europepmc.org/articles/PMC4731069?pdf=render |
work_keys_str_mv |
AT yuema selectionofthemaximumspatialclustersizeofthespatialscanstatisticbyusingthemaximumclusteringsetproportionstatistic AT feiyin selectionofthemaximumspatialclustersizeofthespatialscanstatisticbyusingthemaximumclusteringsetproportionstatistic AT taozhang selectionofthemaximumspatialclustersizeofthespatialscanstatisticbyusingthemaximumclusteringsetproportionstatistic AT xiaohuaandrewzhou selectionofthemaximumspatialclustersizeofthespatialscanstatisticbyusingthemaximumclusteringsetproportionstatistic AT xiaosongli selectionofthemaximumspatialclustersizeofthespatialscanstatisticbyusingthemaximumclusteringsetproportionstatistic |
_version_ |
1725821669527781376 |