SPODT: An R Package to Perform Spatial Partitioning
Spatial cluster detection is a classical question in epidemiology: Are cases located near other cases? In order to classify a study area into zones of different risks and determine their boundaries, we have developed a spatial partitioning method based on oblique decision trees, which is called spat...
Main Authors: | , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Foundation for Open Access Statistics
2015-02-01
|
Series: | Journal of Statistical Software |
Online Access: | http://www.jstatsoft.org/index.php/jss/article/view/2231 |
id |
doaj-4b30692c0e084e6ea281a1fc0cf8a874 |
---|---|
record_format |
Article |
spelling |
doaj-4b30692c0e084e6ea281a1fc0cf8a8742020-11-24T20:40:36ZengFoundation for Open Access StatisticsJournal of Statistical Software1548-76602015-02-0163112310.18637/jss.v063.i16835SPODT: An R Package to Perform Spatial PartitioningJean GaudartNathalie GraffeoDrissa CoulibalyGuillaume BarbetStanilas RebaudetNadine DessayOgobara K. DoumboRoch GiorgiSpatial cluster detection is a classical question in epidemiology: Are cases located near other cases? In order to classify a study area into zones of different risks and determine their boundaries, we have developed a spatial partitioning method based on oblique decision trees, which is called spatial oblique decision tree (SpODT). This non-parametric method is based on the classification and regression tree (CART) approach introduced by Leo Breiman. Applied to epidemiological spatial data, the algorithm recursively searches among the coordinates for a threshold or a boundary between zones, so that the risks estimated in these zones are as different as possible. While the CART algorithm leads to rectangular zones, providing perpendicular splits of longitudes and latitudes, the SpODT algorithm provides oblique splitting of the study area, which is more appropriate and accurate for spatial epidemiology. Oblique decision trees can be considered as non-parametric regression models. Beyond the basic function, we have developed a set of functions that enable extended analyses of spatial data, providing: inference, graphical representations, spatio-temporal analysis, adjustments on covariates, spatial weighted partition, and the gathering of similar adjacent final classes. In this paper, we propose a new R package, SPODT, which provides an extensible set of functions for partitioning spatial and spatio-temporal data. The implementation and extensions of the algorithm are described. Function usage examples are proposed, looking for clustering malaria episodes in Bandiagara, Mali, and samples showing three different cluster shapes.http://www.jstatsoft.org/index.php/jss/article/view/2231 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Jean Gaudart Nathalie Graffeo Drissa Coulibaly Guillaume Barbet Stanilas Rebaudet Nadine Dessay Ogobara K. Doumbo Roch Giorgi |
spellingShingle |
Jean Gaudart Nathalie Graffeo Drissa Coulibaly Guillaume Barbet Stanilas Rebaudet Nadine Dessay Ogobara K. Doumbo Roch Giorgi SPODT: An R Package to Perform Spatial Partitioning Journal of Statistical Software |
author_facet |
Jean Gaudart Nathalie Graffeo Drissa Coulibaly Guillaume Barbet Stanilas Rebaudet Nadine Dessay Ogobara K. Doumbo Roch Giorgi |
author_sort |
Jean Gaudart |
title |
SPODT: An R Package to Perform Spatial Partitioning |
title_short |
SPODT: An R Package to Perform Spatial Partitioning |
title_full |
SPODT: An R Package to Perform Spatial Partitioning |
title_fullStr |
SPODT: An R Package to Perform Spatial Partitioning |
title_full_unstemmed |
SPODT: An R Package to Perform Spatial Partitioning |
title_sort |
spodt: an r package to perform spatial partitioning |
publisher |
Foundation for Open Access Statistics |
series |
Journal of Statistical Software |
issn |
1548-7660 |
publishDate |
2015-02-01 |
description |
Spatial cluster detection is a classical question in epidemiology: Are cases located near other cases? In order to classify a study area into zones of different risks and determine their boundaries, we have developed a spatial partitioning method based on oblique decision trees, which is called spatial oblique decision tree (SpODT). This non-parametric method is based on the classification and regression tree (CART) approach introduced by Leo Breiman. Applied to epidemiological spatial data, the algorithm recursively searches among the coordinates for a threshold or a boundary between zones, so that the risks estimated in these zones are as different as possible. While the CART algorithm leads to rectangular zones, providing perpendicular splits of longitudes and latitudes, the SpODT algorithm provides oblique splitting of the study area, which is more appropriate and accurate for spatial epidemiology. Oblique decision trees can be considered as non-parametric regression models. Beyond the basic function, we have developed a set of functions that enable extended analyses of spatial data, providing: inference, graphical representations, spatio-temporal analysis, adjustments on covariates, spatial weighted partition, and the gathering of similar adjacent final classes. In this paper, we propose a new R package, SPODT, which provides an extensible set of functions for partitioning spatial and spatio-temporal data. The implementation and extensions of the algorithm are described. Function usage examples are proposed, looking for clustering malaria episodes in Bandiagara, Mali, and samples showing three different cluster shapes. |
url |
http://www.jstatsoft.org/index.php/jss/article/view/2231 |
work_keys_str_mv |
AT jeangaudart spodtanrpackagetoperformspatialpartitioning AT nathaliegraffeo spodtanrpackagetoperformspatialpartitioning AT drissacoulibaly spodtanrpackagetoperformspatialpartitioning AT guillaumebarbet spodtanrpackagetoperformspatialpartitioning AT stanilasrebaudet spodtanrpackagetoperformspatialpartitioning AT nadinedessay spodtanrpackagetoperformspatialpartitioning AT ogobarakdoumbo spodtanrpackagetoperformspatialpartitioning AT rochgiorgi spodtanrpackagetoperformspatialpartitioning |
_version_ |
1716826335653920768 |