SPODT: An R Package to Perform Spatial Partitioning

Spatial cluster detection is a classical question in epidemiology: Are cases located near other cases? In order to classify a study area into zones of different risks and determine their boundaries, we have developed a spatial partitioning method based on oblique decision trees, which is called spat...

Full description

Bibliographic Details
Main Authors: Jean Gaudart, Nathalie Graffeo, Drissa Coulibaly, Guillaume Barbet, Stanilas Rebaudet, Nadine Dessay, Ogobara K. Doumbo, Roch Giorgi
Format: Article
Language:English
Published: Foundation for Open Access Statistics 2015-02-01
Series:Journal of Statistical Software
Online Access:http://www.jstatsoft.org/index.php/jss/article/view/2231
id doaj-4b30692c0e084e6ea281a1fc0cf8a874
record_format Article
spelling doaj-4b30692c0e084e6ea281a1fc0cf8a8742020-11-24T20:40:36ZengFoundation for Open Access StatisticsJournal of Statistical Software1548-76602015-02-0163112310.18637/jss.v063.i16835SPODT: An R Package to Perform Spatial PartitioningJean GaudartNathalie GraffeoDrissa CoulibalyGuillaume BarbetStanilas RebaudetNadine DessayOgobara K. DoumboRoch GiorgiSpatial cluster detection is a classical question in epidemiology: Are cases located near other cases? In order to classify a study area into zones of different risks and determine their boundaries, we have developed a spatial partitioning method based on oblique decision trees, which is called spatial oblique decision tree (SpODT). This non-parametric method is based on the classification and regression tree (CART) approach introduced by Leo Breiman. Applied to epidemiological spatial data, the algorithm recursively searches among the coordinates for a threshold or a boundary between zones, so that the risks estimated in these zones are as different as possible. While the CART algorithm leads to rectangular zones, providing perpendicular splits of longitudes and latitudes, the SpODT algorithm provides oblique splitting of the study area, which is more appropriate and accurate for spatial epidemiology. Oblique decision trees can be considered as non-parametric regression models. Beyond the basic function, we have developed a set of functions that enable extended analyses of spatial data, providing: inference, graphical representations, spatio-temporal analysis, adjustments on covariates, spatial weighted partition, and the gathering of similar adjacent final classes. In this paper, we propose a new R package, SPODT, which provides an extensible set of functions for partitioning spatial and spatio-temporal data. The implementation and extensions of the algorithm are described. Function usage examples are proposed, looking for clustering malaria episodes in Bandiagara, Mali, and samples showing three different cluster shapes.http://www.jstatsoft.org/index.php/jss/article/view/2231
collection DOAJ
language English
format Article
sources DOAJ
author Jean Gaudart
Nathalie Graffeo
Drissa Coulibaly
Guillaume Barbet
Stanilas Rebaudet
Nadine Dessay
Ogobara K. Doumbo
Roch Giorgi
spellingShingle Jean Gaudart
Nathalie Graffeo
Drissa Coulibaly
Guillaume Barbet
Stanilas Rebaudet
Nadine Dessay
Ogobara K. Doumbo
Roch Giorgi
SPODT: An R Package to Perform Spatial Partitioning
Journal of Statistical Software
author_facet Jean Gaudart
Nathalie Graffeo
Drissa Coulibaly
Guillaume Barbet
Stanilas Rebaudet
Nadine Dessay
Ogobara K. Doumbo
Roch Giorgi
author_sort Jean Gaudart
title SPODT: An R Package to Perform Spatial Partitioning
title_short SPODT: An R Package to Perform Spatial Partitioning
title_full SPODT: An R Package to Perform Spatial Partitioning
title_fullStr SPODT: An R Package to Perform Spatial Partitioning
title_full_unstemmed SPODT: An R Package to Perform Spatial Partitioning
title_sort spodt: an r package to perform spatial partitioning
publisher Foundation for Open Access Statistics
series Journal of Statistical Software
issn 1548-7660
publishDate 2015-02-01
description Spatial cluster detection is a classical question in epidemiology: Are cases located near other cases? In order to classify a study area into zones of different risks and determine their boundaries, we have developed a spatial partitioning method based on oblique decision trees, which is called spatial oblique decision tree (SpODT). This non-parametric method is based on the classification and regression tree (CART) approach introduced by Leo Breiman. Applied to epidemiological spatial data, the algorithm recursively searches among the coordinates for a threshold or a boundary between zones, so that the risks estimated in these zones are as different as possible. While the CART algorithm leads to rectangular zones, providing perpendicular splits of longitudes and latitudes, the SpODT algorithm provides oblique splitting of the study area, which is more appropriate and accurate for spatial epidemiology. Oblique decision trees can be considered as non-parametric regression models. Beyond the basic function, we have developed a set of functions that enable extended analyses of spatial data, providing: inference, graphical representations, spatio-temporal analysis, adjustments on covariates, spatial weighted partition, and the gathering of similar adjacent final classes. In this paper, we propose a new R package, SPODT, which provides an extensible set of functions for partitioning spatial and spatio-temporal data. The implementation and extensions of the algorithm are described. Function usage examples are proposed, looking for clustering malaria episodes in Bandiagara, Mali, and samples showing three different cluster shapes.
url http://www.jstatsoft.org/index.php/jss/article/view/2231
work_keys_str_mv AT jeangaudart spodtanrpackagetoperformspatialpartitioning
AT nathaliegraffeo spodtanrpackagetoperformspatialpartitioning
AT drissacoulibaly spodtanrpackagetoperformspatialpartitioning
AT guillaumebarbet spodtanrpackagetoperformspatialpartitioning
AT stanilasrebaudet spodtanrpackagetoperformspatialpartitioning
AT nadinedessay spodtanrpackagetoperformspatialpartitioning
AT ogobarakdoumbo spodtanrpackagetoperformspatialpartitioning
AT rochgiorgi spodtanrpackagetoperformspatialpartitioning
_version_ 1716826335653920768