Publications by IRD scientists

Gaudart J., Graffeo N., Coulibaly D., Barbet G., Rebaudet S., Dessay Nadine, Doumbo O.K., Giorgi Roch. (2015). SPODT: An R package to perform spatial partitioning. Journal of Statistical Software, 63 (16), 23 p. ISSN 1548-7660.

Document title
SPODT: An R package to perform spatial partitioning
Year of publication
2015
Document type
Article indexed in Web of Science WOS:000349847100001
Authors
Gaudart J., Graffeo N., Coulibaly D., Barbet G., Rebaudet S., Dessay Nadine, Doumbo O.K., Giorgi Roch
Source
Journal of Statistical Software, 2015, 63 (16), 23 p. ISSN 1548-7660
Spatial cluster detection is a classical question in epidemiology: are cases located near other cases? In order to classify a study area into zones of different risks and determine their boundaries, we have developed a spatial partitioning method based on oblique decision trees, called the spatial oblique decision tree (SpODT). This non-parametric method is based on the classification and regression tree (CART) approach introduced by Leo Breiman. Applied to epidemiological spatial data, the algorithm recursively searches among the coordinates for a threshold or a boundary between zones, so that the risks estimated in these zones are as different as possible. While the CART algorithm leads to rectangular zones, because it splits perpendicular to the longitude and latitude axes, the SpODT algorithm provides oblique splitting of the study area, which is more appropriate and accurate for spatial epidemiology. Oblique decision trees can be considered as non-parametric regression models. Beyond the basic function, we have developed a set of functions that enable extended analyses of spatial data, providing: inference, graphical representations, spatio-temporal analysis, adjustment for covariates, spatially weighted partitioning, and the merging of similar adjacent final classes. In this paper, we propose a new R package, SPODT, which provides an extensible set of functions for partitioning spatial and spatio-temporal data. The implementation and extensions of the algorithm are described. Function usage examples are provided, looking for clusters of malaria episodes in Bandiagara, Mali, and using samples that show three different cluster shapes.
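The contrast between perpendicular (CART-style) and oblique splits can be illustrated with a minimal sketch. It is written in Python for illustration only and is not the SPODT package or its API: the function names (`between_group_ss`, `best_split`), the fixed list of candidate angles, and the toy grid are all assumptions. The sketch projects the coordinates onto a direction, tries every threshold along that direction, and keeps the split whose two zones have the most different mean values, measured here by the between-group sum of squares.

```python
import math


def between_group_ss(left, right):
    """Between-group sum of squares for two groups of values."""
    n_l, n_r = len(left), len(right)
    mean_l = sum(left) / n_l
    mean_r = sum(right) / n_r
    mean_all = (sum(left) + sum(right)) / (n_l + n_r)
    return n_l * (mean_l - mean_all) ** 2 + n_r * (mean_r - mean_all) ** 2


def best_split(points, angles_deg):
    """Search one split over candidate directions (hypothetical helper).

    points: list of (x, y, z), where z is the value observed at (x, y).
    Returns (score, angle_deg, threshold) for the best split found.
    """
    best = (-1.0, None, None)
    for angle in angles_deg:
        theta = math.radians(angle)
        ux, uy = math.cos(theta), math.sin(theta)
        # Project each location onto the direction, then sort along it.
        projected = sorted((x * ux + y * uy, z) for x, y, z in points)
        for i in range(1, len(projected)):
            lo, hi = projected[i - 1][0], projected[i][0]
            if hi - lo < 1e-12:
                continue  # no boundary fits between tied projections
            left = [z for _, z in projected[:i]]
            right = [z for _, z in projected[i:]]
            score = between_group_ss(left, right)
            if score > best[0]:
                best = (score, angle, (lo + hi) / 2)
    return best


# Toy data (an assumption): a 6 x 6 grid whose high-value zone lies
# beyond the diagonal boundary x + y = 5.5.
points = [(x, y, 1 if x + y > 5 else 0) for x in range(6) for y in range(6)]

axis = best_split(points, [0, 90])              # perpendicular splits only
oblique = best_split(points, [0, 45, 90, 135])  # oblique directions allowed

print("axis-aligned:", axis)
print("oblique:     ", oblique)
```

On this toy grid the oblique search can recover the diagonal boundary exactly, whereas any split perpendicular to an axis mixes the two zones. A full partitioning would apply such a search recursively within each resulting zone; the stopping rules, inference, and extensions mentioned in the abstract are not represented here.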
Classification scheme
Basic sciences / Analysis and research techniques [020]; Medical entomology / Parasitology / Virology [052]; Remote sensing [126]
Location
IRD collection [F B010067259]
IRD identifier
fdi:010067259