Publications by IRD scientists

Lenters T. P., Henderson A., Dracxler C. M., Elias G. A., Kamga S. M., Couvreur Thomas, Kissling W. D. (2021). Integration and harmonization of trait data from plant individuals across heterogeneous sources. Ecological Informatics, 62, p. 101206 [11 p.]. ISSN 1574-9541.

Document title
Integration and harmonization of trait data from plant individuals across heterogeneous sources
Year of publication
2021
Document type
Article indexed in the Web of Science WOS:000640473700003
Authors
Lenters T. P., Henderson A., Dracxler C. M., Elias G. A., Kamga S. M., Couvreur Thomas, Kissling W. D.
Source
Ecological Informatics, 2021, 62, p. 101206 [11 p.] ISSN 1574-9541
Trait data represent the basis for ecological and evolutionary research and have relevance for biodiversity conservation, ecosystem management and earth system modelling. The collection and mobilization of trait data has strongly increased over the last decade, but many trait databases still provide only species-level, aggregated trait values (e.g. ranges, means) and lack the direct observations on which those data are based. Thus, the vast majority of trait data measured directly from individuals remains hidden and highly heterogeneous, impeding their discoverability, semantic interoperability, digital accessibility and (re-)use. Here, we integrate quantitative measurements of verbatim trait information from plant individuals (e.g. lengths, widths, counts and angles of stems, leaves, fruits and inflorescence parts) from multiple sources such as field observations and herbarium collections. We develop a workflow to harmonize heterogeneous trait measurements (e.g. trait names and their values and units) as well as additional information related to taxonomy, measurement or fact and occurrence. This data integration and harmonization builds on vocabularies and terminology from existing metadata standards and ontologies such as the Ecological Trait-data Standard (ETS), the Darwin Core (DwC), the Thesaurus Of Plant characteristics (TOP) and the Plant Trait Ontology (TO). A metadata form filled out by data providers enables the automated integration of trait information from heterogeneous datasets. We illustrate our tools with data from palms (family Arecaceae), a globally distributed (pantropical), diverse plant family that is considered a good model system for understanding the ecology and evolution of tropical rainforests. We mobilize nearly 140,000 individual palm trait measurements in an interoperable format, identify semantic gaps in existing plant trait terminology and provide suggestions for the future development of a thesaurus of plant characteristics. 
Our work thereby promotes the semantic integration of plant trait data in a machine-readable way and shows how large amounts of small trait data sets and their metadata can be integrated into standardized data products.
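The harmonization step described in the abstract (mapping heterogeneous trait names, values and units onto a common form) can be pictured with a minimal Python sketch. It is not the authors' workflow: the field names, the trait-name mapping and the choice of centimetres as target unit are all assumptions made here for illustration. A real pipeline would take such mappings from the metadata form filled out by data providers.

```python
# Hypothetical mapping from verbatim trait names to harmonized names.
# Keys are lowercase; lookups lowercase and strip the verbatim name first.
TRAIT_NAME_MAP = {
    "leaf length": "leaf_length",
    "leaf_len": "leaf_length",
    "stem diam.": "stem_diameter",
}

# Conversion factors to the illustrative target unit (centimetres).
TO_CM = {"mm": 0.1, "cm": 1.0, "m": 100.0}


def harmonize(record):
    """Return a copy of `record` with harmonized name, value and unit added.

    The verbatim fields are kept untouched so the original observation
    stays traceable next to its harmonized counterpart.
    """
    name_key = record["verbatim_trait_name"].strip().lower()
    unit_key = record["verbatim_trait_unit"].strip().lower()

    if name_key not in TRAIT_NAME_MAP:
        raise ValueError(f"unmapped trait name: {record['verbatim_trait_name']!r}")
    if unit_key not in TO_CM:
        raise ValueError(f"unmapped unit: {record['verbatim_trait_unit']!r}")

    harmonized = dict(record)
    harmonized["trait_name"] = TRAIT_NAME_MAP[name_key]
    harmonized["trait_value"] = float(record["verbatim_trait_value"]) * TO_CM[unit_key]
    harmonized["trait_unit"] = "cm"
    return harmonized


# Two invented measurements of one trait, reported differently by two sources.
records = [
    {"verbatim_trait_name": "Leaf_len", "verbatim_trait_value": "2.5", "verbatim_trait_unit": "m"},
    {"verbatim_trait_name": "leaf length", "verbatim_trait_value": "250", "verbatim_trait_unit": "cm"},
]
harmonized_records = [harmonize(r) for r in records]
```

Raising an error on unmapped names or units, rather than passing them through, is one way to surface the kind of semantic gaps in trait terminology that the abstract mentions.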
Classification scheme
Basic sciences / Analytical and research techniques [020]; Plant sciences [076]; Computer science [122]
Location
IRD collection [F B010081349]
IRD identifier
fdi:010081349