Publications des scientifiques de l'IRD

Moudry V., Devillers Rodolphe. (2020). Quality and usability challenges of global marine biodiversity databases : an example for marine mammal data. Ecological Informatics, 56, 101051 [10 p.]. ISSN 1574-9541.

Titre du document
Quality and usability challenges of global marine biodiversity databases : an example for marine mammal data
Année de publication
2020
Type de document
Article
Auteurs
Moudry V., Devillers Rodolphe
Source
Ecological Informatics, 2020, 56, 101051 [10 p.] ISSN 1574-9541
Knowing spatial and temporal patterns of species distribution is paramount to support marine species persistence. While datasets provided by global aggregators are increasingly rich and useful, they suffer from various types of data quality issues that can impact their usage. Using marine mammals as an example, we assessed the quality and information gaps in species distribution data from three major databases: the Global Biodiversity Information Facility (GBIF), the Ocean Biogeographic Information System (OBIS) and the International Union for Conservation of Nature (IUCN) range maps. We analysed marine mammal records from 2015 (n=1,396,581) and from 2019 (n=1,904,968), for six types of common quality or usability issues. Results for both OBIS and GBIF indicate that 35 to 55/100 (depending on the respective database and year) of individual database's records are potential duplicates, fall on land, or miss a data collection date. The positional accuracy of data records varies greatly due to varying precision and rounding of geographic coordinates. However, coordinate precision is specified only in 45/100 and 70/100 of records in GBIF and OBIS, respectively. In 2019, only approximately 70/100 of GBIF and OBIS records are encoded using more than three decimals (i.e. remaining records have a positional accuracy lower than 100) m. We also quantified that only 19/100 (n=135,885) and 11/100 (n=133,882) of the records in 2015 and 2019, respectively, were common to OBIS and GBIF. Despite the continuous increase in the number of records in both databases, the number of shared records slightly decreased. It is therefore likely that new records added to GBIF and OBIS between 2015 and 2019 come from different data providers. Finally, to identify potential information gaps in marine mammal distributions, we overlaid IUCN range maps and species occurrences from global databases. We found that areas previously identified as hotspots for marine mammals' diversity show some of the highest rates of potential false positives (i.e. species are thought to occur there based on their range map, but no species record exist in either GBIF or OBIS). While global biodiversity databases are key to assess global species distribution patterns, our study points to challenges that can limit data usability in biodiversity research. Improving existing data entry mechanisms, quality control routines, as well as data exchange between aggregators should help make those databases more useful to the community and reduce the risks of misuse of biological data.
Plan de classement
Autres vertébrés [034BIOVER03] ; Vertébrés [082FAUNE02] ; Applications diverses [122APPLIC]
Localisation
Fonds IRD [F B010082446]
Identifiant IRD
fdi:010082446
Contact