Publications des scientifiques de l'IRD

Villon Sébastien, Mangeas Morgan, Berteaux-Lecellier V., Vigliola Laurent, Lecellier Gael. (2026). Fine-grained assignment of unknown marine eDNA sequences using neural networks. Biology, 15 (3), p. 285 [16 p.].

Titre du document
Fine-grained assignment of unknown marine eDNA sequences using neural networks
Année de publication
2026
Type de document
Article référencé dans le Web of Science WOS:001687983500001
Auteurs
Villon Sébastien, Mangeas Morgan, Berteaux-Lecellier V., Vigliola Laurent, Lecellier Gael
Source
Biology, 2026, 15 (3), p. 285 [16 p.]
Environmental DNA (eDNA) metabarcoding is an innovative tool that is transforming ecological research. It offers a simple and effective method for simultaneously detecting numerous species across a wide range of environments. The method relies on assigning DNA sequences sampled from the environment to taxa, which is straightforward for species that have already been sequenced and are represented in reference databases. However, existing bioinformatics tools often fail to deliver accurate, fine-grained assignments when target species are absent from these databases. This limitation arises from handcrafted classification thresholds that do not account for nucleotide positional information. Here, we propose a deep neural architecture specifically designed to exploit both nucleotide identity and positional patterns in short TELEO sequences. Using an in-silico validation framework based on NCBI genbank sequences, we compare our approach with several state-of-the-art bioinformatics tools (Obitools, Kraken2, Lolo), as well as alternative sequence embedding methods, under controlled conditions. Our approach yields significantly higher classification accuracy at the genus and family levels, achieving average accuracies of 94.7% at the genus level and 86.5% at the family level, substantially outperforming the tested reference-based pipelines. The method remains robust with limited training data and shows improved performance when nucleotide positional information is preserved through sequence alignment. These results demonstrate the potential of AI-powered eDNA metabarcoding to complement existing taxonomic assignment tools, particularly in contexts where reference databases are incomplete or species-level resolution is not achievable, thereby supporting biodiversity monitoring and ecosystem management.
Plan de classement
Sciences fondamentales / Techniques d'analyse et de recherche [020] ; Ecologie, systèmes aquatiques [036]
Localisation
Fonds IRD [F B010096350]
Identifiant IRD
fdi:010096350
Contact
  • Coordonnées :
    Mission Science Ouverte (MSO)
    IRD - Délégation régionale Île-de-France & Ouest
    Campus Condorcet - Hôtel à projets
    8 cours des Humanités - 93322 Aubervilliers Cedex
    Horizon Pleins textes
    Aide
  •