Publications des scientifiques de l'IRD

Orozco-Arias S., Candamil-Cortes M. S., Valencia-Castrillon E., Jaimes P. A., Orozco N. T., Arias-Mendoza M., Tabares-Soto R., Guyot Romain, Isaza G., IEEE. (2021). SENMAP : a convolutional neural network architecture for curation of LTR-RT libraries from plant genomes. 2021 IEEE 2nd International Congress of Biomedical Engineering and Bioengineering, p. 341660 [4 p.].

Titre du document
SENMAP : a convolutional neural network architecture for curation of LTR-RT libraries from plant genomes
Année de publication
2021
Type de document
Article référencé dans le Web of Science WOS:000853483100063
Auteurs
Orozco-Arias S., Candamil-Cortes M. S., Valencia-Castrillon E., Jaimes P. A., Orozco N. T., Arias-Mendoza M., Tabares-Soto R., Guyot Romain, Isaza G., IEEE
Source
2021 IEEE 2nd International Congress of Biomedical Engineering and Bioengineering, 2021, p. 341660 [4 p.]
Transposable elements (TEs) are specific structures of the genome of species, which can move from one location to another. For that reason, they can cause mutations or changes that can be negative, such as the appearance of diseases, or beneficial, such as participating in fundamental roles in the evolution of genomes and genetic diversity. Long Terminal Repeat retrotransposons (LTR-RT) are the most abundant in plant species, hence the importance of studying these structures in particular. Over the time, these elements can suffer changes called nested insertions, which can inactivate or modify the functioning of the element, for that they are no longer consider as intact element and cannot be used for identification and classification studies. In this work we present SENMAP, a convolutional neural network architecture to obtain intact LTR-RT sequences in plant genomes, which is composed by four convolutional layers, LeakyReLU as activation function and BinaryFocalLoss as loss function. Achieving an F1-score percentage of 91.37% with test data, identifying low quality sequences rapidly and efficiently, contributing to curate libraries of LTR retrotransposons of plants genomes published in large-scale sequencing projects due to the post-genomic era.
Plan de classement
Sciences fondamentales / Techniques d'analyse et de recherche [020] ; Sciences du monde végétal [076] ; Informatique [122]
Localisation
Fonds IRD [F B010086138]
Identifiant IRD
fdi:010086138
Contact