Publications by IRD scientists

Coba-Males M. A., Orozco-Arias S., Guyot Romain, Vela D. (2025). Identification of transposable elements and satellite DNA in the Neotropical species Drosophila amaguana from the Ecuadorian Andean Forests. PLoS One, 20 (12), p. e0337390 [22 p.].

Document title
Identification of transposable elements and satellite DNA in the Neotropical species Drosophila amaguana from the Ecuadorian Andean Forests
Year of publication
2025
Document type
Article indexed in the Web of Science WOS:001636238500041
Authors
Coba-Males M. A., Orozco-Arias S., Guyot Romain, Vela D.
Source
PLoS One, 2025, 20 (12), p. e0337390 [22 p.]
Genome size variation in eukaryotic species, which does not necessarily correlate with organismal complexity, is largely influenced by repetitive DNA sequences such as transposable elements (TEs), simple repeats, and satellite DNAs (satDNAs). In insects, TEs are crucial to evolutionary processes and are correlated with variations in genome size. In this study, we describe, for the first time, the mobilome and satellitome of Drosophila amaguana, an Ecuadorian Neotropical species with a large, previously unexplored genome, to assess the contribution of these repetitive DNA sequences to its genome composition. Using a draft genome assembly of approximately 455.5 Mb, generated from Illumina short-read sequences obtained from 10 wild specimens of D. amaguana collected at the Refugio de Vida Silvestre Pasochoa, we employed a de novo approach to create a manually curated TE library of 737 consensus sequences. We identified 716 novel TE families that had not been previously described, 20 TEs previously characterized in other Drosophila species, and one DNA transposon previously described in the Lepeophtheirus genus. The total TE content in the D. amaguana genome was 21.54%, distributed as follows: 6.35% Helitrons (1 superfamily), 5.13% LTR retrotransposons (5 superfamilies), 3.63% TIRs (9 superfamilies), 3.61% LINEs (7 superfamilies), 1.17% MITEs, 0.94% Maverick, 0.67% PLE, 0.02% SINEs, and 0.01% DIRS. Simple repeats accounted for a further 11.8% of the genome. Additionally, we estimated the satDNA content using Illumina raw reads and identified 16 satDNA families, all unique to the Drosophila genus, which comprise 4.90% of the genome. Overall, our results based on short-read data suggest that the large genome size of D. amaguana may not be the consequence of a high amount of TEs or satDNAs. Instead, its large genome size could be attributed to other factors (e.g., noncoding DNA occupying substantial portions of the genome or a high percentage of duplicated genes) that remain to be determined or explored in future studies using long reads to overcome the limitations of short reads. These findings may offer valuable insights into the adaptive and evolutionary processes of the mesophragmatica species group in the Andean forests.
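As a reading aid, the figures quoted in the abstract can be cross-checked with a few lines of arithmetic. The sketch below is not part of the article: the variable names are illustrative, and the combined repeat fraction assumes that TEs, simple repeats, and satDNAs do not overlap, which the abstract does not state (the satDNA estimate comes from raw reads rather than from the assembly).

```python
# Figures copied from the abstract; variable names are illustrative only.
te_library = {"novel": 716, "known_drosophila": 20, "known_lepeophtheirus": 1}
te_percent = {
    "Helitron": 6.35, "LTR": 5.13, "TIR": 3.63, "LINE": 3.61, "MITE": 1.17,
    "Maverick": 0.94, "PLE": 0.67, "SINE": 0.02, "DIRS": 0.01,
}
reported_te_total = 21.54      # total TE content reported in the abstract (%)
simple_repeat_percent = 11.8   # simple repeats (%)
satdna_percent = 4.90          # satDNA, estimated from raw reads (%)

# Size of the curated TE library: novel plus previously described families.
library_size = sum(te_library.values())

# Sum of the per-class TE percentages, rounded to two decimals.
te_sum = round(sum(te_percent.values()), 2)

# ASSUMPTION: the three repeat categories are disjoint, so they can be added.
combined_repeats = round(
    reported_te_total + simple_repeat_percent + satdna_percent, 2
)
non_repeat_remainder = round(100 - combined_repeats, 2)

print(f"TE library size: {library_size}")
print(f"Sum of TE classes: {te_sum}% (reported total: {reported_te_total}%)")
print(f"Combined repeats (assuming no overlap): {combined_repeats}%")
print(f"Remainder not assigned to these repeats: {non_repeat_remainder}%")
```

The library counts add up to the 737 consensus sequences reported. The per-class percentages add up to 0.01 less than the reported 21.54%, which is presumably a rounding effect. Under the no-overlap assumption, well over half of the assembly is not assigned to any of these repeat classes, in line with the authors' conclusion that TEs and satDNAs alone may not explain the large genome.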
Classification scheme
Basic sciences / Analysis and research techniques [020]; Animal sciences [080]
Geographic description
ECUADOR; ANDES
Location
IRD collection [F B010095975]
IRD identifier
fdi:010095975