Publications des scientifiques de l'IRD

Gschloessl B., Dorkeld F., Berges H., Beydon G., Bouchez O., Branco M., Bretaudeau A., Burban C., Dubois E., Gauthier Philippe, Lhuillier E., Nichols J., Nidelet S., Rocha S., Saune L., Streiff R., Gautier M., Kerdelhue C. (2018). Draft genome and reference transcriptomic resources for the urticating pine defoliator Thaumetopoea pityocampa (Lepidoptera : Notodontidae). Molecular Ecology Resources, 18 (3), p. 602-619. ISSN 1755-098X.

Titre du document
Draft genome and reference transcriptomic resources for the urticating pine defoliator Thaumetopoea pityocampa (Lepidoptera : Notodontidae)
Année de publication
2018
Type de document
Article référencé dans le Web of Science WOS:000432662400018
Auteurs
Gschloessl B., Dorkeld F., Berges H., Beydon G., Bouchez O., Branco M., Bretaudeau A., Burban C., Dubois E., Gauthier Philippe, Lhuillier E., Nichols J., Nidelet S., Rocha S., Saune L., Streiff R., Gautier M., Kerdelhue C.
Source
Molecular Ecology Resources, 2018, 18 (3), p. 602-619 ISSN 1755-098X
The pine processionary moth Thaumetopoea pityocampa (Lepidoptera: Notodontidae) is the main pine defoliator in the Mediterranean region. Its urticating larvae cause severe human and animal health concerns in the invaded areas. This species shows a high phenotypic variability for various traits, such as phenology, fecundity and tolerance to extreme temperatures. This study presents the construction and analysis of extensive genomic and transcriptomic resources, which are an obligate prerequisite to understand their underlying genetic architecture. Using a well-studied population from Portugal with peculiar phenological characteristics, the karyotype was first determined and a first draft genome of 537Mb total length was assembled into 68,292 scaffolds (N50 = 164kb). From this genome assembly, 29,415 coding genes were predicted. To circumvent some limitations for fine-scale physical mapping of genomic regions of interest, a 3X coverage BAC library was also developed. In particular, 11 BACs from this library were individually sequenced to assess the assembly quality. Additionally, de novo transcriptomic resources were generated from various developmental stages sequenced with HiSeq and MiSeq Illumina technologies. The reads were de novo assembled into 62,376 and 63,175 transcripts, respectively. Then, a robust subset of the genome-predicted coding genes, the de novo transcriptome assemblies and previously published 454/Sanger data were clustered to obtain a high-quality and comprehensive reference transcriptome consisting of 29,701 bona fide unigenes. These sequences covered 99% of the cegma and 88% of the busco highly conserved eukaryotic genes and 84% of the busco arthropod gene set. Moreover, 90% of these transcripts could be localized on the draft genome. The described information is available via a genome annotation portal (
Plan de classement
Sciences fondamentales / Techniques d'analyse et de recherche [020] ; Sciences du monde végétal [076] ; Sciences du monde animal [080]
Localisation
Fonds IRD [F B010073027]
Identifiant IRD
fdi:010073027
Contact