Draft genome and reference transcriptomic resources for the urticating pine defoliator Thaumetopoea pityocampa (Lepidoptera : Notodontidae)- fdi:010073027- Horizon

Publications des scientifiques de l'IRD

Gschloessl B., Dorkeld F., Berges H., Beydon G., Bouchez O., Branco M., Bretaudeau A., Burban C., Dubois E., Gauthier Philippe, Lhuillier E., Nichols J., Nidelet S., Rocha S., Saune L., Streiff R., Gautier M., Kerdelhue C. (2018). Draft genome and reference transcriptomic resources for the urticating pine defoliator Thaumetopoea pityocampa (Lepidoptera : Notodontidae). Molecular Ecology Resources, 18 (3), p. 602-619. ISSN 1755-098X.

Titre du document

Draft genome and reference transcriptomic resources for the urticating pine defoliator Thaumetopoea pityocampa (Lepidoptera : Notodontidae)

Année de publication

2018

Type de document

Article référencé dans le Web of Science WOS:000432662400018

Auteurs

Source

Molecular Ecology Resources, 2018, 18 (3), p. 602-619 ISSN 1755-098X

The pine processionary moth Thaumetopoea pityocampa (Lepidoptera: Notodontidae) is the main pine defoliator in the Mediterranean region. Its urticating larvae cause severe human and animal health concerns in the invaded areas. This species shows a high phenotypic variability for various traits, such as phenology, fecundity and tolerance to extreme temperatures. This study presents the construction and analysis of extensive genomic and transcriptomic resources, which are an obligate prerequisite to understand their underlying genetic architecture. Using a well-studied population from Portugal with peculiar phenological characteristics, the karyotype was first determined and a first draft genome of 537Mb total length was assembled into 68,292 scaffolds (N50 = 164kb). From this genome assembly, 29,415 coding genes were predicted. To circumvent some limitations for fine-scale physical mapping of genomic regions of interest, a 3X coverage BAC library was also developed. In particular, 11 BACs from this library were individually sequenced to assess the assembly quality. Additionally, de novo transcriptomic resources were generated from various developmental stages sequenced with HiSeq and MiSeq Illumina technologies. The reads were de novo assembled into 62,376 and 63,175 transcripts, respectively. Then, a robust subset of the genome-predicted coding genes, the de novo transcriptome assemblies and previously published 454/Sanger data were clustered to obtain a high-quality and comprehensive reference transcriptome consisting of 29,701 bona fide unigenes. These sequences covered 99% of the cegma and 88% of the busco highly conserved eukaryotic genes and 84% of the busco arthropod gene set. Moreover, 90% of these transcripts could be localized on the draft genome. The described information is available via a genome annotation portal (

Plan de classement

Sciences fondamentales / Techniques d'analyse et de recherche [020] ; Sciences du monde végétal [076] ; Sciences du monde animal [080]

Localisation

Fonds IRD [F B010073027]

Identifiant IRD

fdi:010073027

Demander le PDF DOI HAL

Contact

Coordonnées :
IST / IRD Ile-de-France
32 avenue Henri Varagnat
93140 Bondy Cedex
France
Horizon Pleins textes
Aide

Export de données

CSV EndNote XML EndNote MODS Dublin core BibTeX