Publications des scientifiques de l'IRD

Erkes A., Grove R. P., Zarkovic M., Krautwurst S., Koebnik Ralf, Morgan R. D., Wilson G. G., Holzer M., Marz M., Boch J., Grau J. (2023). Assembling highly repetitive Xanthomonas TALomes using Oxford Nanopore sequencing. BMC Genomics, 24 (1), 151 [18 p.]. ISSN 1471-2164.

Titre du document
Assembling highly repetitive Xanthomonas TALomes using Oxford Nanopore sequencing
Année de publication
2023
Type de document
Article référencé dans le Web of Science WOS:000958027400005
Auteurs
Erkes A., Grove R. P., Zarkovic M., Krautwurst S., Koebnik Ralf, Morgan R. D., Wilson G. G., Holzer M., Marz M., Boch J., Grau J.
Source
BMC Genomics, 2023, 24 (1), 151 [18 p.] ISSN 1471-2164
Background Most plant-pathogenic Xanthomonas bacteria harbor transcription activator-like effector (TALE) genes, which function as transcriptional activators of host plant genes and support infection. The entire repertoire of up to 29 TALE genes of a Xanthomonas strain is also referred to as TALome. The DNA-binding domain of TALEs is comprised of highly conserved repeats and TALE genes often occur in gene clusters, which precludes the assembly of TALE-carrying Xanthomonas genomes based on standard sequencing approaches. Results Here, we report the successful assembly of the 5 Mbp genomes of five Xanthomonas strains from Oxford Nanopore Technologies (ONT) sequencing data. For one of these strains, Xanthomonas oryzae pv. oryzae (Xoo) PXO35, we illustrate why Illumina short reads and longer PacBio reads are insufficient to fully resolve the genome. While ONT reads are perfectly suited to yield highly contiguous genomes, they suffer from a specific error profile within homopolymers. To still yield complete and correct TALomes from ONT assemblies, we present a computational correction pipeline specifically tailored to TALE genes, which yields at least comparable accuracy as Illumina-based polishing. We further systematically assess the ONT-based pipeline for its multiplexing capacity and find that, combined with computational correction, the complete TALome of Xoo PXO35 could have been reconstructed from less than 20,000 ONT reads. Conclusions Our results indicate that multiplexed ONT sequencing combined with a computational correction of TALE genes constitutes a highly capable tool for characterizing the TALomes of huge collections of Xanthomonas strains in the future.
Plan de classement
Sciences du monde végétal [076] ; Biotechnologies [084]
Localisation
Fonds IRD [F B010087584]
Identifiant IRD
fdi:010087584
Contact