Publications des scientifiques de l'IRD

Delalande F., Carapito C., Brizard Jean-Paul, Brugidou Christophe, Van Dorsselaer A. (2005). Multigenic families and proteomics : extended protein characterization as a tool for paralog gene identification. Proteomics, 5 (2), p. 450-460. ISSN 1615-9853.

Titre du document
Multigenic families and proteomics : extended protein characterization as a tool for paralog gene identification
Année de publication
2005
Type de document
Article référencé dans le Web of Science WOS:000227112100015
Auteurs
Delalande F., Carapito C., Brizard Jean-Paul, Brugidou Christophe, Van Dorsselaer A.
Source
Proteomics, 2005, 5 (2), p. 450-460 ISSN 1615-9853
In classical proteomic studies, the searches in protein databases lead mostly to the identification of protein functions by homology due to the non-exhaustiveness of the protein databases. The quality of the identification depends on the studied organism, its complexity and its representation in the protein databases. Nevertheless, this basic function identification is insufficient for certain applications namely for the development of RNA-based gene-silencing strategies, commonly termed RNA interference (RNAi) in animals and post-transcriptional gene silencing (PTGS) in plants, that require an unambiguous identification of the targeted gene sequence. A PTGS strategy was considered in the study of the infection of Oryza sativa by the Rice Yellow Mottle Virus (RYMV). It is suspected that the RYMV recruits host proteins after its entry into plant cells to form a complex facilitating virus multiplication and spreading. The protein partners of this complex were identified by a classical proteomic approach, nano liquid chromatography tandem mass spectrometry. Among the identified proteins, several were retained for a PTGS strategy. Nevertheless most of the protein candidates appear to be members of multigenic families for which all paralog genes are not present in protein databases. Thus the identification of the real expressed paralog gene with classical protein database searches is impossible. Consequently, as the genome contains all genes and thus all paralog genes, a whole genome search strategy was developed to determine the specific expressed paralog gene. With this approach, the identification of peptides matching only a single gene, called discriminant peptides, allows definitive proof of the expression of this identified gene. This strategy has several requirements: (i) a genome completely sequenced and accessible; (ii) high protein sequence coverage. In the present work, through three examples, we report and validate for the first time a genome database search strategy to specifically identify paralog genes belonging to multigenic families expressed under specific conditions.
Plan de classement
Sciences fondamentales / Techniques d'analyse et de recherche [020] ; Sciences du monde végétal [076]
Identifiant IRD
PAR00000100
Contact