Publications des scientifiques de l'IRD

Milet Jacqueline, Courtin David, Garcia André, Perdry H. (2020). Mixed logistic regression in genome-wide association studies. BMC Bioinformatics, 21 (1), p. 536 [17 p.]. ISSN 1471-2105.

Titre du document
Mixed logistic regression in genome-wide association studies
Année de publication
2020
Type de document
Article référencé dans le Web of Science WOS:000595711300001
Auteurs
Milet Jacqueline, Courtin David, Garcia André, Perdry H.
Source
BMC Bioinformatics, 2020, 21 (1), p. 536 [17 p.] ISSN 1471-2105
BackgroundMixed linear models (MLM) have been widely used to account for population structure in case-control genome-wide association studies, the status being analyzed as a quantitative phenotype. Chen et al. proved in 2016 that this method is inappropriate in some situations and proposed GMMAT, a score test for the mixed logistic regression (MLR). However, this test does not produces an estimation of the variants' effects. We propose two computationally efficient methods to estimate the variants' effects. Their properties and those of other methods (MLM, logistic regression) are evaluated using both simulated and real genomic data from a recent GWAS in two geographically close population in West Africa.ResultsWe show that, when the disease prevalence differs between population strata, MLM is inappropriate to analyze binary traits. MLR performs the best in all circumstances. The variants' effects are well evaluated by our methods, with a moderate bias when the effect sizes are large. Additionally, we propose a stratified QQ-plot, enhancing the diagnosis of p values inflation or deflation when population strata are not clearly identified in the sample.ConclusionThe two proposed methods are implemented in the R package milorGWAS available on the CRAN. Both methods scale up to at least 10,000 individuals. The same computational strategies could be applied to other models (e.g. mixed Cox model for survival analysis).
Plan de classement
Sciences fondamentales / Techniques d'analyse et de recherche [020]
Description Géographique
AFRIQUE DE L'OUEST
Localisation
Fonds IRD [F B010080443]
Identifiant IRD
fdi:010080443
Contact