Estimation of demo-genetic model probabilities with Approximate Bayesian Computation using linear discriminant analysis on summary statistics.
Cornuet, Jean-Marie; Robert, Christian P.; Pudlo, Pierre; Guillemaud, Thomas; Marin, Jean-Michel; Lombaert, Eric; Estoup, Arnaud (2012), Estimation of demo-genetic model probabilities with Approximate Bayesian Computation using linear discriminant analysis on summary statistics., Molecular Ecology Resources, 12, 5, p. 846-855. http://dx.doi.org/10.1111/j.1755-0998.2012.03153.x
TypeArticle accepté pour publication ou publié
Journal nameMolecular Ecology Resources
MetadataShow full item record
Robert, Christian P.
Abstract (EN)Comparison of demo-genetic models using Approximate Bayesian Computation (ABC) is an active research field. Although large numbers of populations and models (i.e. scenarios) can be analysed with ABC using molecular data obtained from various marker types, methodological and computational issues arise when these numbers become too large. Moreover, Robert et al. (Proceedings of the National Academy of Sciences of the United States of America, 2011, 108, 15112) have shown that the conclusions drawn on ABC model comparison cannot be trusted per se and required additional simulation analyses. Monte Carlo inferential techniques to empirically evaluate confidence in scenario choice are very time-consuming, however, when the numbers of summary statistics (Ss) and scenarios are large. We here describe a methodological innovation to process efficient ABC scenario probability computation using linear discriminant analysis (LDA) on Ss before computing logistic regression. We used simulated pseudo-observed data sets (pods) to assess the main features of the method (precision and computation time) in comparison with traditional probability estimation using raw (i.e. not LDA transformed) Ss. We also illustrate the method on real microsatellite data sets produced to make inferences about the invasion routes of the coccinelid Harmonia axyridis. We found that scenario probabilities computed from LDA-transformed and raw Ss were strongly correlated. Type I and II errors were similar for both methods. The faster probability computation that we observed (speed gain around a factor of 100 for LDA-transformed Ss) substantially increases the ability of ABC practitioners to analyse large numbers of pods and hence provides a manageable way to empirically evaluate the power available to discriminate among a large set of complex scenarios.
Subjects / KeywordsGenetic; Models; Population; Genetics; Genetic Markers; Computational Biology; Biostatistics; Beetles; Animals
Showing items related by title and author.
Some discussions of D. Fearnhead and D. Prangle's Read Paper "Constructing summary statistics for approximate Bayesian computation: semi-automatic approximate Bayesian computation" Singh, Sumeetpal S.; Sedki, Mohammed; Jasra, Ajay; Pudlo, Pierre; Robert, Christian P.; Lee, Anthony; Marin, Jean-Michel; Kosmidis, Ioannis; Girolami, Mark; Andrieu, Christophe; Cornebise, Julien; Doucet, Arnaud; Barthelme, Simon; Chopin, Nicolas (2012) Article accepté pour publication ou publié
Infering population history with DIY ABC : a user-friendly approach to Approximate Bayesian Computation Estoup, Arnaud; Marin, Jean-Michel; Robert, Christian P.; Beaumont, Mark A.; Santos, Filipe; Guillemaud, Thomas; Balding, David; Cornuet, Jean-Marie (2008-04) Article accepté pour publication ou publié
Pudlo, Pierre; Marin, Jean-Michel; Estoup, Arnaud; Cornuet, Jean-Marie; Gautier, Mathieu; Robert, Christian P. (2016) Article accepté pour publication ou publié
Robert, Christian P.; Cornuet, Jean-Marie; Marin, Jean-Michel; Pillai, Natesh S. (2011) Article accepté pour publication ou publié