• français
    • English
  • English 
    • français
    • English
  • Login
JavaScript is disabled for your browser. Some features of this site may not work without it.
BIRD Home

Browse

This CollectionBy Issue DateAuthorsTitlesSubjectsJournals BIRDResearch centres & CollectionsBy Issue DateAuthorsTitlesSubjectsJournals

My Account

Login

Statistics

View Usage Statistics

Overfitting Bayesian Mixture Models with an Unknown Number of Components.

Thumbnail
Date
2015
Publisher city
Paris
Link to item file
http://dx.doi.org/10.1371/journal.pone.0131739
Dewey
Probabilités et mathématiques appliquées
Sujet
mixture models; Markov Chain Monte Carlo; Bayesian Analysis
JEL code
C11
Journal issue
PLoS ONE
Volume
10
Number
7
Publication date
2015
Article pages
e0131739
Publisher
Public Library of Science
DOI
http://dx.doi.org/10.1371/journal.pone.0131739
URI
https://basepub.dauphine.fr/handle/123456789/14698
Collections
  • CEREMADE : Publications
Metadata
Show full item record
Author
van Havre, Zoé
White, Nicole M.
Rousseau, Judith
Mengersen, Kerrie
Type
Article accepté pour publication ou publié
Abstract (EN)
This paper proposes solutions to three issues pertaining to the estimation of finitemixture models with an unknown number of components: the non-identifiabilityinduced by overfitting the number of components, the mixing limitations of standardMarkov Chain Monte Carlo (MCMC) sampling techniques, and the related labelswitching problem.An overfitting approach is used to estimate the number of components in a finitemixture model via a Zmix algorithm. Zmix provides a bridge betweenmultidimensional samplers and test based estimation methods, whereby priors arechosen to encourage extra groups to have weights approaching zero. MCMC samplingis made possible by the implementation of prior parallel tempering, an extension ofparallel tempering. Zmix can accurately estimate the number of components, posteriorparameter estimates and allocation probabilities given a sufficiently large sample size.The results will reflect uncertainty in the final model and will reportthe range ofpossible candidate models and their respective estimated probabilities from a singlerun. Label switching is resolved with a computationally light-weight method, Zswitch,developed for overfitted mixtures by exploiting the intuitiveness ofallocation-basedrelabelling algorithms and the precision of label-invariant loss functions.Four simulation studies are included to illustrate Zmix and Zswitch, as well asthree case studies from the literature. All methods are available aspart of the RpackageZmix(github.com/zoevanhavre/Zmix), which can currently be applied tounivariate Gaussian mixture models

  • Accueil Bibliothèque
  • Site de l'Université Paris-Dauphine
  • Contact
SCD Paris Dauphine - Place du Maréchal de Lattre de Tassigny 75775 Paris Cedex 16

 Content on this site is licensed under a Creative Commons 2.0 France (CC BY-NC-ND 2.0) license.