Statistical Methods

Poisson mixture distribution analysis for North Carolina SIDS counts using information criteria


Abstract


Mixture distribution analysis provides us with a tool for identifying unlabeled clusters that naturally arise in a data set.  In this paper, we demonstrate how to use the information criteria AIC and BIC to choose the optimal number of clusters for a given set of univariate Poisson data.  We give an empirical comparison between minimum Hellinger distance (MHD) estimation and EM estimation for finding parameters in a mixture of Poisson distributions with artificial data.  In addition, we discuss Bayes error in the context of classification problems with mixture of 2, 3, 4, and 5 Poisson models.  Finally, we provide an example with real data, taken from a study that looked at sudden infant death syndrome (SIDS) count data from 100 North Carolina counties (Symons et al., 1983).  This gives us an opportunity to demonstrate the advantages of the proposed model framework in comparison with the original analysis.





DOI: http://dx.doi.org/10.2427/12550

References



Article Metrics

Metrics Loading ...

Metrics powered by PLOS ALM

Refbacks

  • There are currently no refbacks.


Copyright (c) 2013 Epidemiology, Biostatistics and Public Health

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

EBPH Epidemiology, Biostatistics and Public Health | ISSN 2282-0930

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.