文档详情

A Bayesian test for excess zeros in a zero-inflated power series distribution.pdf

发布:2017-04-11约4.27万字共16页下载文档
文本预览下载声明
a r X i v : 0 8 0 5 .2 2 5 8 v 1 [ m a t h .S T ] 1 5 M a y 2 0 0 8 IMS Collections Beyond Parametrics in Interdisciplinary Research: Festschrift in Honor of Professor Pranab K. Sen Vol. 1 (2008) 89–104 c? Institute of Mathematical Statistics, 2008 DOI: 10.1214A Bayesian test for excess zeros in a zero-inflated power series distribution? Archan Bhattacharya1, Bertrand S. Clarke2 and Gauri S. Datta1 University of Georgia, University of British Columbia and University of Georgia Abstract: Power series distributions form a useful subclass of one-parameter discrete exponential families suitable for modeling count data. A zero-inflated power series distribution is a mixture of a power series distribution and a degenerate distribution at zero, with a mixing probability p for the degenerate distribution. This distribution is useful for modeling count data that may have extra zeros. One question is whether the mixture model can be reduced to the power series portion, corresponding to p = 0, or whether there are so many zeros in the data that zero inflation relative to the pure power series distribution must be included in the model i.e., p ≥ 0. The problem is difficult partially because p = 0 is a boundary point. Here, we present a Bayesian test for this problem based on recognizing that the parameter space can be expanded to allow p to be negative. Negative values of p are inconsistent with the interpretation of p as a mixing probabil- ity, however, they index distributions that are physically and probabilistically meaningful. We compare our Bayesian solution to two standard frequentist testing procedures and find that using a posterior probability as a test statis- tic has slightly higher power on the most important ranges of the sample size n and parameter values than the score test and likelihood ratio test in simu- lations. Our method also performs well on three real data sets. 1. Zero-inflated families Models for count data often fail to fi
显示全部
相似文档