Skip to content

Correcting for overdispersion

4 messages · Lawrence, Adaku, Peter Dalgaard

#
On Jul 9, 2012, at 20:23 , Lawrence, Adaku wrote:

            
Er, in what sense is that a problem? Your code is not reproducible, at least some output to look at might help.

-pd

  
    
#
On Jul 9, 2012, at 21:08 , Lawrence, Adaku wrote:

            
Not without qualification. There are various schools, but if you ask me, I think that overdispersion models are used a bit too often without proper attention to what they actually mean. Sometimes the effect is (unwittingly) to paper over systematic lack of fit in the model (judging by your residuals, that's not likely the case here, though). 

To use such models you should have evidence of lack of fit and/or a plausible reason for the extra variation. 

Re. evidence, you have a deviance of 7.31 on 4 df which corresponds to a p value of 0.12 in the asymptotic chi-square distribution. So, not exactly convincing; also, you need to consider whether the expected counts are large enough for the asymptotics to hold.

Re. plausibility, you should ask yourself whether there is good reason to have have an extra random effect operating at the level of individual binomial distributions. This could be the case if you have an experiment of the sort where you give, say, a doses of pesticide to containers of 50 flies, and count the dead ones. In that case, there could be effects of getting the dose slightly wrong, the temperature of the container, and whatnot. If on the other hand, you inject a batch of rats with a dose from a randomly chosen vial, each of which contain a carefully and individually measured-out dose, then it could be quite hard to think of a reason for something increasing or decreasing the probability for all rats at the same dose.

That being said, as far as I can tell, there's no problem in principle with using dose.p on an overdispersed model, because it only depends on vcov(obj). An overdispersion parameter based on 4 df is the most worrying bit.

-pd