Skip to content
Prev 38475 / 398500 Next

Under dispersion; Was: [R] binomial glm warnings revisited

Tord Snall <tord.snall at ebc.uu.se> writes:
...
Don't trust deviances as measures of dispersion with binary data!
...
No. Neither nor.
 
With binary data, the deviance is purely a function of the fitted
parameters. It is the difference in -2 log L between a "perfect fit"
and the observed fit. A perfect fit has a zero prob. where the obs is
"0" and probability 1 where it is "1", and L == 1 identically in that
case. Now consider the likelihood for the "complete toss-up" i.e.
intercept and slope both equal to 0 so all probabilities are 0.5. The
likelihood in that case is 0.5^269, i.e. a constant. Take logarithms
and notice that the model deviance plus the change in deviance from
the model to the "toss-up" model is constant (2*269*log(2) to be
precise). So what appears to be a measure of residual error is
really just a measure of how far the fitted probabilities are from
0.5!