Distribution of deviance residuals
2 messages · Roelof Coster, Ben Bolker
1 day later
Roelof Coster <roelofcoster at ...> writes:
Hello,

What is the theoretical distribution of the residual deviances of a well-fitting logistic regression mixed model?

The background of my question is as follows. I am looking for a way to combine the ideas of regression tree modelling (a.k.a. model trees) with mixed models. I have a data set to which I want to fit a logistic regression model. My data come in groups, so I need a random effect to account for those groups. The mob function in the party package does what I need, but only for fixed-effects models.

As I understand it, that function arranges the data according to the levels of a given categorical predictor and then looks at the sequence obtained by accumulating the deviance residuals. A hypothesis test is then used to assess whether this sequence can plausibly be a Brownian motion. If it cannot, that is an indication that the data set should be split in two and that two separate models should be fitted. This process is repeated recursively, producing a binary tree with a logistic regression model, fitted to a part of the data, in each leaf of the tree.

I would be grateful for any advice on how this technique could be made to work for my problem.
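[For reference, the fixed-effects case described above can be run with party's mob. This is only a sketch on simulated data; the variable names y, x, and g are made up for illustration, not taken from the poster's data set.]

```r
library(party)  # provides mob(); glinearModel comes from modeltools, loaded by party

## simulated data: binary response y, node-level regressor x,
## and a grouping factor g used as the partitioning variable
set.seed(1)
d <- data.frame(
  y = rbinom(200, 1, 0.5),
  x = rnorm(200),
  g = factor(sample(letters[1:4], 200, replace = TRUE))
)

## logistic regression (y ~ x) in each node, partitioning over g;
## mob tests parameter instability along g and splits where it finds it
fit <- mob(y ~ x | g, data = d,
           model = glinearModel, family = binomial())
print(fit)
```

[In the newer partykit package, glmtree(y ~ x | g, data = d, family = binomial) is the corresponding call, but neither handles random effects, which is the point of the question.]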
Do you mean the deviance residuals (i.e. the per-observation contributions to the deviance, or the signed square root of the contribution) or the total residual deviance (i.e., the sum of squared deviance residuals)?

If the former, I think you're probably in trouble -- empirically, the distribution of residuals is usually pretty ugly, and theoretically I can't think of a reason it should be nice. If the latter, then you can presumably just rely on asymptotic theory, which would say (I'm being sloppy here of course) that the sum of squares of lots of iid things should be chi-square distributed (and then eventually Normal).

For what it's worth, a great deal of the theory of GLMMs is inherited from GLM theory, so if you can solve your problem or find a solution for a plain old *non*-mixed logistic regression, it is likely to work reasonably well for a mixed logistic regression as well (provided you have a reasonable number of levels of the random effect/your estimate isn't singular). (Conversely, if it's known to be nasty for ordinary logistic regression, you're probably screwed.)

As always I'm happy to be corrected by more sensible/knowledgeable people.
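[The distinction drawn above can be checked empirically for a plain non-mixed logistic regression. A sketch on simulated data, showing that the squared per-observation deviance residuals sum to the residual deviance, and that the per-observation residuals for binary data are far from normal:]

```r
## simulated logistic regression with a known true model
set.seed(1)
n <- 500
x <- rnorm(n)
y <- rbinom(n, 1, plogis(-0.5 + x))
fit <- glm(y ~ x, family = binomial())

## per-observation deviance residuals
dr <- residuals(fit, type = "deviance")

## their sum of squares is exactly the total residual deviance
all.equal(sum(dr^2), deviance(fit))

## for binary (0/1) data the residuals fall into two bands,
## one per outcome -- nothing like a normal distribution
hist(dr, breaks = 40)
```

[One caveat on the asymptotic argument: for ungrouped binary responses the chi-square approximation to the residual deviance itself is known to be poor; it behaves much better for grouped/binomial counts with several observations per covariate pattern.]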