lmer stand dev of coefficients

Sun, Dec 21, 2008 12:12 PM

Hi all,

This article might help:

The BLUPs are not "best" when it comes to bootstrapping

Jeffrey S. Morris 

Statistics & Probability Letters 56 (2002) 425-430

In the setting of mixed models, some researchers may construct a
semiparametric bootstrap by sampling from the best linear unbiased
predictor residuals.  This paper demonstrates both mathematically and
by simulation that such a bootstrap will consistently underestimate
the variation in the data in finite samples.

Cheers,

Andrew

On Sun, Dec 21, 2008 at 10:59:01AM -0600, Douglas Bates wrote:

On Sun, Dec 21, 2008 at 9:40 AM, Daniel Ezra Johnson
<danielezrajohnson at gmail.com> wrote:

---------- Forwarded message ----------
From: Daniel Ezra Johnson <danielezrajohnson at gmail.com>
Date: Sun, Dec 21, 2008 at 3:39 PM
Subject: Re: [R-sig-ME] lmer stand dev of coefficients
To: Douglas Bates <bates at stat.wisc.edu>

Can you explain briefly what circumstances would lead these quantities
to be quite different?

First, I misspoke. (Note to self: Don't try to answer questions on
theory before the second cup of coffee.)  The standard deviation of
the BLUPs (or, as I prefer to call them, the conditional modes) of the
random effects are not an estimate of the conditional standard
deviation of the random effects given the data.  I can only make sense
of the conditional standard deviation of a particular random effect
and that would be much smaller than the observed standard deviation of
the conditional modes.

What I should have said is somewhat more subtle.  We know that the
conditional modes of the random effects have less variability than the
corresponding individual estimates of a parameter.  I enclose a script
and its output for a particularly simple example - a random-effects
model fit to the Dyestuff data from the lme4 package.  The design is a
balanced, one-way classification so the estimate of the mean yield is
simply the mean of the Yield variable.

We see that the conditional modes are always smaller in magnitude than
the deviations of the individual means from the overall mean.  The
fact that the ratio is constant is a consequence of the balanced
design.   We say that the conditional modes are shrunk towards zero
because the random effects have a finite variance.

The conditional modes are also shrunk relative to what would be
expected from the unconditional variance of the random effects, but I
find it more difficult to explain why.  It makes sense to me that the
mle of the unconditional standard deviation would be larger than the
standard deviation of the conditional modes but of the way the way the
likelihood criterion is formulated.

Perhaps someone else can explain why.

Suppose the random effect grouping factor is Subject.

On what basis would the software estimate the unconditional SD of (the
population of) Subjects to be something quite different (and as you
say, usually larger) than that of the particular group of Subjects in
the data?

Dan

On Sun, Dec 21, 2008 at 3:32 PM, Douglas Bates <bates at stat.wisc.edu> wrote:

On Sun, Dec 21, 2008 at 3:55 AM, Iasonas Lamprianou
<lamprianou at yahoo.com> wrote:

Dear friends
when I use sd(coef(mymodel)$myvariable) I get 0.21
However, the summary of the model gives
Error terms:
 Groups      Name        Std.Dev.
 myvariable (Intercept) 0.33
 Residual               0.76

Why dont I get the same value (0.21 instead of 0.33)?

Because they are estimates of different quantities:
sd(coef(mymodel)$myvariable) is an estimate (although it is not
entirely clear what the properties of such an estimate would be) of
the conditional standard deviation of the random effects given the
data, whereas 0.33 is the maximum likelihood estimate or REML estimate
of the unconditional standard deviation of the random effects.  We
would expect the conditional standard deviation to be smaller than the
unconditional standard deviation.

P.S. If you are starting a new topic on the mailing list you don't
need to quote a previous message to the list and especially not an
entire digest message.

_______________________________________________
R-sig-mixed-models at r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-sig-mixed-models

_______________________________________________
R-sig-mixed-models at r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-sig-mixed-models

_______________________________________________
R-sig-mixed-models at r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-sig-mixed-models

Andrew Robinson  
Department of Mathematics and Statistics            Tel: +61-3-8344-6410
University of Melbourne, VIC 3010 Australia         Fax: +61-3-8344-4599
http://www.ms.unimelb.edu.au/~andrewpr
http://blogs.mbs.edu/fishing-in-the-bay/

lmer stand dev of coefficients

Thread (5 messages)