Skip to content

degrees of freedom in mixed model

10 messages · Baldwin, Jim -FS, Jake Westfall, Ben Bolker +4 more

#
.......
It's the wrong approach.

You are using lmer, which uses maximum likelihood estimation, not classical sums of squares. The degrees of freedom don't mean the same thing, and the distribution of REML estimates of variance isn't necessarily chi-squared. So F is interpretable in the same way as it would be in classical anova.

If you want p-values from an lmer model, you could get hold of the lmerTest package. Other recommended approaches include variants on MCMC. There is a great deal of controversy on this point, though; try Googling "p-values from lmer" with particular attention to anything by Douglas Bates (the package author). You _should_ find enough to make you worry that the method used by lmerTest (which as I understand it implements a method used by SAS) comes with quite strong theoretical objections. I am quite sure the lmerTest authors know that perfectly well and offer lmerTest as a package for those who want to find out or for those whose management insist on a SAS-compatible answer. But if I read correctly, that doesn't make it the right thing to do

[Caveat - I'm a chemist. I could be wrong about this]
*******************************************************************
This email and any attachments are confidential. Any use...{{dropped:8}}
#
S Ellison:  Despite you being a chemist, I think you're at least mostly correct.  But from the construction of my statement, it's obvious that I am a statistician and I'm allowed, by law, to be wrong 5% of the time.  And if I claim to be a Frequentist, I don't even have to identify which of my particular statements are incorrect.

Jim

Jim Baldwin
Station Statistician
Pacific Southwest Research Station
USDA Forest Service

-----Original Message-----
From: r-sig-mixed-models-bounces at r-project.org [mailto:r-sig-mixed-models-bounces at r-project.org] On Behalf Of S Ellison
Sent: Friday, January 24, 2014 5:40 AM
To: Iain Gallagher; r-sig-mixed-models at r-project.org
Subject: Re: [R-sig-ME] degrees of freedom in mixed model
.......
It's the wrong approach.

You are using lmer, which uses maximum likelihood estimation, not classical sums of squares. The degrees of freedom don't mean the same thing, and the distribution of REML estimates of variance isn't necessarily chi-squared. So F is interpretable in the same way as it would be in classical anova.

If you want p-values from an lmer model, you could get hold of the lmerTest package. Other recommended approaches include variants on MCMC. There is a great deal of controversy on this point, though; try Googling "p-values from lmer" with particular attention to anything by Douglas Bates (the package author). You _should_ find enough to make you worry that the method used by lmerTest (which as I understand it implements a method used by SAS) comes with quite strong theoretical objections. I am quite sure the lmerTest authors know that perfectly well and offer lmerTest as a package for those who want to find out or for those whose management insist on a SAS-compatible answer. But if I read correctly, that doesn't make it the right thing to do

[Caveat - I'm a chemist. I could be wrong about this]
*******************************************************************
This email and any attachments are confidential. Any use...{{dropped:17}}
#
On 14-01-24 01:07 PM, Jake Westfall wrote:
I more or less agree.  The issue with F distributions, degrees of
freedom, etc etc., is mostly a problem with complex designs that don't
fit into the classical method-of-moments/ANOVA paradigm (R-side effects
[which lme4 doesn't do yet], crossed and partially crossed random
effects, etc.).  In simple cases (as in the example here), the results
of (restricted) ML analyses should more or less line up with the
classical results.  In addition to lmerTest, as pointed out by S?ren
Hojsgaard in the original thread on r-help, the Kenward-Roger
approximation is available in the PBKRtest package ...

  If you asked me about 'denominator df' calculations for GLMMs I would
be considerably more pessimistic ...

Schaalje, G., J. McBride, and G. Fellingham. 2002. ?Adequacy of
Approximations to Distributions of Test Statistics in Complex Mixed
Linear Models.? Journal of Agricultural, Biological & Environmental
Statistics 7 (14): 512?24.
http://www.ingentaconnect.com/content/asa/jabes/2002/00000007/00000004/art00004.
This email and any attachments are confidential. Any u...{{dropped:9}}
#
A small addition to the discussion:  I was recently reading Stroup's "Generalized Linear Mixed Models", which discusses this degrees of freedom issue for LMM's a bit.  For models outside the more "classical" paradigm, it seems that the Kenward-Roger correction controls the type I error rate but the Satterthwaite correction does not (although I did not go on to read the original papers on the subject). 

Ariel

-----Original Message-----
From: r-sig-mixed-models-bounces at r-project.org [mailto:r-sig-mixed-models-bounces at r-project.org] On Behalf Of Ben Bolker
Sent: Friday, January 24, 2014 10:20 AM
To: r-sig-mixed-models at r-project.org
Subject: Re: [R-sig-ME] degrees of freedom in mixed model
On 14-01-24 01:07 PM, Jake Westfall wrote:
I more or less agree.  The issue with F distributions, degrees of freedom, etc etc., is mostly a problem with complex designs that don't fit into the classical method-of-moments/ANOVA paradigm (R-side effects [which lme4 doesn't do yet], crossed and partially crossed random effects, etc.).  In simple cases (as in the example here), the results of (restricted) ML analyses should more or less line up with the classical results.  In addition to lmerTest, as pointed out by S?ren Hojsgaard in the original thread on r-help, the Kenward-Roger approximation is available in the PBKRtest package ...

  If you asked me about 'denominator df' calculations for GLMMs I would be considerably more pessimistic ...

Schaalje, G., J. McBride, and G. Fellingham. 2002. "Adequacy of Approximations to Distributions of Test Statistics in Complex Mixed Linear Models." Journal of Agricultural, Biological & Environmental Statistics 7 (14): 512-24.
http://www.ingentaconnect.com/content/asa/jabes/2002/00000007/00000004/art00004.
This email and any attachments are confidential. Any u.....{{dropped:11}}
#
This post:  https://stat.ethz.ch/pipermail/r-sig-mixed-models/2009q1/001819.html
from a few years ago simulates data (based on a real dataset) and
computes a p-value, then looks at the distribution of the p-value.  In
some cases the p-value was distributed uniformly (when the null was
true) and so the approximation was reasonble, in another case the
p-values were very non-uniform (when the null was true), so for that
case the approximation failed.  It also shows that instead of worrying
about the exact distribution, you can just use the distribution
simulated from the null.

Similar simulations could be done to compare the above suggestions for
the structure of the data and questions of interest.
On Fri, Jan 24, 2014 at 11:19 AM, Ben Bolker <bbolker at gmail.com> wrote:

  
    
3 days later
#
Just for the record, I just noticed that there was a rather crucial 'not' missing from my initial post, just before 'interpretable' in
But others have covered it more than adequately, and it's good to be reminded that using K-R is indeed not often a serious concern in LMMs despite the sometimes heated debate.

Me, I just wish I could find a good, simple answer for fixed effects confidence intervals with crossed random effects in small experiments ...

Steve E




*******************************************************************
This email and any attachments are confidential. Any use...{{dropped:8}}