Skip to content

Wald F tests

3 messages · John Maindonald, Ben Bolker

#
Does anyone out there have opinions on this subject?
How should one test hypotheses about fixed effects in
(G)LMMs, especially for small to moderate sample sizes?
(Please ignore issues of _estimation_ (PQL
vs Laplace vs AGQ vs ...)

  Should it amuse you to do so, you can vote at:

http://www.surveymonkey.com/s.aspx?sm=yLyfrV_2ftw6WGx2dEFLWnIw_3d_3d

(since we all know that scientific questions are settled
by a democratic process)

a hypothesis testing is soooo 20th century, don't bother
b likelihood ratio tests [ignore known anticonservatism]
c F tests (LMM) or Wald tests (GLMM) [ignore mismatch with hypothesized
null distributions]
d bootstrapped confidence intervals
e [mcmcsamp confidence intervals -- if available]
f randomization/simulation tests of nested null hypotheses
g AIC comparisons [ignore that prediction != hypothesis testing]

  Note that Wald Z tests [option c] are more or less what you're
doing, implicitly, if you just eyeball the estimated
parameter values and their standard errors.

  cheers
    Ben Bolker

-------- Original Message --------
Subject: [Fwd: Re: [R-sig-ME] Wald F tests]
Date: Tue, 07 Oct 2008 17:51:01 -0400
From: Ben Bolker <bolker at ufl.edu>
To: R Mixed Models <r-sig-mixed-models at r-project.org>


  But ... LRTs are non-recommended (anticonservative) for
comparing fixed effects of LMMs hence (presumably) for
GLMMs, unless sample size (# blocks/"residual" total sample
size) is large, no?

I just got through telling readers of
a forthcoming TREE (Trends in Ecology and Evolution) article
that they should use Wald Z, chi^2, t, or F (depending on
whether testing a single or multiple parameters, and whether
there is overdispersion or not), in preference to LRTs,
for testing fixed effects ... ?  Or do you consider LRT
better than Wald in this case (in which case as far as
we know _nothing_ works very well for GLMMs, and I might
just start to cry ...)  Or perhaps I have to get busy
running some simulations ...

  Where would _you_ go to find advice on inference
(as opposed to estimation) on estimated GLMM parameters?

  cheers
   Ben Bolker
Douglas Bates wrote:
#
I think it is going too far to say that one should not be
testing hypotheses (the implication of that "is soooo 20th C"?).
But the place of that activity is much more limited than is
commonly recognized.

Basically, I do not like the range of options that this (half-serious?)
survey has on offer, and I'd need to write half a page or more
to explain why.  Democracy maybe, but (as I suppose is
always the case in the political democracies that are on offer)
the choices are severely constrained.

Where such a hypothesis testing perspective may be
appropriate, the preferred starting point is almost always
a confidence interval.  Why not ask the comparable questions
arise for estimation?

There's an editorial in Volume 72(5) (pp.1057-1058) of the
Journal of Wildlife Management with which I pretty much agree:
"... understand that the average reader of the Journal is
interested in the biological questions addressed with your
work.  The analytical framework and resulting results should
support those questions and flow from them, not overwhelm
them."

But I guess that Ben would like us to assume that the proper
support framework is in place!

Note also, on pages 1272-1278 of the same issue:
"Suggestions for Basic Graph Use When Reporting Wildlife
Research Results", by Brett Collier.

John Maindonald             email: john.maindonald at anu.edu.au
phone : +61 2 (6125)3473    fax  : +61 2(6125)5549
Centre for Mathematics & Its Applications, Room 1194,
John Dedman Mathematical Sciences Building (Building 27)
Australian National University, Canberra ACT 0200.
On 11/10/2008, at 6:47 AM, Ben Bolker wrote:

            
#
Yes, this is partly tongue in cheek, and I agree that
hypothesis testing is overemphasized (I suspect that many
of the r-sig-mixed-models regulars would also agree).  Let's
say we want to construct confidence intervals rather than
test null hypotheses.  Then our choices are something like

* construct Z- or t-based confidence intervals from
estimated standard error
* bootstrap confidence intervals
* mcmcsamp confidence intervals

  which correspond to c,d,e below.  I suppose another
choice (corresponding more or less to b, LRT)
would be likelihood profile
confidence intervals, but I would really worry in
this case that the known anticonservatism of LRTs
would translate to profile confidence intervals
with poor coverage.

Most of the difficulties that arise in null-hypothesis testing have
analogues in constructing appropriate confidence intervals.

  cheers
    Ben Bolker

        
John Maindonald wrote: