model selection in lme4

Took a (very) quick look at Raftery, which all seems sensible
and well-argued.  However ... the paper contrasts Bayes/BIC
with classical hypothesis testing.  Many of the points listed on p. 155
(better assessment of evidence, applicability to non-nested models, take
model uncertainty into account, allow model averaging, easy to
implement) apply to AIC as well as BIC.  BIC does have many good
qualities (approximation to Bayes factor, sensible "flat prior"
interpretation, statistical consistency, ...).  But the crux of the
argument between BIC and AIC is the difference in their objective. BIC
aims to identify the "true model", which essentially assumes that there
is a sharp cutoff between parameters/processes that are in the model and
those that are out. Burnham and Anderson have a lot to say about
tapering effect sizes; they are zealots about AIC, and I often discount
their enthusiasm, but after much percolation I've decided that AIC
really does make sense for the kinds of questions I (and many
ecologists) tend to ask.

   When you say that AIC selects an overly complex model, how
do you know what the correct model is and which parameters are
unnecessary?  Is this a case of fitting to simulation output?
In that case I might bring up B&A's "tapering effects" argument
again -- selecting the correct model with a fixed number of parameters
with non-tapering effects is what BIC is for, not what AIC is for.

  I have tried to say this more coherently at
http://emdbolker.wikidot.com/blog:aic-vs-bic

  As an aside, I don't have a vested interest in this, and I don't
claim that AIC is better for everything ... just that it seems
most ecologists are working with "true models" that are of
arbitrarily large dimension with tapering effects, which is where
AIC should select the model with the best predictive capability ...

  Ben Bolker

model selection in lme4

Thread (7 messages)