multiple variance structure in lmer giving zero variances

4 messages · Sara Krause, Ben Bolker

Sat, Jun 2, 2012 8:54 PM #

An embedded and charset-unspecified text was scrubbed...
Name: not available
URL: <https://stat.ethz.ch/pipermail/r-sig-mixed-models/attachments/20120602/9960983a/attachment.pl>

Ben Bolker

Sun, Jun 3, 2012 12:15 PM #

Sara Krause <skkrause at ...> writes:

(I can understand your desperation, but please don't cross-post ...
I almost answered this on r-help with a suggestion that you try
on r-sig-mixed instead.)

[snipped from below]:

In general, categorical predictors should not appear in both
the fixed and random parts of the model -- that constitutes overfitting.

Depending on the structure of your data you should also strongly
consider including interactions between treatment and your random
factors (see Schielzeth, Holger, and Wolfgang
Forstmeier. 2009. ?Conclusions Beyond Support: Overconfident Estimates
in Mixed Models.? Behavioral Ecology 20 (2):416?420.
doi:10.1093/beheco/arn145.
http://www.ncbi.nlm.nih.gov/pubmed/19461866.).  However, you can only
include treatment interactions that make sense. It sounds like each
individual is allocated to a single treatment, so (guessing that your
statistician means that you want the *interactions* of ID with state
and season) I think you should probably use

y~trt*season*state*site+(state|ID)+(0+season|ID)

which includes a random effect of state, a state:ID interaction
(variation of the state effect among individuals), and a season:ID
interaction (variation of seasonal effect among individuals).

  Why are you moving to lmer? is it so that you can handle the
separate effects of state:ID and season:ID ?

I'm mostly going to postpone comment on your second question until
your first question is sorted out, although I would also encourage you
to see if your heteroscedasticity problem might be handled by transformation --
most typically, if treatment increases both the mean and variance a lot,
then log transforming the data might take care of the problem with less
hassle ...

Your basic problem here is that it doesn't make sense to test the interaction
of treatment with ID (as you have effectively done here), because each
individual is only in one treatment category ...

Sara Krause

Sun, Jun 3, 2012 4:49 PM #

An embedded and charset-unspecified text was scrubbed...
Name: not available
URL: <https://stat.ethz.ch/pipermail/r-sig-mixed-models/attachments/20120603/8b5df5ac/attachment.pl>

Ben Bolker

Mon, Jun 4, 2012 2:02 AM #

Sara Krause <skkrause at ...> writes:

Not a big deal, now you know.

I don't think groupedData is absolutely necessary.  I'll leave the
pdBlocked question for someone else ...

Yes, but in this case you just can't -- because each individual only
gets a single treatment.  So the interaction might be there, but you
can't estimate it from your data (which gets you off the hook).

Anyone else want to chime in on this ... ?

I think you're OK: (state|ID) and (season|ID) are putting *interactions*
in the model.  What would definitely break would be (1|state) or (1|season)
(i.e. using fixed effects as *grouping variables* for the random effects).
I think the model is fine as it is.

Do you mean "not" above (and "he" is your statistician)?  I can
imagine him saying "not" because they are unidentifiable based on your
design -- so it's not that they're not important, just that you can't
do anything about it.

[snip]

  I claim you're already including these interactions (as
interactions of *fixed* effects).  
   
 [snip]

Probably still somewhat overfitted.  Try simplifying the model
a bit and see if you can get somewhere (e.g. take out the four-way
interaction and use (trt+season+state+site)^3 instead?