Skip to content
Prev 7314 / 20628 Next

specifying/interpreting random effects with near-zero variance in glmer()

Margaret Metz <mrmetz at ...> writes:

[snip]
The topo variables are continuous, right?

  You probably don't need to -- this is one of the strengths of 
the mixed modeling approach.
This looks reasonable, you might want to check for overdispersion.
(topo1 | sp) is equivalent to (1 | topo1 | sp) (as
(0 + topo1 | sp) is equivalent to (topo1 - 1 | sp)

  If you have enough data you could try

(topo1 + topo2 + topo3 | sp ) 

which allows for correlation among the effects of the topographic
variables -- although you can run out of data pretty quickly in
some cases, and it sounds from stuff below as though you're running
low on signal anyway.  (This model has (n+1)*(n+2)/2 = 10 parameters --
4 variances (topo[1-3] plus intercept) and 6 covariances -- as opposed
to the 4 variances of the model you are using.) (I'm not counting
the station variable in these totals.)
This is a reasonable interpretation.  However, be aware that this
is signal-to-noise / sample-size dependent.  There could be (is, by
definition, in an ecological system) some among-species and
among-station variance that you just can't detect with this data set.
(In a classical model with a balanced, nested, etc. design you would
probably just find a small (non-significant) variance in this case,
rather than a practically-zero one -- on the other hand, there are
other classical models where you would actually estimate a *negative*
variance.)
I don't have a reference but I would suggest that checking for
spatial autocorrelation might be worthwhile. Spatial autocorrelation
would detect the effects of _unmeasured_ covariates that were more
similar among nearby stations.
Exactly (variance among species in responses to topo1, topo2, topo3)

Or would I need to include
(topo1 | sp) is effectively crossing topo with species.

  I would consider looking (at least graphically) for evidence
of nonlinearity in the responses to the continuous variables ...
you could fit a GAM without *too* much extra effort, and with
this size dataset it might produce interesting results.