Contrasts for interactions in lmer

First, to clear things up, RRT is not reciprocal reading time, but re-reading time (time spent on a region after it was fixated and left).
I saw that now you use log-transformed DVs and that in my
experience is a good choice for durations collected in eye tracking.
Nevertheless you should check the distribution of model residuals to
back up this decision. Anyway, the log-transformation of RRT may have
lifted the t-value for the PCU  X COND interaction. So I am curious
whether it did or not?
I am very unsecure about whether or not i should log-transform. Because most of the duration variables include zeroes and because this is informative, as well, I don't want to exclude them. To still be able to log-transform, I would have to add a constant value and I am uncertain, if that is a valid operation. Plotting fitted values against residuals doesn't look good both when I use raw data and when I log-transform. The distribution of the model residuals look at least ok when zeroes are excluded (and even better with log(RRT)), but that would no longer be the DV that I am interested in. See these plots for illustration: http://amor.rz.hu-berlin.de/~metznerp/Rplot.pdf
However, the log-transformation did indeed lift the t-value for PCU x COND, but only marginally (from 1.492 to 1.581). Generally, the log-transform seems to have little impact on the main effects and interactions (two or three effects disappear without log-transformation).

Maybe I responded to quickly ...

First, I guess for a two-level factor a sum contrast can also be
called a Helmert contrast; it is a bit unusual, I think.

Second, the story about the fixed-effect correlations is complicated.
What I wrote for balanced designs returning a zero correlation matrix
of fixed effects assumes also that contrasts for the fixed effects are
orthogonal and that the variance components are specified only for the
intercepts, as you had set up your first model. If you specify a
non-orthogonal set of treatment contrasts, the fixed-effects
correlations will  be 0.5. Thus, these correlations inform about the
correlations of the predictors in the model matrix.
       Moreover, the story changes again  if you estimate values for
parameters representing (co-)-variance components for random effects
in a balanced design, the fixed-effect correlations return values that
(sometimes?) are close to within-subject correlations (i.e.,
correlations unadjusted for shrinkage); maybe for balanced designs
with orthogonal predictors there is actually a specification under
which they are actually identical with them.  This would be cool.

Third, I think the fixed-effect part of the model you give now looks
fine; it is defensible (and sometimes necessary) to exclude
non-significant higher-order interactions.  I still don't think you
need the variance components associated with COND and DIR for subjects
and you may be communicating the wrong thing, but opinions may differ
on this, because non-significance of these components is a thorny
issue.
        In this case, as far as I can see, these correlations are not
really related to the COND x PCU interaction  you are interested in.
Significant effect correlations  can be mapped onto a different kind
of interaction (e.g.,, subjects with a large COND effect may tend to
have a larger DIR effect than subjects with a small COND effect), but
this does not bear on your PCU covariate, at least not directly. (This
could happen independent of a DIR x COND interaction in the
fixed-effect part of the model.)
       I saw that now you use log-transformed DVs and that in my
experience is a good choice for durations collected in eye tracking.
Nevertheless you should check the distribution of model residuals to
back up this decision. Anyway, the log-transformation of RRT may have
lifted the t-value for the PCU  X COND interaction. So I am curious
whether it did or not?

Reinhold Kliegl

On Fri, Aug 13, 2010 at 10:20 AM, Paul Metzner <paul.metzner at gmail.com> wrote:
Thank you for the quick answer!

(1) The Fixed Effects correlations are probably not what you are
after. For example, in a perfectly balanced design, these correlations
will be zero.
They are not, but like you suggested, I wanted them to be at least close to zero. When I changed the model like mentioned before, I noticed an increase in fixed effects correlations and a curious change in contrast coding (see below), that I couldn't explain.
My main interest are the fixed effects interactions. My hypothesis is that subjects with a higher PCU will be affected more strongly by the condition manipulation. Also, in some studies only one kind of verbs (DIR) has been shown to evoke the effect, hence the desired interaction of COND and DIR. But, because I really don't want individual differences over and above what is explained by PCU, I implemented the random effect term like you suggested and re-included the factors contributing to the interactions. My model now looks like this:

lmer(log(RRT)~COND + PCU + COND:PCU + DIR + COND:DIR + (1+COND+DIR|SUBJECT) + (1|ITEM), data=fm3)

Although including the covariance component did not improve model fit, I decided to leave it in the model for the reasons mentioned above. I did, however, exclude the three-way interaction COND:DIR:PCU.

(3) You used a sum contrast specification for the two factors (COND
and DIR). This is fine. For two-level factors there is no point in
specifying Helmert contrasts. So it is unclear what you referring to
in this context.
Being a novice to contrast coding, I thought it was the same. Coincidentally, that seems to be the case for two-level factors. Thanks again for the suggestions!

Paul

On 12 Aug 2010, at 11:24, Reinhold Kliegl wrote:

There is a bit of evidence for an interaction of COND and PCU:
COND1:PCU        48.309     29.850   1.618
If the t-value were larger it would indicate that slopes for the
regression of RRT on PCU differ between the two condition.

There is no statistical support for the the interaction of DIR and PCU
PCU:DIR1        -26.835     29.814  -0.900
Now to some of your questions relating to correlations:
(1) The Fixed Effects correlations are probably not what you are
after. For example, in a perfectly balanced design, these correlations
will be zero.

(2) I suspect what you might be after are effect correlations related
to subjects or items. Assuming cond and verb bias are within-subject
effects, you could get an estimate of the parameter for the covariance
component with the following specification.
RRT ~ COND * PCU * DIR + (1 + COND + DIR  | SUBJECT) + (1 | ITEM)

You should check whether adding these variance components to the model
improves the goodness fo fit, for example with an ANOVA..

(3) You used a sum contrast specification for the two factors (COND
and DIR). This is fine. For two-level factors there is no point in
specifying Helmert contrasts. So it is unclear what you referring to
in this context.

Finally, it is generally a bad idea to specify models with
interactions terms leaving out the factors contributing to the
interactions. If you do so, you need to have very good theoretical
reasons.

Reinhold Kliegl

On Thu, Aug 12, 2010 at 10:44 AM, Paul Metzner <paul.metzner at gmail.com> wrote:
Dear all.

I am currently analyzing eye-tracking data and am interested in a main effect of condition (COND) plus its interaction with subjects' operation span (PCU) and the direction of a verb bias (1 or 2). The contrasts are:

contrasts(COND)
 [,1]
a   -1
b    1
and

contrasts(DIR)
 [,1]
1   -1
2    1
PCU is a continuous predictor which I centered by subtracting the mean (the problem does, however, persist when I split the sample into extreme groups and work with a categorial predictor). With the following model, I don't get a correlation between the fixed effects:

Linear mixed model fit by REML
Formula: RRT ~ COND * PCU * DIR + (1 | SUBJECT) + (1 | ITEM)
   Data: fm3
   AIC   BIC logLik deviance REMLdev
 46733 46801 -23355    46768   46711
Random effects:
 Groups   Name        Variance Std.Dev.
 SUBJECT  (Intercept)  8918.29  94.437
 ITEM     (Intercept)   404.85  20.121
 Residual             34881.69 186.766
Number of obs: 3503, groups: SUBJECT, 59; ITEM, 59

Fixed effects:
               Estimate Std. Error t value
(Intercept)     122.900     12.963   9.481
COND1            15.924      3.165   5.031
PCU             139.411    120.025   1.162
DIR1             -7.746      4.107  -1.886
COND1:PCU        48.309     29.850   1.618
COND1:DIR1       -3.396      3.164  -1.073
PCU:DIR1        -26.835     29.814  -0.900
COND1:PCU:DIR1   -8.069     29.838  -0.270

Correlation of Fixed Effects:
            (Intr) COND1  PCU    DIR1   COND1:PCU COND1:D PCU:DI
COND1        0.002
PCU          0.004 -0.001
DIR1         0.002 -0.004  0.004
COND1:PCU   -0.001 -0.001  0.003  0.000
COND1:DIR1  -0.001  0.000  0.000  0.007  0.021
PCU:DIR1     0.005  0.000 -0.003  0.000 -0.009    -0.005
COND1:PCU:D  0.000  0.021 -0.002 -0.004 -0.009    -0.001   0.011
But, since I'm mainly interested in the interactions and not so much the main effects of PCU and DIR, I changed the model to the following:

Linear mixed model fit by REML
Formula: RRT ~ COND + COND:PCU + COND:DIR + (1 | SUBJECT) + (1 | ITEM)
   Data: fm3
   AIC   BIC logLik deviance REMLdev
 46744 46800 -23363    46769   46726
Random effects:
 Groups   Name        Variance Std.Dev.
 SUBJECT  (Intercept)  8911.15  94.399
 ITEM     (Intercept)   406.16  20.153
 Residual             34869.91 186.735
Number of obs: 3503, groups: SUBJECT, 59; ITEM, 59

Fixed effects:
            Estimate Std. Error t value
(Intercept)  122.962     12.959   9.489
COND1         15.941      3.164   5.039
CONDa:PCU     91.049    123.553   0.737
CONDb:PCU    187.055    123.714   1.512
CONDa:DIR1    -4.340      5.168  -0.840
CONDb:DIR1   -11.160      5.204  -2.144

Correlation of Fixed Effects:
           (Intr) COND1  CONDa:PCU CONDb:PCU CONDa:DIR1
COND1       0.002
CONDa:PCU   0.004 -0.001
CONDb:PCU   0.004 -0.001  0.883
CONDa:DIR1  0.002 -0.003  0.006     0.000
CONDb:DIR1  0.001 -0.003  0.000     0.006     0.256
Not I do get a considerable correlation between the interactions. From the output (CONDa:?, CONDb:?), I infer that the model didn't always use helmert coding for condition but applied something else for the interactions. Is that right? When I code COND numerically as -1 and 1, the correlations turn out fine, which supports my conclusion. I would be very grateful for suggestions.

Thanks,
Paul

---
Paul Metzner

Humboldt-Universit?t zu Berlin
Philosophische Fakult?t II
Institut f?r deutsche Sprache und Linguistik

Post: Unter den Linden 6 | 10099 Berlin | Deutschland
Besuch: Dorotheenstra?e 24 | 10117 Berlin | Deutschland

+49-(0)30-2093-9726
paul.metzner at gmail.com
http://amor.rz.hu-berlin.de/~metznerp/

_______________________________________________
R-sig-mixed-models at r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-sig-mixed-models

---
Paul Metzner

Humboldt-Universit?t zu Berlin
Philosophische Fakult?t II
Institut f?r deutsche Sprache und Linguistik

Post: Unter den Linden 6 | 10099 Berlin | Deutschland
Besuch: Dorotheenstra?e 24 | 10117 Berlin | Deutschland

+49-(0)30-2093-9726
paul.metzner at gmail.com
http://amor.rz.hu-berlin.de/~metznerp/

---
Paul Metzner
Manfred-von-Richthofen-Str. 13
12101 Berlin
Deutschland

Tel.:  +49-(0)30-6730-9220
Mobil: +49-(0)17-8288-1059

paul.metzner at gmail.com
http://amor.rz.hu-berlin.de/~metznerp/

Contrasts for interactions in lmer

Thread (6 messages)