Non normal random effects

Dear list,

is non normality of random effects a serious issue for inference on the 
fixed effects? I am having a non normal random effect that tremendously 
improves model AIC.

Thanks!
Eric Edeline
Assistant Professor
UPMC-Paris6
UMR 7618 BIOEMCO
Ecole Normale Sup?rieure
46 rue d'Ulm
75230 Paris cedex 05
France

Tel: +33 (0)1 44 32 38 84
Fax: +33 (0)1 44 32 38 85

http://www.biologie.ens.fr/bioemco/biodiversite/edeline.html
Contrary to what is often claimed, it is not the normality of the
random effects themselves that matters, but the normality of 
the sampling distribution of the relevant fixed effect.  In mixed 
models, there is by comparison with iid models the additional 
complication that normality can affect the trade-offs between 
the different components in the fitted model.   Opportunities
for such trade-offs are large if there are several random effects
and there is imbalance or incompleteness (some combinations
of factor levels missing) in the fixed effects structure.  Non-normality
in the random effects can then be both hard to detect and have
implications for inference.

There is an examination of a data set with a relatively complicated 
random effects structure in the overheads at:
http://www.maths.anu.edu.au/%7Ejohnm/r-book/2edn/xtras/mlm-ohp.pdf

John Maindonald             email: john.maindonald at anu.edu.au
phone : +61 2 (6125)3473    fax  : +61 2(6125)5549
Centre for Mathematics & Its Applications, Room 1194,
John Dedman Mathematical Sciences Building (Building 27)
Australian National University, Canberra ACT 0200.
http://www.maths.anu.edu.au/~johnm

Dear list,

is non normality of random effects a serious issue for inference on the fixed effects? I am having a non normal random effect that tremendously improves model AIC.

Thanks!

-- 
Eric Edeline
Assistant Professor
UPMC-Paris6
UMR 7618 BIOEMCO
Ecole Normale Sup?rieure
46 rue d'Ulm
75230 Paris cedex 05
France

Tel: +33 (0)1 44 32 38 84
Fax: +33 (0)1 44 32 38 85

http://www.biologie.ens.fr/bioemco/biodiversite/edeline.html

_______________________________________________
R-sig-mixed-models at r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-sig-mixed-models
Dear John,

thanks for your feed back and for the useful tutorial. Actually the 
random effect in question is normally distributed (I did not check 
before, sorry), so the problem comes from somewhere else. I am modeling 
fish body size from a large dataset as a function of many covariates, 
and adding a "species" effect (be it fixed or random) skews the 
residuals but drops the AIC:

m1<-lmer(log(Length) 
~log(Slope)+log(Width)+Temp*log(D)+Temp*log(Compint2)+Temp*log(Predln102)+Temp*Year
+(1|Species/Station),
data=Data, na.action=na.omit, REML=TRUE) #AIC 73427, skewed residuals

m2<-lmer(log(Length) 
~log(Slope)+log(Width)+Temp*log(D)+Temp*log(Compint2)+Temp*log(Predln102)+Temp*Year
+(1|Station),
data=Data, na.action=na.omit, REML=TRUE) #AIC 147157, Gaussian residuals

This looks puzzling to me. Would you have an idea for why introducing a 
normally distributed effect shews the residuals?
Contrary to what is often claimed, it is not the normality of the
random effects themselves that matters, but the normality of
the sampling distribution of the relevant fixed effect.  In mixed
models, there is by comparison with iid models the additional
complication that normality can affect the trade-offs between
the different components in the fitted model.   Opportunities
for such trade-offs are large if there are several random effects
and there is imbalance or incompleteness (some combinations
of factor levels missing) in the fixed effects structure.  Non-normality
in the random effects can then be both hard to detect and have
implications for inference.

There is an examination of a data set with a relatively complicated
random effects structure in the overheads at:
http://www.maths.anu.edu.au/%7Ejohnm/r-book/2edn/xtras/mlm-ohp.pdf

John Maindonald             email: john.maindonald at anu.edu.au
phone : +61 2 (6125)3473    fax  : +61 2(6125)5549
Centre for Mathematics&  Its Applications, Room 1194,
John Dedman Mathematical Sciences Building (Building 27)
Australian National University, Canberra ACT 0200.
http://www.maths.anu.edu.au/~johnm

On 27/11/2010, at 7:04 AM, Eric Edeline wrote:

Dear list,

is non normality of random effects a serious issue for inference on the fixed effects? I am having a non normal random effect that tremendously improves model AIC.

Thanks!

-- 
Eric Edeline
Assistant Professor
UPMC-Paris6
UMR 7618 BIOEMCO
Ecole Normale Sup?rieure
46 rue d'Ulm
75230 Paris cedex 05
France

Tel: +33 (0)1 44 32 38 84
Fax: +33 (0)1 44 32 38 85

http://www.biologie.ens.fr/bioemco/biodiversite/edeline.html

_______________________________________________
R-sig-mixed-models at r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-sig-mixed-models

Eric Edeline
Assistant Professor
UPMC-Paris6
UMR 7618 BIOEMCO
Ecole Normale Sup?rieure
46 rue d'Ulm
75230 Paris cedex 05
France

Tel: +33 (0)1 44 32 38 84
Fax: +33 (0)1 44 32 38 85

http://www.biologie.ens.fr/bioemco/biodiversite/edeline.html
An embedded and charset-unspecified text was scrubbed...
Name: not available
URL: <https://stat.ethz.ch/pipermail/r-sig-mixed-models/attachments/20101127/921046ec/attachment.pl>
Dear John,

thanks for your feed back and for the useful tutorial. Actually the random
effect in question is normally distributed (I did not check before, sorry),
so the problem comes from somewhere else. I am modeling fish body size from
a large dataset as a function of many covariates, and adding a "species"
effect (be it fixed or random) skews the residuals but drops the AIC:

m1<-lmer(log(Length)
~log(Slope)+log(Width)+Temp*log(D)+Temp*log(Compint2)+Temp*log(Predln102)+Temp*Year
+(1|Species/Station),
data=Data, na.action=na.omit, REML=TRUE) #AIC 73427, skewed residuals
Just in terms of the model specification, do you really mean
(1|Species/Station)?  That expands to

 (1|Species) + (1|Species:Station)

and wouldn't reduce to (1|Station) in a model specification.  I think
you meant "species within station", which would be written as
(1|Station/Species) although I prefer the more explicit form
(1|Station) + (1|Station:Species)
m2<-lmer(log(Length)
~log(Slope)+log(Width)+Temp*log(D)+Temp*log(Compint2)+Temp*log(Predln102)+Temp*Year
+(1|Station),
data=Data, na.action=na.omit, REML=TRUE) #AIC 147157, Gaussian residuals

This looks puzzling to me. Would you have an idea for why introducing a
normally distributed effect shews the residuals?

On 11/26/2010 10:51 PM, John Maindonald wrote:
Contrary to what is often claimed, it is not the normality of the
random effects themselves that matters, but the normality of
the sampling distribution of the relevant fixed effect. ?In mixed
models, there is by comparison with iid models the additional
complication that normality can affect the trade-offs between
the different components in the fitted model. ? Opportunities
for such trade-offs are large if there are several random effects
and there is imbalance or incompleteness (some combinations
of factor levels missing) in the fixed effects structure. ?Non-normality
in the random effects can then be both hard to detect and have
implications for inference.

There is an examination of a data set with a relatively complicated
random effects structure in the overheads at:
http://www.maths.anu.edu.au/%7Ejohnm/r-book/2edn/xtras/mlm-ohp.pdf

John Maindonald ? ? ? ? ? ? email: john.maindonald at anu.edu.au
phone : +61 2 (6125)3473 ? ?fax ?: +61 2(6125)5549
Centre for Mathematics& ?Its Applications, Room 1194,
John Dedman Mathematical Sciences Building (Building 27)
Australian National University, Canberra ACT 0200.
http://www.maths.anu.edu.au/~johnm

On 27/11/2010, at 7:04 AM, Eric Edeline wrote:

Dear list,

is non normality of random effects a serious issue for inference on the
fixed effects? I am having a non normal random effect that tremendously
improves model AIC.

Thanks!

--
Eric Edeline
Assistant Professor
UPMC-Paris6
UMR 7618 BIOEMCO
Ecole Normale Sup?rieure
46 rue d'Ulm
75230 Paris cedex 05
France

Tel: +33 (0)1 44 32 38 84
Fax: +33 (0)1 44 32 38 85

http://www.biologie.ens.fr/bioemco/biodiversite/edeline.html

_______________________________________________
R-sig-mixed-models at r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-sig-mixed-models

--
Eric Edeline
Assistant Professor
UPMC-Paris6
UMR 7618 BIOEMCO
Ecole Normale Sup?rieure
46 rue d'Ulm
75230 Paris cedex 05
France

Tel: +33 (0)1 44 32 38 84
Fax: +33 (0)1 44 32 38 85

http://www.biologie.ens.fr/bioemco/biodiversite/edeline.html

_______________________________________________
R-sig-mixed-models at r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-sig-mixed-models

Dear Eric,
I'm impressed by the drop in the AIC due to the 'Species' variable. I
think it could be useful comparing the coefficients estimated for the
fixed effects between the two models. It may be the case that they are
rather different between m1 and m2. I would pay great attention to the
interpretation of the effect of each variable.

Hope this could help

Valerio
Dear list,

is non normality of random effects a serious issue for inference on the 
fixed effects? I am having a non normal random effect that tremendously 
improves model AIC.

Thanks!

Valerio Bartolino, PhD

Institute of Marine Research - Swedish Board of Fisheries
PO Box 4, 45321 Lysekil, Sweden

Department of Earth Sciences - Gothenburg University
PO Box 460, 40530 G?teborg, Sweden

e-mail: valerio.bartolino at gvc.gu.se
        valerio.bartolino at gmail.com

><(((*> ----- ><(((*> ----- ><(((*> ----- ><(((*>
Here are model coeffs, which are indeed very different between the two 
models for some effects:

1> m1 at fixef
         (Intercept)          log(Slope)          
log(Width)                Temp
       64.3355546922        0.0176278497       -0.0301275431       
-3.8692860013
              log(D)       log(Compint2)      
log(Predln102)                Year
        0.0021372925        0.0023000751        0.0327498020       
-0.0294303421
         Temp:log(D)  Temp:log(Compint2) Temp:log(Predln102)           
Temp:Year
       -0.0032965368       -0.0013422775       -0.0008383696        
0.0019082777
1> m2 at fixef
         (Intercept)          log(Slope)          
log(Width)                Temp
        49.601725055        -0.113952871        -0.013659799        
-2.285783668
              log(D)       log(Compint2)      
log(Predln102)                Year
         0.023140502         0.055044993        -0.252180423        
-0.023767542
         Temp:log(D)  Temp:log(Compint2) Temp:log(Predln102)           
Temp:Year
        -0.006871451        -0.011123770         0.010086872         
0.001163320

In m1 I indeed meant "Station" within "Species". Actually, the full 
model I have selected so far is much more complex and includes several 
nested random effects + variance functions in lme:

m3<-lme(log(Length) 
~log(Slope)+log(Width)+log(Fcl)+Nb.species+Temp*log(D)+Temp*log(Compint2)+Temp*log(Predln102)+Temp*Year,
data=Data, na.action=na.omit, random=list(Species=pdDiag(form=~1), 
Strategy=pdDiag(form=~1), Method=pdDiag(form=~1),
Region=pdDiag(form=~1), Station=pdDiag(form=~1)), control=list(maxIter=100),
weights=varComb(varPower(form=~D), varPower(form=~Predln102), 
varPower(form=~Compint2)))#AIC 50945.01

Maybe this is too complex, but this is the best structure in terms of 
AIC. Still, the problem of skewed residuals from having a species effect 
remains (even as a fixed effect in simple lm models), and I really can't 
figure out why...
Dear Eric,
I'm impressed by the drop in the AIC due to the 'Species' variable. I
think it could be useful comparing the coefficients estimated for the
fixed effects between the two models. It may be the case that they are
rather different between m1 and m2. I would pay great attention to the
interpretation of the effect of each variable.

Hope this could help

Valerio

On Fri, 2010-11-26 at 21:04 +0100, Eric Edeline wrote:

Dear list,

is non normality of random effects a serious issue for inference on the
fixed effects? I am having a non normal random effect that tremendously
improves model AIC.

Thanks!

Eric Edeline
Assistant Professor
UPMC-Paris6
UMR 7618 BIOEMCO
Ecole Normale Sup?rieure
46 rue d'Ulm
75230 Paris cedex 05
France

Tel: +33 (0)1 44 32 38 84
Fax: +33 (0)1 44 32 38 85

http://www.biologie.ens.fr/bioemco/biodiversite/edeline.html
You can have this sort of situation:
'Normal' effect                       .                                       .                                  .
Observations                   .. .  .  .    .                        ..  .    .    .                      ..  .    .     .

The large contribution from the random effect means that,
until it is accounted for, you will not see the non-normality.

John Maindonald             email: john.maindonald at anu.edu.au
phone : +61 2 (6125)3473    fax  : +61 2(6125)5549
Centre for Mathematics & Its Applications, Room 1194,
John Dedman Mathematical Sciences Building (Building 27)
Australian National University, Canberra ACT 0200.
http://www.maths.anu.edu.au/~johnm

Dear John,

thanks for your feed back and for the useful tutorial. Actually the random effect in question is normally distributed (I did not check before, sorry), so the problem comes from somewhere else. I am modeling fish body size from a large dataset as a function of many covariates, and adding a "species" effect (be it fixed or random) skews the residuals but drops the AIC:

m1<-lmer(log(Length) ~log(Slope)+log(Width)+Temp*log(D)+Temp*log(Compint2)+Temp*log(Predln102)+Temp*Year
+(1|Species/Station),
data=Data, na.action=na.omit, REML=TRUE) #AIC 73427, skewed residuals

m2<-lmer(log(Length) ~log(Slope)+log(Width)+Temp*log(D)+Temp*log(Compint2)+Temp*log(Predln102)+Temp*Year
+(1|Station),
data=Data, na.action=na.omit, REML=TRUE) #AIC 147157, Gaussian residuals

This looks puzzling to me. Would you have an idea for why introducing a normally distributed effect shews the residuals?

On 11/26/2010 10:51 PM, John Maindonald wrote:
Contrary to what is often claimed, it is not the normality of the
random effects themselves that matters, but the normality of
the sampling distribution of the relevant fixed effect.  In mixed
models, there is by comparison with iid models the additional
complication that normality can affect the trade-offs between
the different components in the fitted model.   Opportunities
for such trade-offs are large if there are several random effects
and there is imbalance or incompleteness (some combinations
of factor levels missing) in the fixed effects structure.  Non-normality
in the random effects can then be both hard to detect and have
implications for inference.

There is an examination of a data set with a relatively complicated
random effects structure in the overheads at:
http://www.maths.anu.edu.au/%7Ejohnm/r-book/2edn/xtras/mlm-ohp.pdf

John Maindonald             email: john.maindonald at anu.edu.au
phone : +61 2 (6125)3473    fax  : +61 2(6125)5549
Centre for Mathematics&  Its Applications, Room 1194,
John Dedman Mathematical Sciences Building (Building 27)
Australian National University, Canberra ACT 0200.
http://www.maths.anu.edu.au/~johnm

On 27/11/2010, at 7:04 AM, Eric Edeline wrote:

Dear list,

is non normality of random effects a serious issue for inference on the fixed effects? I am having a non normal random effect that tremendously improves model AIC.

Thanks!

-- 
Eric Edeline
Assistant Professor
UPMC-Paris6
UMR 7618 BIOEMCO
Ecole Normale Sup?rieure
46 rue d'Ulm
75230 Paris cedex 05
France

Tel: +33 (0)1 44 32 38 84
Fax: +33 (0)1 44 32 38 85

http://www.biologie.ens.fr/bioemco/biodiversite/edeline.html

_______________________________________________
R-sig-mixed-models at r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-sig-mixed-models

-- 
Eric Edeline
Assistant Professor
UPMC-Paris6
UMR 7618 BIOEMCO
Ecole Normale Sup?rieure
46 rue d'Ulm
75230 Paris cedex 05
France

Tel: +33 (0)1 44 32 38 84
Fax: +33 (0)1 44 32 38 85

http://www.biologie.ens.fr/bioemco/biodiversite/edeline.html

You can have this sort of situation:
'Normal' effect                       .                                       .                                  .
Observations                   .. .  .  .    .                        ..  .    .    .                      ..  .    .     .

The large contribution from the random effect means that,
until it is accounted for, you will not see the non-normality.
                                                                              ~~~~~~~~~~~

[For the extreme case that is illustrated, "skewness" perhaps
rather than "non-normality".    But if the contribution from the
random effect is somewhat weaker, overlap between points 
that correspond to the successive sets of non-normally 
distributed residuals will indeed lead to a distribution that, in 
practice, will be quite hard to distinguish from normal.  
Non-normality at the level of the residuals may or may not 
matter, depending on what it does to the sampling distributions 
that are relevant to the intended inferences.]

John Maindonald             email: john.maindonald at anu.edu.au
phone : +61 2 (6125)3473    fax  : +61 2(6125)5549
Centre for Mathematics & Its Applications, Room 1194,
John Dedman Mathematical Sciences Building (Building 27)
Australian National University, Canberra ACT 0200.
http://www.maths.anu.edu.au/~johnm

Dear John,

thanks for your feed back and for the useful tutorial. Actually the random effect in question is normally distributed (I did not check before, sorry), so the problem comes from somewhere else. I am modeling fish body size from a large dataset as a function of many covariates, and adding a "species" effect (be it fixed or random) skews the residuals but drops the AIC:

m1<-lmer(log(Length) ~log(Slope)+log(Width)+Temp*log(D)+Temp*log(Compint2)+Temp*log(Predln102)+Temp*Year
+(1|Species/Station),
data=Data, na.action=na.omit, REML=TRUE) #AIC 73427, skewed residuals

m2<-lmer(log(Length) ~log(Slope)+log(Width)+Temp*log(D)+Temp*log(Compint2)+Temp*log(Predln102)+Temp*Year
+(1|Station),
data=Data, na.action=na.omit, REML=TRUE) #AIC 147157, Gaussian residuals

This looks puzzling to me. Would you have an idea for why introducing a normally distributed effect shews the residuals?

On 11/26/2010 10:51 PM, John Maindonald wrote:
Contrary to what is often claimed, it is not the normality of the
random effects themselves that matters, but the normality of
the sampling distribution of the relevant fixed effect.  In mixed
models, there is by comparison with iid models the additional
complication that normality can affect the trade-offs between
the different components in the fitted model.   Opportunities
for such trade-offs are large if there are several random effects
and there is imbalance or incompleteness (some combinations
of factor levels missing) in the fixed effects structure.  Non-normality
in the random effects can then be both hard to detect and have
implications for inference.

There is an examination of a data set with a relatively complicated
random effects structure in the overheads at:
http://www.maths.anu.edu.au/%7Ejohnm/r-book/2edn/xtras/mlm-ohp.pdf

John Maindonald             email: john.maindonald at anu.edu.au
phone : +61 2 (6125)3473    fax  : +61 2(6125)5549
Centre for Mathematics&  Its Applications, Room 1194,
John Dedman Mathematical Sciences Building (Building 27)
Australian National University, Canberra ACT 0200.
http://www.maths.anu.edu.au/~johnm

On 27/11/2010, at 7:04 AM, Eric Edeline wrote:

Dear list,

is non normality of random effects a serious issue for inference on the fixed effects? I am having a non normal random effect that tremendously improves model AIC.

Thanks!

-- 
Eric Edeline
Assistant Professor
UPMC-Paris6
UMR 7618 BIOEMCO
Ecole Normale Sup?rieure
46 rue d'Ulm
75230 Paris cedex 05
France

Tel: +33 (0)1 44 32 38 84
Fax: +33 (0)1 44 32 38 85

http://www.biologie.ens.fr/bioemco/biodiversite/edeline.html

_______________________________________________
R-sig-mixed-models at r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-sig-mixed-models

-- 
Eric Edeline
Assistant Professor
UPMC-Paris6
UMR 7618 BIOEMCO
Ecole Normale Sup?rieure
46 rue d'Ulm
75230 Paris cedex 05
France

Tel: +33 (0)1 44 32 38 84
Fax: +33 (0)1 44 32 38 85

http://www.biologie.ens.fr/bioemco/biodiversite/edeline.html

Dear John,

thanks a lot for your reply. For some reason part of your message 
(illustrations of distributions seemingly) comes up corrupted. Although 
I got the essence of your explanation, I would very much like having it 
in full. Could you send illustrations in another format?

eric
You can have this sort of situation:
'Normal' effect                       .                                       .                                  .
Observations                   .. .  .  .    .                        ..  .    .    .                      ..  .    .     .

The large contribution from the random effect means that,
until it is accounted for, you will not see the non-normality.
                                                                               ~~~~~~~~~~~

[For the extreme case that is illustrated, "skewness" perhaps
rather than "non-normality".    But if the contribution from the
random effect is somewhat weaker, overlap between points
that correspond to the successive sets of non-normally
distributed residuals will indeed lead to a distribution that, in
practice, will be quite hard to distinguish from normal.
Non-normality at the level of the residuals may or may not
matter, depending on what it does to the sampling distributions
that are relevant to the intended inferences.]

John Maindonald             email:john.maindonald at anu.edu.au
phone : +61 2 (6125)3473    fax  : +61 2(6125)5549
Centre for Mathematics&  Its Applications, Room 1194,
John Dedman Mathematical Sciences Building (Building 27)
Australian National University, Canberra ACT 0200.
http://www.maths.anu.edu.au/~johnm

On 27/11/2010, at 11:43 PM, Eric Edeline wrote:

Dear John,

thanks for your feed back and for the useful tutorial. Actually the random effect in question is normally distributed (I did not check before, sorry), so the problem comes from somewhere else. I am modeling fish body size from a large dataset as a function of many covariates, and adding a "species" effect (be it fixed or random) skews the residuals but drops the AIC:

m1<-lmer(log(Length) ~log(Slope)+log(Width)+Temp*log(D)+Temp*log(Compint2)+Temp*log(Predln102)+Temp*Year
+(1|Species/Station),
data=Data, na.action=na.omit, REML=TRUE) #AIC 73427, skewed residuals

m2<-lmer(log(Length) ~log(Slope)+log(Width)+Temp*log(D)+Temp*log(Compint2)+Temp*log(Predln102)+Temp*Year
+(1|Station),
data=Data, na.action=na.omit, REML=TRUE) #AIC 147157, Gaussian residuals

This looks puzzling to me. Would you have an idea for why introducing a normally distributed effect shews the residuals?

On 11/26/2010 10:51 PM, John Maindonald wrote:

Contrary to what is often claimed, it is not the normality of the
random effects themselves that matters, but the normality of
the sampling distribution of the relevant fixed effect.  In mixed
models, there is by comparison with iid models the additional
complication that normality can affect the trade-offs between
the different components in the fitted model.   Opportunities
for such trade-offs are large if there are several random effects
and there is imbalance or incompleteness (some combinations
of factor levels missing) in the fixed effects structure.  Non-normality
in the random effects can then be both hard to detect and have
implications for inference.

There is an examination of a data set with a relatively complicated
random effects structure in the overheads at:
http://www.maths.anu.edu.au/%7Ejohnm/r-book/2edn/xtras/mlm-ohp.pdf

John Maindonald             email:john.maindonald at anu.edu.au
phone : +61 2 (6125)3473    fax  : +61 2(6125)5549
Centre for Mathematics&   Its Applications, Room 1194,
John Dedman Mathematical Sciences Building (Building 27)
Australian National University, Canberra ACT 0200.
http://www.maths.anu.edu.au/~johnm

On 27/11/2010, at 7:04 AM, Eric Edeline wrote:

Dear list,

is non normality of random effects a serious issue for inference on the fixed effects? I am having a non normal random effect that tremendously improves model AIC.

Thanks!

-- 
Eric Edeline
Assistant Professor
UPMC-Paris6
UMR 7618 BIOEMCO
Ecole Normale Sup?rieure
46 rue d'Ulm
75230 Paris cedex 05
France

Tel: +33 (0)1 44 32 38 84
Fax: +33 (0)1 44 32 38 85

http://www.biologie.ens.fr/bioemco/biodiversite/edeline.html

_______________________________________________
R-sig-mixed-models at r-project.org  mailing list
https://stat.ethz.ch/mailman/listinfo/r-sig-mixed-models

-- 
Eric Edeline
Assistant Professor
UPMC-Paris6
UMR 7618 BIOEMCO
Ecole Normale Sup?rieure
46 rue d'Ulm
75230 Paris cedex 05
France

Tel: +33 (0)1 44 32 38 84
Fax: +33 (0)1 44 32 38 85

http://www.biologie.ens.fr/bioemco/biodiversite/edeline.html

Eric Edeline
Assistant Professor
UPMC-Paris6
UMR 7618 BIOEMCO
Ecole Normale Sup?rieure
46 rue d'Ulm
75230 Paris cedex 05
France

Tel: +33 (0)1 44 32 38 84
Fax: +33 (0)1 44 32 38 85

http://www.biologie.ens.fr/bioemco/biodiversite/edeline.html
Lets say that I want to look at the interaction between Species and some other variable...Temperature. The right specification would be 

(Temperature|Species) + (Temperature|Station:Species) 

or simply 

(1|Species) + (Temperature|Station:Species) 

Thanks, 

Gustavo S. Betini 
Dept. of Integrative Biology 
University of Guelph, Canada
Just in terms of the model specification, do you really mean
(1|Species/Station)?  That expands to

 (1|Species) + (1|Species:Station)

and wouldn't reduce to (1|Station) in a model specification.  I think
you meant "species within station", which would be written as
(1|Station/Species) although I prefer the more explicit form
(1|Station) + (1|Station:Species)
Lets say that I want to look at the interaction between Species and some other variable...Temperature. The right specification would be

(Temperature|Species) + (Temperature|Station:Species)

or simply

(1|Species) + (Temperature|Station:Species)
That depends on whether you feel that the changes in temperature
coefficient would be associated with the Species or with the
Station:Species combination or both.  I would try to start simply,
perhaps with

(1 + Temperature|Species) + (1|Station:Species)

It is difficult to estimate a large number of variance components
without a lot of data.  Starting with the "sumo" model that has every
possible term and interaction then seeing if you can pare it back
makes more sense in fixed-effects models than in mixed-effects models.
 Deletion strategies are easier to describe and evaluate when working
with fixed-effects only.  Also, in the fixed-effects case having a
model with too many terms in it doesn't usually prevent estimation of
coefficients, it just make the estimators less precise.

In a mixed-effects model, adding interaction terms increases the
number of variance-component parameters quadratically, not linearly.
And it is a lot more difficult to determine that the
variance-covariance matrix of the random effects is singular than to
detect that a model matrix is rank deficient.

So I would recommend building mixed-effects models using forward
selection, not backward selection, and proceeding cautiously.
Thanks,

Gustavo S. Betini
Dept. of Integrative Biology
University of Guelph, Canada

On 10-11-27 12:04 PM, Douglas Bates wrote:

Just in terms of the model specification, do you really mean
(1|Species/Station)? ?That expands to

?(1|Species) + (1|Species:Station)

and wouldn't reduce to (1|Station) in a model specification. ?I think
you meant "species within station", which would be written as
(1|Station/Species) although I prefer the more explicit form
(1|Station) + (1|Station:Species)

_______________________________________________
R-sig-mixed-models at r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-sig-mixed-models