Hi all, I have a questions about the following situation and was hoping to find clarification here. I have a data frame with the following variables: id, genotype, group, block, climate, response I measured a response of 7 genotypes in a randomized complete block design. I measured each genotype 8 times (n=48). I grouped my 7 genotypes into 3 for me more reasonable groups. I measured the response on the same 7 genotypes 3 times under different climatic conditions. I specified block and genotype as random and group as fixed. I believe the proper random statement should look like: block, genotype nested within group. I came up with the following code: fit1 <- lmer(weight ~ group*climate + (1|block) + (1|group/genotype) , data=df) The problem I have now is how can I include the fact that I measured the same genotypes at three different times? Can I say (1|group/genotype/id) instead of (1|group/genotype)? Thanks for any comments on this! Stefan
lmer model for repeated measure in RCB design
3 messages · Ben Bolker, Schreiber, Stefan
Schreiber, Stefan <Stefan.Schreiber at ...> writes:
Hi all, I have a questions about the following situation and was hoping to find clarification here. I have a data frame with the following variables: id, genotype, group, block, climate, response I measured a response of 7 genotypes in a randomized complete block design. I measured each genotype 8 times (n=48).
You have some missing combinations? (8*7=56, right?)
I grouped my 7 genotypes into 3 for me more reasonable groups. I measured the response on the same 7 genotypes 3 times under different climatic conditions. I specified block and genotype as random and group as fixed. I believe the proper random statement should look like: block, genotype nested within group. I came up with the following code: fit1 <- lmer(weight ~ group*climate + (1|block) + (1|group/genotype) , data=df) The problem I have now is how can I include the fact that I measured the same genotypes at three different times? Can I say (1|group/genotype/id) instead of (1|group/genotype)?
Is id a unique identifier for each observation? In that case it's definitely redundant with the residual variance and should not be included in the model statement. I'm still a little bit uncertain about your experimental design (thanks for the careful explanation, though). I'm going to make up one possible explanation. How unbalanced is it? Does climate represent another level of replication (e.g. are there three climate conditions that are measured for each group*genotype*block combination), or does it vary in an unbalanced way across group*genotype*block combinations? Would your total number of observations be 8 (blocks) * 7 (genotypes) * 3 (climate conditions)? You shouldn't include group both as a fixed effect (your fixed group*climate term expands to group+climate+group:climate) and a random effect (your group/genotype term expands to group+group:genotype). You should probably use (1|group:genotype) instead (make sure group and genotype are both stored as factors). Even if it weren't redundant, including a random effect of group (with only three groups) is likely to give you an estimated group-level variance of zero -- there aren't enough levels to estimate variance reliably. If genotypes have unique IDs then you don't need the explicit nesting or interaction syntax. If so, my best guess is that weight ~ group*climate + (1|block) + (1|genotype) is what you want. You might consider whether it's worth including other random terms -- the most complex model would include (group*climate|block) and (climate|genotype) -- but you might find that you were running out of signal ...
An embedded and charset-unspecified text was scrubbed... Name: not available URL: <https://stat.ethz.ch/pipermail/r-sig-mixed-models/attachments/20120111/d913c6e7/attachment-0001.pl>