Hi all,
I am working with a patient database of 70K HIV-infected individuals
followed over time since treatment initiation, with 500K total observations
that include a laboratory measurement (CD4 cell count, an indicator of
immunocompetence). I'm trying to use a GAMM to model the CD4 trajectory as a
function of CD4 at treatment initiation (i.e. the y-intercept) and other
covariate classes (sex, age, etc.). So far I've struggled to fit GAMMs to
the entire data set.
I'm modelling log(CD4 + 1) with a gaussian family (identity link) for now.
With gamm, the call looks like this:
form <- as.formula('log(cd4 + 1) ~ sex + s(ayfu, by = CD4_cat_init, bs = "tp")')
print(system.time(tg1 <- gamm(form, data = nd, order.groups = FALSE,
                              family = gaussian, random = list(PatientID = ~1))))
where ayfu is time since treatment initiation and CD4_cat_init is the CD4
count at treatment initiation broken into 5 categories.
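For concreteness, a 5-level baseline factor like that can be built with
cut(); the variable name and breakpoints below are illustrative placeholders,
not my actual cutoffs:

# Illustrative only: cd4_init and these breakpoints are placeholders
nd$CD4_cat_init <- cut(nd$cd4_init,
                       breaks = c(0, 50, 100, 200, 350, Inf),
                       include.lowest = TRUE)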
I ran that on a large-memory (1 TB) node on our HPC cluster and, after 12
hours using 300-500 GB of memory, it crashed:
Error in print(system.time(tg1 <- gamm(form, data = nd, order.groups =
error in evaluating the argument 'x' in selecting a method for
function 'print': Error in cbind(X1, X[[i]][, j] * X0) :
long vectors not supported yet: bind.c:1301
Calls: system.time ... extract.lme.cov2 -> cbind ->
tensor.prod.model.matrix -> cbind
Google tells me this has to do with the 2^31 - 1 element limit on ordinary
(non-long) R vectors and matrices. But I don't totally follow how that limit
is being hit inside the gamm call.
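Back-of-the-envelope (and this is just my guess at the mechanism, assuming
the failing cbind() is assembling a dense matrix with one row per
observation), the sizes do get there quickly:

n_obs <- 5e5            # total observations
(2^31 - 1) / n_obs      # ~4295 columns before a dense matrix needs a long vector
n_obs * 4295 * 8 / 1e9  # ~17 GB for one such double-precision matrix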
I'm now trying out cubic regression splines (bs = 'cs' instead of 'tp') with
gamm and also with gamm4 (the gamm4 call is sketched below). Running the
code on subsets of the data (1K individuals) suggests only a mild
improvement from 'cs' with either package, and a *decrease* in speed using
gamm4 instead of gamm. The latter surprises me, since I had thought gamm4
was meant to be faster when the number of random effects is large.
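For reference, the gamm4 version I'm testing looks roughly like this (gamm4
takes an lme4-style random-effects formula rather than gamm's list):

library(gamm4)
tg4 <- gamm4(log(cd4 + 1) ~ sex + s(ayfu, by = CD4_cat_init, bs = "cs"),
             random = ~ (1 | PatientID),
             family = gaussian, data = nd)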
Eventually I'd like to use smoother-by-group interactions other than
CD4_cat_init (i.e. sex, age, etc.) and test whether trajectories are
significantly different between covariate classes using AIC (a sketch of the
kind of comparison I have in mind is below). It would also be nice to
characterize how variable individuals' trends are within a covariate class,
though I'm not exactly sure of the best way to do that.
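To make the AIC idea concrete, I'm imagining something like the following
once a full fit is feasible (hypothetical model names, comparing the lme
components of the two gamm fits):

# Shared smooth vs. sex-specific smooths (hypothetical sketch)
m_shared <- gamm(log(cd4 + 1) ~ sex + s(ayfu, bs = "cs"),
                 family = gaussian, random = list(PatientID = ~1), data = nd)
m_by_sex <- gamm(log(cd4 + 1) ~ sex + s(ayfu, by = sex, bs = "cs"),
                 family = gaussian, random = list(PatientID = ~1), data = nd)
AIC(m_shared$lme, m_by_sex$lme)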
But until I can get just one of these models to fit, these goals seem like
a long shot. I've struggled to find much documentation online regarding
fitting GAMMs to such large data sets, particularly ones with so many random
effects. Hence the trial-and-error exploration of different splines &
packages. Does anyone have more concrete guidance on how to approach this
problem or helpful documentation? Help much appreciated!
Thanks,
Steve
Steve Bellan, PhD, MPH
Post-doctoral Researcher
Lauren Ancel Meyers Research Group
Center for Computational Biology and Bioinformatics
University of Texas at Austin
http://www.bio.utexas.edu/research/meyers/steve_bellan/