Skip to content
Prev 17239 / 20628 Next

GLMM with many and highly correlated features

Dear John,

As I said before, a GLMM is out of the question due to complete
separation. Hence a simple GLM would be sufficient.

IMHO you first need to reduce the dimensionalty of the covariates from
350 down to 3(!), and then fit the GLM. Using the response in this
selection is cheating.

You could use the first 3 PCA axes to reduce the dimensionality. But
the interpretation of those axis would be hard.

Best regards,

ir. Thierry Onkelinx
Statisticus / Statistician

Vlaamse Overheid / Government of Flanders
INSTITUUT VOOR NATUUR- EN BOSONDERZOEK / RESEARCH INSTITUTE FOR NATURE
AND FOREST
Team Biometrie & Kwaliteitszorg / Team Biometrics & Quality Assurance
thierry.onkelinx at inbo.be
Havenlaan 88 bus 73, 1000 Brussel
www.inbo.be

///////////////////////////////////////////////////////////////////////////////////////////
To call in the statistician after the experiment is done may be no
more than asking him to perform a post-mortem examination: he may be
able to say what the experiment died of. ~ Sir Ronald Aylmer Fisher
The plural of anecdote is not data. ~ Roger Brinner
The combination of some data and an aching desire for an answer does
not ensure that a reasonable answer can be extracted from a given body
of data. ~ John Tukey
///////////////////////////////////////////////////////////////////////////////////////////



Op ma 3 dec. 2018 om 11:42 schreef <j.zavrakidis at nki.nl>: