Skip to content
Prev 16127 / 20628 Next

Different results for between/within groups and within group regression analyses

I don't think this set of questions was ever addressed in any of the subsequent threads on this or other forums .... and so I'll give you a quick non answer of bullet points:


* "significance" should not be the driving force in whether or not an estimate is correct. While it does tell you something about the relative error in that estimate, but it tells you very little about how well that estimate describes the data or predicts future data.


* Much as it is difficult and potentially problematic to interpret a main effect in the presence of interactions (after all, by definition, an interaction implies that the main effect changes conditional on another variable), it is also problematic to intentionally exclude a known interaction to better estimate the main effect.


* However, this above advice has to be tempered by the issues related to statistical power, the bias-variance tradeoff and over- vs. underfitting -- a lot of preprints, journal articles and recent textbooks by members of this list and their collaborators have tried to address different aspects of this in the context of mixed models, but the issues are fundamentally the same for *all* statistical models. My advice is to find the parsimonious model that best describes the data (so the best model according to AIC or BIC) and look at the predictions that model makes and derive your inferences from that, instead of getting hung up on the significance of any one coefficient. The effects package can be particularly useful here, especially for plotting these things.


Phillip
On 13/01/18 23:07, Luca Danieli wrote:
Hi Phillip,

sorry if I ask you a question.
In this moment I have a 3x4x8 matrix, where '3' is the number of groups, '8' the number of tests, and '4' the levels of the potential main effect.

Following your explanation, I was before thinking that leaving interactions out of the models would give you a better approximation of the main effect. But now that I read it again, I am unsure about it.

In my case, the '4' levels are nested in each set.
If I write lmer(Score ~ pot_ME + random effects) I have no statistical significance.
If I write lmer(Score ~ pot_ME*groups + random effects) I have statistical significance for the main effect (p<0.05) and a strong interaction (p<.001).
If I write lmer(Score ~ pot_ME*groups*tests + random effects) I have no statistical significance nor interactions.

What approach is the more correct to get information about the presence of a main effect?

(My parameters are not-continuous)

Best
Luca