Skip to content
Prev 3269 / 5632 Next

[R-meta] When to skip an extra level?

Dear Tim,

The question generally is when it makes sense to leave out a level if the data could be regarded as having a hierarchical structure (which is modeled in terms of nested random effects along the lines of '~ 1 | var1/var2/var3/...') and if so, which level(s) to leave out.

I don't think there is any general consensus on this or even much empirical evidence to back up any particular approach. However, in general, I would say that if the number of units at a particular level is very similar to the number of units at one level below it (e.g., there are 199 papers and 200 studies - so one paper describes two studies while the remaining 198 papers describe one study  -- to make the example from the second link even more extreme), then it becomes very difficult to distinguish the variances at those two levels and I would consider dropping one of the two levels. I don't have any super strong feelings on whether to then drop the upper (paper) or lower (study) level -- in the extreme scenario above, it is unlikely to matter. Dropping the paper level would treat the two studies from that one paper as independent. Dropping the study level would assume that the average true effects (averaged over whatever lower levels there are in the hierarchy below 'studies') in those two studies from that one paper are homogeneous. Neither is (probably) correct.

I cannot tell you where the exact point is (in terms of # of papers versus # of studies) where I would start to consider dropping a level.

Best,
Wolfgang