
Hmisc package: deff() command's formula for the design effect

On Wed, 6 May 2009, jjh21 wrote:

            
The formula in Hmisc is correct (provided the within-cluster correlation 
doesn't vary with the cluster size).  The variance of a sum is the sum of 
all the variances and covariances of its terms.  A cluster of size k has 
k^2-k covariances between its members, so the sum contains sum(k^2-k) 
covariance terms over all the clusters, in addition to the sum(k) 
variance terms.
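
Written out, the counting argument gives the design effect directly. This is only a sketch under assumptions not stated in the thread: every observation has the same variance \sigma^2, every pair within a cluster has the same correlation \rho, different clusters are uncorrelated, and the symbols k_i (size of cluster i) and n (total sample size) are notation introduced here.

```latex
\begin{align*}
\operatorname{Var}\Big(\sum_{j=1}^{n} y_j\Big)
  &= \sigma^2 \sum_i k_i \;+\; \rho\,\sigma^2 \sum_i \big(k_i^2 - k_i\big)
   = n\,\sigma^2 \big[\, 1 + \rho\,(B - 1) \,\big],
\\[4pt]
B &= \frac{\sum_i k_i^2}{\sum_i k_i}, \qquad n = \sum_i k_i .
\end{align*}
% Under independence the variance of the sum is n * sigma^2, so the ratio is
\[
\text{deff} \;=\; 1 + \rho\,(B - 1).
\]
```

When every cluster has the same size, B is just that common size; with unequal sizes B is a size-weighted average and is larger than the plain mean cluster size.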

Another way to think of it is that the larger clusters get too much 
weight, so in addition to the rho*(B-1) factor that you would have for 
equal-sized clusters of size B there is a further loss of efficiency 
from that over-weighting.
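
A small numeric check of this point, as a Python sketch rather than the Hmisc code. The function names, the value rho = 0.1 and the two sets of cluster sizes are made up for illustration, and a common variance and a common within-cluster correlation are assumed. It compares equal and unequal clusters with the same mean size, once using B = sum(k^2)/sum(k) and once by adding up every variance and covariance term.

```python
def design_effect(sizes, rho):
    """1 + rho*(B - 1), with B = sum(k^2)/sum(k) (size-weighted mean cluster size)."""
    n = sum(sizes)
    b = sum(k * k for k in sizes) / n
    return 1 + rho * (b - 1)


def design_effect_by_counting(sizes, rho):
    """Same quantity, obtained by summing every variance and covariance term."""
    # One cluster label per observation, e.g. sizes [2, 1] -> [0, 0, 1].
    labels = [c for c, k in enumerate(sizes) for _ in range(k)]
    total = 0.0
    for i, ci in enumerate(labels):
        for j, cj in enumerate(labels):
            if i == j:
                total += 1.0   # variance term (variance scaled to 1)
            elif ci == cj:
                total += rho   # covariance between two members of one cluster
    # Under independence the variance of the sum would be n (in the same units).
    return total / len(labels)


rho = 0.1                  # hypothetical within-cluster correlation
equal = [10, 10, 10, 10]   # mean cluster size 10
unequal = [2, 2, 2, 34]    # same mean size, very uneven

for sizes in (equal, unequal):
    print(sizes, design_effect(sizes, rho), design_effect_by_counting(sizes, rho))
```

With these made-up numbers the equal-sized clusters give a design effect of about 1.9 and the unequal ones about 3.82, even though the mean cluster size is 10 in both cases; the difference is the extra loss from the one large cluster.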

 	-thomas