Skip to content

Estimating the standard error when you have sampling weights.

4 messages · Robert Wilkins, David Winsemius, Peter Dalgaard +1 more

#
I am having difficulty thinking that you cannot find general material  
by doing a Google search, but can tell you from memory that the US  
National Center for Health Statistics publishes on the WWW quite a bit  
of information about their survey methods.

For an R-centric answer: Have you looked at the survey package that  
Lumley created?

Doing  help.search("sampling") I also see that wtd.mean is available  
in Harrell's Hmisc. The help page for that function also has useful  
links.
#
David Winsemius wrote:
However, be very careful to note that there are

- frequency weights ("I have n of these")
- variance weights ("This is (like) the average of n obs")
- sampling weights ("In reality, there are n times more of these")

and the formula for the weighted mean may be the same, but those of the
SD or the SEM are quite different.

-pd

  
    
#
On Mon, 24 Nov 2008, Robert Wilkins wrote:

            
Alan Zaslavsky keeps a comprehensive list of software for complex surveys, 
at
http://www.hcp.med.harvard.edu/statistics/survey-soft/

Although I'm biased, I think the 'survey' package in R is better than 
either SPSS or SAS for this purpose. The main competition would be Stata 
and the specialised packages such as SUDAAN and WesVar.

If you just want means and proportions, though, any of the software would 
be perfectly adequate.

The PEAS project at Napier University has some nice introductory material, 
although some of their software comparisons are a bit out of date:
http://www.napier.ac.uk/depts/fhls/peas/

 	-thomas

Thomas Lumley			Assoc. Professor, Biostatistics
tlumley at u.washington.edu	University of Washington, Seattle