Skip to content
Prev 70456 / 398525 Next

read.spss in R 2.1.0 & make basic dataframe

On Thu, 26 May 2005, Bliese, Paul D LTC USAMH wrote:
Actually, most of this is me rather than Saikat.

I use use.value.labels=TRUE most of the time.  The main point of 
to.data.frame=TRUE is that it is quite a lot faster for large files, 
especially if you are going to use only a few of the variables. I think 
Brian Ripley spoke up in favour of it for this reason last time the issue 
was raised.

The reason I made trim.factor.names=FALSE the default was backwards 
compatibility, but it probably makes sense to switch it at some point.

Incidentally, PSPP (the original source of the code) now has a version 
that reads long variable names from post-version 12 SPSS files. This 
confirms that the "unrecognised record type 7, subtype 13" message really 
is due to long variable names and so is harmless.  It also means that 
anyone who wants long variable names badly enough could work out a patch.


 	-thomas