Skip to content

read.spss in R 2.1.0 & make basic dataframe

3 messages · Bliese, Paul D LTC USAMH, Thomas Lumley, Uwe Ligges

#
On Thu, 26 May 2005, Bliese, Paul D LTC USAMH wrote:
Actually, most of this is me rather than Saikat.

I use use.value.labels=TRUE most of the time.  The main point of 
to.data.frame=TRUE is that it is quite a lot faster for large files, 
especially if you are going to use only a few of the variables. I think 
Brian Ripley spoke up in favour of it for this reason last time the issue 
was raised.

The reason I made trim.factor.names=FALSE the default was backwards 
compatibility, but it probably makes sense to switch it at some point.

Incidentally, PSPP (the original source of the code) now has a version 
that reads long variable names from post-version 12 SPSS files. This 
confirms that the "unrecognised record type 7, subtype 13" message really 
is due to long variable names and so is harmless.  It also means that 
anyone who wants long variable names badly enough could work out a patch.


 	-thomas
#
The main problem you are experiencing is that edit() (more precisely the 
method edit.data.frame()) is a bit restricted - I think contributions 
are welcome.
Note that coding must be done very careful here (and is not trivial at 
all) in order to deal with different kinds of attributes, in particular 
names and factor stuff.

Uwe Ligges
Bliese, Paul D LTC USAMH wrote: