Problems replacing empty cells with NA values
Do 'str(data)' to see what the structure of the data is. Are the columns with <NA> character or factors? Is this what you are seeing:
x <- read.table(textConnection('123 456 a
+ NA 1 b + 2 NA c + 3 4 NA + 3 2 1'))
closeAllConnections() x
V1 V2 V3 1 123 456 a 2 NA 1 b 3 2 NA c 4 3 4 <NA> 5 3 2 1
str(x)
'data.frame': 5 obs. of 3 variables: $ V1: int 123 NA 2 3 3 $ V2: int 456 1 NA 4 2 $ V3: Factor w/ 4 levels "1","a","b","c": 2 3 4 NA 1
sum(is.na(x$V3))
[1] 1
On Fri, Jan 28, 2011 at 12:28 PM, Simone Santoro <miseno77 at hotmail.com> wrote:
Hi,
I am preparing a quite huge database in Excel, I replaced the empty cells with "NA", I formatted it like text, and saved the file like a *.txt.
After, in R:
data<-read.table("myfile.txt",header=T,sep="\t")
edit(data)
When doing this I can see that some columns are OK and they have NA cells, but other columns are not (there are empty cells where there should be NAs).
Surprisingly (at least to me), when I close the "edit window" I can see all the data in the "R console" (automatically closing that windows makes starting the visualization of those data in the same R console window) and there are not empty cells but, nevertheless, some columns have NA values but other ones have <NA> values.
Why?
Could it possibly cause any problem to the analysis?
Thanks
Best wishes
Simone
? ? ? ?[[alternative HTML version deleted]]
______________________________________________ R-help at r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Jim Holtman Data Munger Guru What is the problem that you are trying to solve?