unique mismatch in R and Excel
We answered this on StackOverflow already. Excel's "remove duplicates" was doing case-insensitive duplicate matching, while R's unique() is case-sensitive.
http://stackoverflow.com/questions/20759346/counting-unique-values-in-r-and-excel/20759523#20759523

Barry
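For anyone who wants to check this on their own data, here is a minimal sketch of the comparison. The vector below is made up for illustration (the original CSV never reached the list), and the variable names are my own. It assumes the only difference between the two counts is letter case.

```r
# Hypothetical stand-in for the user_name column; the real file was not attached.
user_name <- c("alice", "Alice", "ALICE", "bob", "Bob", "carol")

# R compares strings case-sensitively, so every spelling counts separately.
n_case_sensitive <- length(unique(user_name))

# Folding to lower case first mimics a case-insensitive comparison.
n_case_insensitive <- length(unique(tolower(user_name)))

# Group the original spellings by their lower-cased form and keep only the
# groups that contain more than one distinct spelling.
spellings <- split(user_name, tolower(user_name))
collisions <- Filter(function(v) length(unique(v)) > 1, spellings)

print(n_case_sensitive)
print(n_case_insensitive)
print(collisions)
```

If the case-folded count on the real column comes out at 147, the names listed in collisions are the entries that differ only by case.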
On Tue, Dec 24, 2013 at 5:43 PM, David Winsemius <dwinsemius at comcast.net> wrote:
On Dec 24, 2013, at 1:08 AM, Koushik Saha wrote:
I have a weird problem. I want to count the unique entries in a certain column. I have attached my CSV file here.
Files named with extension .csv do not typically make it through the R-help mail server.
I am doing this to get the unique entries in the column:
dat<-read.csv("C:/Project/Gawk-scripts/Book1.csv")
names(dat)<-c("user_name")
unique(dat$user_name)
The result says I have 170 unique values.
But when I use "remove duplicate entries" in Excel, I get 147 unique
entries in the column.
Can anyone explain why the results do not match, or am I doing
something wrong?
Rename the file to have an extension of .txt. Then your mail client will probably label it correctly as a MIME-TEXT file.
--
David Winsemius
Alameda, CA, USA