which rows are duplicates?

Michael Dewey · 2009-03-30T10:51:29Z

At 05:07 30/03/2009, Aaron M. Swoboda wrote:
>I would like to know which rows are duplicates of each other, not
>simply that a row is duplicate of another row. In the following
>example rows 1 and 3 are duplicates.
>
>  x y z
>1 1 2 3
>2 3 4 4
>3 1 2 3


Does this do what you want?
 > x <- c(1,3,1)
 > y <- c(2,4,2)
 > z <- c(3,4,3)
 > data <- data.frame(x,y,z)
 > data.u <- unique(data)
 > data.u
   x y z
1 1 2 3
2 3 4 4
 > data.u <- cbind(data.u, set = 1:nrow(data.u))
 > merge(data, data.u)
   x y z set
1 1 2 3   1
2 1 2 3   1
3 3 4 4   2

You need to do a bit more work to get them back into the original row 
order if that is essential.
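One possible way to do that extra work, as a rough sketch rather than the only approach: tag each row with its original position before the merge, then sort on that tag afterwards. The column name orig.row and the object names merged and groups are made up for this illustration and are not part of the original example.

```r
x <- c(1, 3, 1)
y <- c(2, 4, 2)
z <- c(3, 4, 3)
data <- data.frame(x, y, z)

# Number the distinct rows, as in the transcript above
data.u <- unique(data)
data.u <- cbind(data.u, set = 1:nrow(data.u))

# Remember where each row started (orig.row is a hypothetical helper column)
data$orig.row <- seq_len(nrow(data))

# merge() joins on the shared columns x, y and z, and sorts by them
merged <- merge(data, data.u)

# Put the rows back in their original order
merged <- merged[order(merged$orig.row), ]

# List the original row numbers that belong to each set
groups <- split(merged$orig.row, merged$set)
```

With the example data, groups should hold rows 1 and 3 under set 1 and row 2 under set 2, which says directly which rows are duplicates of each other.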

Michael Dewey
http://www.aghmed.fsnet.co.uk
