duplicates() function

Hadley Wickham · 2011-04-08T15:13:26Z

On Fri, Apr 8, 2011 at 9:59 AM, Duncan Murdoch wrote: > I need a function which is similar to duplicated(), but instead of returning > TRUE/FALSE, returns indices of which element was duplicated. ?That is, > >> x > duplicated(x) > [1] FALSE FALSE ?TRUE FALSE TRUE > >> duplicates(x) > [1] NA NA ?1 NA ?2 > > (so that I know that element 3 is a duplicate of element 1, and element 5 is > a duplicate of element 2, whereas the others were not duplicated a

Hadley Wickham

Fri, Apr 8, 2011 8:13 AM

On Fri, Apr 8, 2011 at 9:59 AM, Duncan Murdoch <murdoch.duncan at gmail.com> wrote:

I'd think of making it a lookup table.  The basic idea is

split(seq_along(x), x)

but there are probably much faster ways of doing it, depending on what
you need.  But for efficiency, you probably need a hashtable
somewhere.

Hadley

Assistant Professor / Dobelman Family Junior Chair
Department of Statistics / Rice University
http://had.co.nz/

duplicates() function

Thread (12 messages)