Skip to content
Prev 106806 / 398525 Next

Does SQL group by have a heavy duty equivalent in R

On Sun, 31 Dec 2006, Farrel Buchinsky wrote:

            
Why not use  duplicated() ?

For a data.frame with 200 rows of which about 50 are duplicates and 201 
columns finding the (non) duplicates takes little time on my year old AMD 
64 running Windows XP:
[1] 0.03 0.00 0.03   NA   NA
Finding the non-duplicated rows for which there is at least one 
replication:
[1] 0.05 0.00 0.05   NA   NA
Charles C. Berry                        (858) 534-2098
                                          Dept of Family/Preventive Medicine
E mailto:cberry at tajo.ucsd.edu	         UC San Diego
http://biostat.ucsd.edu/~cberry/         La Jolla, San Diego 92093-0717