sorting without order
Hi Marc,
continuing on Prof. Dalgaard's proposal, you could use:
ix <- unlist(split(seq(along=v), v), use.names=FALSE)
but even with this, `sort()' seems faster if you are interseted only
in grouping:
v <- sample(1:25000, 50000, TRUE)
######
system.time(ix <- do.call("c",split(seq(along=v),v)), gcFirst=TRUE)
[1] 0.13 0.00 0.13 NA NA
system.time(ix <- unlist(split(seq(along=v), v), use.names=FALSE),
gcFirst=TRUE)
[1] 0.06 0.00 0.07 NA NA
system.time(x <- sort(v), gcFirst=TRUE)
[1] 0.01 0.00 0.02 NA NA
I hope it helps.
Best,
Dimitris
----
Dimitris Rizopoulos
Ph.D. Student
Biostatistical Centre
School of Public Health
Catholic University of Leuven
Address: Kapucijnenvoer 35, Leuven, Belgium
Tel: +32/16/336899
Fax: +32/16/337015
Web: http://www.med.kuleuven.ac.be/biostat
http://www.student.kuleuven.ac.be/~m0390867/dimitris.htm
----- Original Message -----
From: "Marc Mamin" <M.Mamin at intershop.de>
To: <r-help at stat.math.ethz.ch>
Sent: Tuesday, November 23, 2004 10:58 AM
Subject: [R] sorting without order
Hello, In order to increase the performance of a script I'd like to sort very large vectors containing repeated integer values. I'm not interesting in having the values sorted, but only grouped. I also need the equivalent of index.return from the standard "sort" function: f(c(10,1,10,100,1,10)) => grouped: c(10,10,10,1,1,100) ix: c(1,3,6,2,5,4) is there a way to achieve this which would be faster than the standard sort function? Thanks for any hints, Marc Mamin
______________________________________________ R-help at stat.math.ethz.ch mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html