bigest part of vector

Stavros Macrakis · 2009-02-24T22:12:24Z

On Tue, Feb 24, 2009 at 3:01 PM, Bert Gunter <gunter.berton at gene.com> wrote:
> Nothing wrong with prior suggestions, but strictly speaking, (fully) sorting
> the vector is unnecessary.
>
> y[y > quantile(y, 1- p/length(y))]
>
> will do it without the (complete) sort. (But sorting is so efficient anyway,
> I don't think you could notice any difference).

R uses an efficient quantile calculation, so it is significantly
faster for large data sets:

   user  system elapsed
   0.56    0.14    0.70

   user  system elapsed
   0.75    0.10    0.84

   user  system elapsed
   0.61    0.08    0.68

   user  system elapsed
   1.08    0.08    1.17

   user  system elapsed
   4.67    0.03    4.72

   user  system elapsed
   4.71    0.10    4.82
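
A comparison of this kind can be sketched as follows. The vector y, the count p, and the names top_sort and top_quant are assumptions made for illustration, not the data or commands behind the timings above. Here p is taken to be the number of largest elements wanted, as in the quoted expression, and the timings will differ by machine and R version.

```r
# Hypothetical test data; not the vector used for the timings in this message.
set.seed(1)
y <- rnorm(1e6)
p <- 1000                      # assumed: number of largest elements wanted

# Approach 1: sort the whole vector, then keep the last p values.
t_sort <- system.time(top_sort <- tail(sort(y), p))

# Approach 2: compute a threshold with quantile() and filter,
# which avoids a complete sort.
t_quant <- system.time(top_quant <- y[y > quantile(y, 1 - p / length(y))])

print(t_sort)
print(t_quant)

# With no tied values, both approaches select the same p elements
# (the quantile version returns them in their original order).
length(top_quant)
```

The quantile version returns the selected values unsorted, so apply sort() to the result if order matters.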

Surprisingly, perhaps, "order" is much slower than "sort":

   user  system elapsed
  21.07    0.05   21.14
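
To try the "order" versus "sort" comparison yourself, something like the following should do. Again, y and p are hypothetical stand-ins rather than the original data, and the relative timings depend heavily on the R version and machine, so the gap may look quite different from the figure above.

```r
# Hypothetical test data; not the vector used for the timing in this message.
set.seed(1)
y <- rnorm(1e6)
p <- 1000                      # assumed: number of largest elements wanted

# sort() returns the values themselves, largest first.
t_sort <- system.time(by_sort <- sort(y, decreasing = TRUE)[1:p])

# order() returns a permutation of indices, which is then used to subset y.
t_order <- system.time(by_order <- y[order(y, decreasing = TRUE)[1:p]])

print(t_sort)
print(t_order)
```

The order() form is the one to use when you need the positions of the largest elements and not just their values.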

And you do need to be careful about your handling of ties:

[1] 4 4 4

[1] 4 4 4
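
A small illustration of how ties can change the answer, using a made-up vector (not necessarily the one that produced the output above). With p = 2, the sort-based approach returns exactly two values, while the quantile threshold lands on the tied value itself, so a strict ">" drops all of the ties and ">=" keeps all of them.

```r
# Hypothetical vector with a three-way tie at the top.
y <- c(1, 2, 3, 4, 4, 4)
p <- 2                         # assumed: number of largest elements wanted

# Sorting always returns exactly p values.
by_sort <- tail(sort(y), p)
by_sort                        # [1] 4 4

# The threshold falls between two tied values, so it equals 4.
q <- quantile(y, 1 - p / length(y))

# Strict comparison excludes every tied value.
strict <- y[y > q]
length(strict)                 # [1] 0

# Non-strict comparison includes every tied value, more than p of them.
loose <- y[y >= q]
loose                          # [1] 4 4 4
```

Which of these is right depends on whether you want exactly p elements or every element tied with the p-th largest.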

Hope this helps.

        -s
