
multicore by(), like mclapply?

I could be way off base here, but my concern about presplitting the data is that you will have your data plus a second copy of your data, something like a list where each element contains the portion of the data for that split. Good speed-wise, bad memory-wise. My hope with the technique I showed (again, I may not have accomplished it) was to have at any one time only the original data and a copy of the particular elements being worked with. Of course, this is not an issue if you have plenty of memory.
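
To make the memory point concrete, here is a minimal sketch of the idea (not necessarily the code from earlier in the thread): split the row indices instead of the data, and let each worker subset only the rows it needs. The data frame `df` and its columns `g` and `x` are made-up names for illustration, and `mean()` stands in for whatever per-group function you would pass to by().

```r
library(parallel)  # mclapply() is in 'parallel' in current R; in 2011 it came from 'multicore'

# Hypothetical example data: 'df', 'g' and 'x' are illustrative names only.
df <- data.frame(g = rep(c("a", "b", "c"), each = 4), x = 1:12)

# Split the row indices rather than the data frame itself,
# so no second full copy of the data is built up front.
idx <- split(seq_len(nrow(df)), df$g)

# Each call subsets only the rows for its own group when it runs.
# mc.cores = 1 keeps the sketch runnable everywhere (forking is not
# available on Windows); raise it on Unix-alikes to run in parallel.
res <- mclapply(idx, function(i) mean(df$x[i]), mc.cores = 1)
```

The index list is still an extra object, but it holds one integer per row rather than a copy of every column, so it should be much smaller than a presplit copy of a wide data frame.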
On Oct 10, 2011, at 12:19, Thomas Lumley <tlumley at uw.edu> wrote: