Very Large Data Sets
Tony Fagan asks:
List,
Sir,
Can R handle very large data sets (say, 100 million records) for data mining applications?
The question assumes that the data handling capacity is a property of the software alone, which is nonsense. It is partly a property of the software, partly of what you want to do with the records, but mostly of the system on which it is run.
My understanding is that Splus can not, but SAS can easily.
Try handling 100 million records with SAS (or anything else) on a 486 and see how easily it does it. More seriously, the consensus is that on the same modern system SAS is usually better able to handle large, dumb calculations than S-PLUS, which is (generally) better than R. Horses for courses. Bill Venables.
-----------------------------------------------------------------
Bill Venables, Statistician, CMIS Environmetrics Project.
Physical address: Postal address:
CSIRO Marine Laboratories, PO Box 120,
233 Middle St, Cleveland, Queensland Cleveland, Qld, 4163
AUSTRALIA AUSTRALIA
Telephone: +61 7 3826 7251 Email: Bill.Venables at cmis.csiro.au
Fax: +61 7 3826 7304
-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-
r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html
Send "info", "help", or "[un]subscribe"
(in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch
_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._