Skip to content
Back to formatted view

Raw Message

Message-ID: <199912230605.RAA09007@snowy.nsw.cmis.CSIRO.AU>
Date: 1999-12-23T06:05:23Z
From: Bill Venables
Subject: Very Large Data Sets
In-Reply-To: Your message of "Wed, 22 Dec 1999 22:38:30 MST." <000801bf4d07$f3441b60$af2c0e3f@hal> 

Tony Fagan asks:

> List,

Sir,

> Can R handle very large data sets (say, 100 million records) for data 
> mining applications? 

The question assumes that the data handling capacity is a
property of the software alone, which is nonsense.  It is partly
a property of the software, partly of what you want to do with
the records, but mostly of the system on which it is run.

> My understanding is that Splus can not, but SAS can easily.

Try handling 100 million records with SAS (or anything else) on a
486 and see how easily it does it.

More seriously, the consensus is that on the same modern system
SAS is usually better able to handle large, dumb calculations
than S-PLUS, which is (generally) better than R.  Horses for
courses.

Bill Venables.
-- 
-----------------------------------------------------------------
Bill Venables, Statistician, CMIS Environmetrics Project.

Physical address:                            Postal address:
CSIRO Marine Laboratories,                   PO Box 120,       
233 Middle St, Cleveland, Queensland         Cleveland, Qld, 4163
AUSTRALIA                                    AUSTRALIA

Telephone: +61 7 3826 7251     Email: Bill.Venables at cmis.csiro.au

      Fax: +61 7 3826 7304


-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-
r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html
Send "info", "help", or "[un]subscribe"
(in the "body", not the subject !)  To: r-help-request at stat.math.ethz.ch
_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._