
R vs. S-PLUS vs. SAS

On Sat, Dec 04, 2004 at 07:15:40AM -0500, Andrew Piskorski wrote:
Very good point, but you didn't CC Joe. Done now. Hi Joe :)
Discount my $0.02 severely enough as I don't really know what I am rambling
about, but here it goes anyway as talk is so cheap:

S implementations are from a 'workstation' design era. Data objects are in
RAM.  As Pat mentioned in this thread, they used to be way less efficient
than they are now. R made huge leaps. I haven

Our friendly listmembers from Insightful may want to complement me here with
factual data :)

SAS reflects its mainframe-age design, i.e. pass (efficiently) over huge
amounts of data that could never have been held in memory anyway.

The interactive/exploratory/graphical nature of S versus the
batch/non-interactive/non-graphical nature of SAS follows relatively
cleanly from that basic design premise.

Fei Chen, a doctoral student of Brian Ripley, gave a truly impressive
presentation at DSC 2003 about out-of-memory work with R. I bugged Brian
repeatedly about writeups on this, but apparently there are none. Fei now is
a professional data miner on truly gigantic data sets ...

It can be done, but it requires surgery on the engine.  For someone really
committed, it may be worth digging up Fei Chen's dissertation.  Might even
be a market niche for Insightful to explore. 

Dirk