Skip to content

[R-gui] your favourite spreadsheet-like data viewer or editor?

6 messages · Tom Aldenberg, Peter Dalgaard, Frank E Harrell Jr +2 more

#
On Nov 11, 2010, at 12:39 , Tom Aldenberg wrote:

            
Only to the extent that you believe that Excel provides the definitive and reliable approach to data handling. There are people who disagree with that, e.g. those who have accidentally gotten the student number included in the grade point average, overwritten a formula cell with a constant, etc. 

Notice that other systems with "spreadsheet-like" viewers/editors, quite deliberately do not include full spreadsheet functionality. E.g., rather than have users enter a formula cell and replicate it along a column, SAS JMP allows you to attach a "formula property" to a column, which (other advantages and disadvantages of that system untold) strikes me as a considerably more sane approach to data transformations. 

Also, of course, RExcel is Windows-only and requires that you have Excel installed.
#
On Thu, Nov 11, 2010 at 05:39:16AM -0600, Tom Aldenberg wrote:
Yes; that's an overkill for this situation.
Frank

  
    
#
On Fri, Nov 12, 2010 at 10:17 AM, Erich Neuwirth
<erich.neuwirth at univie.ac.at> wrote:
There are other reasons [1] why one may want to avoid doing statistics
in Excel, or even trusting it with one's data. My favourite one is the
properties of unary minus. In R,
[1] -9
[1] -9
[1] -9

Try the same expressions in OOo or Excel, and you may be surprised by
getting different results: sometimes positive and others negative.
There's even a fortune:
Thomas Lumley: The precedence of ^ is higher than that of unary minus. It may
be surprising, [...]
Herv? Pag?s: No, it's not surprising. At least to me... In the country where I
grew up, I've been teached that -x^2 means -(x^2) not (-x)^2.
   -- Thomas Lumley and Herv? Pag?s (both explaining that operator precedence
      is working perfectly well)
      R-devel (January 2006)

Another turn-off is the potential loss of precision just from passing
your data through Excel.

Anyways, for more detailed discussion read Patrick's document.
Personally I'm hopping off this potentially flamy wagon. Regards
Liviu

[1] http://lib.stat.cmu.edu/S/Spoetry/Tutor/spreadsheet_addiction.html
#
I have to answer again, and again partially because of a misleading
argument.

Let me state first that I think it is a matter of taste what kind of
software one wants to use in research. If a researcher dislikes Excel,
of course he can refuse to use it.
The situation is slightly different for academic teachers.
Not talking about spreadsheets and some of the statistics
they can do when Excel is installed on practically every
desktop in professional environments is dangerous
because you are implicitly telling your students
"we are so smart that we only are going to use software
which is not accessible to dumber people".

Not for your arguments (and I know Patrick's paper
and I disagree with quite a few - but not all - of his points).

-22 is indeed surprising, and people should be warned about it.
But by itself it is not a reason to refuse to use it.
If let Java compute 1234567788*2 you get -1825831720
No error message, no warning!
I think this is much worse, and still we have
rJava and many more R extensions based on Java.

You also mention the possible loss of precision when you pass data
through Excel. This is true if you use a character based transfer
(e.g. csv-files). One of the reasons Thomas and I developed
statconn and RExcel is that we wanted a transfer facility
retaining precision.

In fact, RExcel was written to overcome quite of the few shortcomings
Patrick mentions in his paper.

For academic teachers, I still think it is important to
know the shortcomings of Excel and also know which ones
can be alleviated by using RExcel.