rowSums()

Marc Schwartz · 2008-09-24T14:38:23Z

on 09/24/2008 09:06 AM Doran, Harold wrote:
> Say I have the following data:
>
> testDat
>> testDat
>    A  B
> 1  1 NA
> 2 NA NA
> 3  3  3
>
> rowsums() with na.rm=TRUE generates the following, which is not desired:
>
>> rowSums(testDat[, c('A', 'B')], na.rm=T)
> [1] 1 0 6
>
> rowsums() with na.rm=F generates the following, which is also not
> desired:
>
>> rowSums(testDat[, c('A', 'B')], na.rm=F)
> [1] NA NA 6
>
> I see why this occur


The behavior you observe is documented in ?rowSums in the Value section:

If there are no values in a range to be summed over (after removing
missing values with na.rm = TRUE), that component of the output is set
to 0 (*Sums) or NA (*Means), consistent with sum and mean.


So summing an all-NA input with na.rm = TRUE gives:

[1] 0


This follows from the convention that the sum of an empty set is 0,
which I got burned on myself a while back.
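To see the convention in isolation, here is a small base R sketch. The
vector x is my own illustration, not the command from the original
message. Note that current R returns NaN rather than NA for the empty
mean; is.na() treats both as missing.

```r
# A vector with nothing left once the NAs are dropped.
x <- c(NA_real_, NA_real_)

# Sum over an empty set: the additive identity, 0.
empty_sum <- sum(x, na.rm = TRUE)

# Mean over an empty set: 0 / 0, which R reports as NaN.
empty_mean <- mean(x, na.rm = TRUE)

print(empty_sum)
print(empty_mean)
```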

To get NA for rows that are entirely NA, while still ignoring NAs in
partially filled rows, you could feasibly use:

  Res <- rowSums(testDat, na.rm = TRUE)
  is.na(Res) <- rowSums(is.na(testDat)) == ncol(testDat)
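As a worked check, the same two lines can be run against the data from
the question. The data.frame() call is my reconstruction of testDat from
the printed output, so treat it as an assumption. The three row sums
should come out as 1, NA and 6.

```r
# testDat rebuilt by hand from the output shown in the question.
testDat <- data.frame(A = c(1, NA, 3), B = c(NA, NA, 3))

# Sum each row, ignoring NAs; the all-NA row comes back as 0 here.
Res <- rowSums(testDat, na.rm = TRUE)

# Count the NAs per row; where every column is NA, set the sum to NA.
is.na(Res) <- rowSums(is.na(testDat)) == ncol(testDat)

print(Res)
```

The comparison against ncol(testDat) is what distinguishes a row with no
data at all from a row whose observed values happen to add up to 0.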

HTH,

Marc Schwartz
