Martin Maechler <maechler at stat.math.ethz.ch>
on Thu, 1 Feb 2018 16:34:04 +0100 writes:
Hervé Pagès <hpages at fredhutch.org>
on Tue, 30 Jan 2018 13:30:18 -0800 writes:
> Hi Martin, Henrik,
> Thanks for the follow up.
> @Martin: I vote for 2) without *any* hesitation :-)
> (and uniformity could be restored at some point in the
> future by having prod(), rowSums(), colSums(), and others
> align with the behavior of length() and sum())
As a matter of fact, I had stopped procrastinating and already worked
a bit on implementing '2)' over the weekend, and made it work
- more or less. It needs a bit more work, and I had also been considering
replacing the numbers in the current overflow check
    if (ii++ > 1000) {                                          \
        ii = 0;                                                 \
        if (s > 9000000000000000L || s < -9000000000000000L) {  \
            if(!updated) updated = TRUE;                        \
            *value = NA_INTEGER;                                \
            warningcall(call, _("integer overflow - use sum(as.numeric(.))")); \
            return updated;                                     \
        }                                                       \
    }                                                           \
i.e., I thought of tweaking the '1000' and the '9000000000000000L',
but decided to leave them as they are and add comments there about why,
for the moment.
They may look arbitrary, but are not at all: if you multiply
them (which is the relevant product, given that we check the sum 's' only
every 1000-th time ... ((though I'm still not sure they *are* correct)))
you get 9*10^18, which is only slightly smaller than
2^63 - 1 (about 9.22*10^18), the maximal "LONG_INT" (64-bit signed)
integer we have.
So, in the end, at least for now, we do not quite go all the way
but flag overflow a bit earlier, ... but do potentially gain a bit of
speed, notably with the ITERATE_BY_REGION(..) macros
(which I did not show above).
Will hopefully become available in R-devel real soon now.
Martin