[FORGED] Q re: logical indexing with is.na

Rolf Turner · 2019-03-10T02:57:05Z

On 3/10/19 2:36 PM, David Goldsmith wrote: > Hi! Newbie (self-)learning R using P. Dalgaard's "Intro Stats w/ R"; not > new to statistics (have had grad-level courses and work experience in > statistics) or vectorized programming syntax (have extensive experience > with MatLab, Python/NumPy, and IDL, and even a smidgen--a long time ago--of > experience w/ S-plus). > > In exploring the use of is.na in the context of logical indexing, I've come > across the following puzzling-to-me result: > >>

Rolf Turner

Sat, Mar 9, 2019 6:57 PM

On 3/10/19 2:36 PM, David Goldsmith wrote:

It happens because R is eco-concious and re-cycles. :-)

Try:

ok <- c(TRUE,TRUE,FALSE)
(1:4)[ok]

In general in R if there is an operation involving two vectors then
the shorter one gets recycled to provide sufficiently many entries to 
match those of the longer vector.

This in the foregoing example the first entry of "ok" gets used again,
to make a length 4 vector to match up with 1:4.  The result is the same 
as (1:4)[c(TRUE,TRUE,FALSE,TRUE)].

If you did (1:7)[ok] you'd get the same result as that from
(1:7)[c(TRUE,TRUE,FALSE,TRUE,TRUE,FALSE,TRUE)] i.e. "ok" gets
recycled 2 and 1/3 times.

Try 10*(1:3) + 1:4, 10*(1:3) + 1:5, 10*(1:3) + 1:6 .

Note that in the first two instances you get warnings, but in the third
you don't, since 6 is an integer multiple of 3.

Why aren't there warnings when logical indexing is used?  I guess 
because it would be annoying.  Maybe.

Note that integer indices get recycled too, but the recycling is limited 
so as not to produce redundancies.  So

(1:4)[1:3] just (sensibly) gives

[1] 1 2 3

and *not*

[1] 1 2 3 1

Perhaps a bit subtle, but it gives what you'd actually *want* rather 
than being pedantic about rules with a result that you wouldn't want.

cheers,

Rolf Turner

P.S.  If you do

y[1:3][!is.na(y[1:3])]

i.e. if you're careful to match the length of the vector and the that of 
the indices, you get what you initially expected.

R. T.

P^2.S.  To the younger and wiser heads on this list:  the help on "[" 
does not mention that the index vectors can be logical.  I couldn't find 
anything about logical indexing in the R help files.  Is something 
missing here, or am I just not looking in the right place?

R. T.

Honorary Research Fellow
Department of Statistics
University of Auckland
Phone: +64-9-373-7599 ext. 88276

[FORGED] Q re: logical indexing with is.na

Thread (9 messages)