NaN from function
On 23-Feb-2012 Jonathan Williams wrote:
Dear Helpers,
I wrote a simple function to standardise variables if they
contain more than one value. If the elements of the variable
are all identical, then I want the function to return zero.
When I submit variables whose elements are all identical to
the function, it returns not zero, but NaNs.
zt=function(x){if (length(table(x)>1)) y=(x-mean(x))/sd(x) else if
(length(table(x)==1)) y=0; return(y)}
zt(c(1:10))
#[1] -1.4863011 -1.1560120 -0.8257228 -0.4954337 -0.1651446 0.1651446
#0.4954337 0.8257228 1.1560120 1.4863011
zt(rep(1,10))
#[1] NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN
Would you be so kind as to point out what I am doing wrong, here?
How can I obtain zeros from my function, instead of NaNs?
(I obtain NaNs also if I set the function to zt=function(x){
if (length(table(x)>1)) y=(x-mean(x))/sd(x) else if
(length(table(x)==1)) y=rep(0, length(x)); return(y)} ).
Thanks, in advance, for your help,
Jonathan Williams
The issue here, Jonathan, is that when you evaluate
(x-mean(x))/sd(x) for a vector x whose elements are all equal,
not only is (x-mean(x)) = 0, but also sd(x) = 0, so you are
asking the function to return the result of 0/0. Since this
is undefined, the result is NaN.
A basic solution for this special case would be
zt=function(x){
if (sd(x) == 0) return(0*x) else return( (x-mean(x))/sd(x) )
}
This should cover the case where length(table(x))==1 (see also below).
I'm not happy about your conditions
if (length(table(x)>1))
if (length(table(x)==1))
since they ask for "length(table(x)>1)", which doesn't seem
to represent any natural criterion. E.g.:
length(table(1:10)>1)
# [1] 10
length(table(rep(1,10))>1)
# [1] 1
if(length(table(1:10)>1)) y <- "Yes" else y <- "No" ; y
# [1] "Yes"
if(length(table(rep(1,10))>1)) y <- "Yes" else y <- "No" ; y
# [1] "Yes"
length(table(1:10)==1)
# [1] 10
length(table(rep(1,10))==1)
# [1] 1
if(length(table(1:10)==1)) y <- "Yes" else y <- "No" ; y
# [1] "Yes"
if(length(table(rep(1,10))==1)) y <- "Yes" else y <- "No" ; y
# [1] "Yes"
I suspect you meant to write
if (length(table(x))>1)
and
if (length(table(x)))==1)
since this distinguishes between two more more different values
(length(table(x)) > 1) and all equal values (length(table(x)) == 1).
Ted.
-------------------------------------------------
E-Mail: (Ted Harding) <Ted.Harding at wlandres.net>
Date: 23-Feb-2012 Time: 16:40:03
This message was sent by XFMail