The two chisq.test p values differ when the contingency table (PR#3896)

Kurt Hornik · 2003-08-21T20:20:48Z

>>>>> dmurdoch writes:

>> Date: Wed, 16 Jul 2003 01:27:25 +0200 (MET DST)
>> From: shitao@ucla.edu

>>> x
>>      [,1] [,2]
>> [1,]  149  151
>> [2,]    1    8
>>> c2x
>> for(i in (1:20)){c2x > simulate.p.value=T,B=100000)$p.value)}
>>> c2tx
>> for(i in (1:20)){c2tx > + B=100000)$p.value)}
>>> cbind(c2x,c2tx)
>> c2x c2t

Argh.  Very interesting ...

I think it works to use

            STATISTIC <- sum(sort((x - E) ^ 2 / E, decreasing = TRUE))

instead: this starts by summing the big values, and hence, if anything,
slightly 'underestimates' the real value, which is fine for the
comparisons.
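
To illustrate the point, here is a minimal sketch, not the actual
chisq.test code.  The helper names (sum_in_order, stat_sorted) and the
toy vector v are made up for this example, and E is assumed to be the
usual independence fit.  The first part shows that adding the big value
first can only lose the small terms; the second shows that the sorted
sum gives the same statistic for the reported table and its transpose.

```r
# Hypothetical helper: add the elements one at a time, left to right,
# in plain double precision.
sum_in_order <- function(v) Reduce(`+`, v)

# One large term and ten terms too small to register against it.
v <- c(1, rep(1e-16, 10))

big_first   <- sum_in_order(sort(v, decreasing = TRUE))
small_first <- sum_in_order(sort(v))

# Big values first: every small term is rounded away, so the result
# cannot exceed the small-first sum.
print(big_first == 1)
print(small_first > big_first)

# Table from the report (filled column-wise) and a sorted-sum statistic.
x <- matrix(c(149, 1, 151, 8), nrow = 2)

# Hypothetical helper: E is assumed to be row total * column total / n.
stat_sorted <- function(tab) {
  E <- outer(rowSums(tab), colSums(tab)) / sum(tab)
  sum(sort(as.vector((tab - E)^2 / E), decreasing = TRUE))
}

# The cell contributions are the same set of numbers for x and t(x);
# sorting fixes the order in which they are added.
print(identical(stat_sorted(x), stat_sorted(t(x))))
```

Sorting makes the order of summation depend only on the values, not on
how the table happens to be laid out in memory.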

Fix committed to r-devel.  Thanks for looking into this.

-k