
[R-pkg-devel] GPU programming in R packages

Ivan,

I am working on a package that implements several tests for the multivariate two-sample problem. Some of them are based on the distances between the points. Say the two data sets have 1000 observations each in 10 dimensions. Calculating such a test statistic requires roughly 1000*1000*10 = 10^7 operations. Some of these tests don't come with a p-value, so one needs to use a permutation method, say with another 1000 repetitions, and we are up to 10^10 operations. Finally, one might want to do a power study, which needs another factor of 10*1000, and we are at 10^14. And I really want to do 20 or 30 such studies. So it is clear we need something very fast.
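Written out, the rough count above runs as follows. The symbols n, m, d (sample sizes and dimension), B (permutations) and S (power-study runs) are labels introduced here only for illustration, and the last line simply multiplies by the 20 to 30 studies mentioned:

```latex
\begin{align*}
\text{one statistic:}      \quad & n \cdot m \cdot d = 10^{3} \cdot 10^{3} \cdot 10 = 10^{7} \\
\text{permutation test:}   \quad & 10^{7} \cdot B = 10^{7} \cdot 10^{3} = 10^{10} \\
\text{one power study:}    \quad & 10^{10} \cdot S = 10^{10} \cdot (10 \cdot 10^{3}) = 10^{14} \\
\text{20 to 30 studies:}   \quad & 20 \cdot 10^{14} \text{ to } 30 \cdot 10^{14} = 2 \times 10^{15} \text{ to } 3 \times 10^{15}
\end{align*}
```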

Currently I have everything implemented in Rcpp and using parallel programming, but even that is not fast enough. So I thought doing the calculation of the pairwise distances on a GPU might be worth looking into.
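For reference, the kernel in question looks roughly like the sketch below. This is a plain C++ illustration, not the package's actual code: the function name pairwise_sq_dist, the flat row-major layout, and the choice of squared Euclidean distance are all assumptions made for the example. The point it shows is that each (i, j) entry is computed independently of the others, which is the property that would let a GPU assign one thread per entry.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical helper, for illustration only.
// x holds n points and y holds m points, each of dimension d,
// stored row-major in flat vectors (point i of x starts at x[i * d]).
// Returns the n x m matrix of squared Euclidean distances, row-major.
std::vector<double> pairwise_sq_dist(const std::vector<double>& x,
                                     const std::vector<double>& y,
                                     std::size_t n, std::size_t m,
                                     std::size_t d) {
    std::vector<double> out(n * m, 0.0);
    for (std::size_t i = 0; i < n; ++i) {
        for (std::size_t j = 0; j < m; ++j) {
            // Each (i, j) entry depends only on row i of x and row j of y,
            // so the two outer loops can be parallelised freely.
            double s = 0.0;
            for (std::size_t k = 0; k < d; ++k) {
                const double diff = x[i * d + k] - y[j * d + k];
                s += diff * diff;
            }
            out[i * m + j] = s;
        }
    }
    return out;
}
```

With n = m = 1000 and d = 10 the inner statement runs 10^7 times per call, matching the count above. In a permutation test the pooled distance matrix does not change when labels are permuted, so it may be possible to compute it once and only re-index it per permutation; whether that applies depends on the particular test statistic.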

Wolfgang