Skip to content
Prev 8701 / 10988 Next

[Rcpp-devel] examples of using cula matrix multiplication in Rcpp

I?ve been playing around with Rcpp and CUDA (CUBLAS and Magma in particular) for quite a while now and definitely find it useful for improving performance. My interest is mostly in spatial models and gaussian processes where the rate limiting step is usually O(n^3) matrix decomposition where n is between 1000 to 5000.

For these types of tasks I routinely see ~2x improvements over RcppArmadillo & OpenBLAS using a $100 consumer grade card, which isn?t huge but makes a big difference when the overall runtime is around 80 hours per model.

If anyone is interested in looking at some code I have the early stages of a package up on github: https://github.com/rundel/RcppGP <https://github.com/rundel/RcppGP>. In particular the gpu_mat class has a reasonably mature interface for moving data between armadillo and cuBLAS.

-Colin

-----

Colin Rundel
Assistant Professor of the Practice
Duke University, Department of Statistical Science
www.stat.duke.edu/~cr173/
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.r-forge.r-project.org/pipermail/rcpp-devel/attachments/20150516/3ae83d1e/attachment.html>