Skip to content
Prev 12172 / 63424 Next

Distributed computing

Unfortunately, there is a major complication to this approach:  the distribution
of data.  ScaLAPACK (and PLAPACK) requires the data to be distributed in a
special way before calculation functions can be called.  Given a generic R
matrix, we have to distribute the data before we can call ScaLAPACK functions on
it.  We then have to collect the answer before we can return it to R.  Because
of this serious overhead, replacing all LAPACK calls with ScaLAPACK calls would
not be recommended.  Future versions of our package [1] may include some type of
automatic benchmarking to decide when problems are large enough to be worth
sending to ScaLAPACK.


David Bauer

[1] http://www.aspect-sdm.org/Parallel-R/