Skip to content

Rmpi not spawning across nodes

1 message · Stephen Weston

#
We have a similar cluster to yours, and I am able to spawn workers on
multiple nodes using the procedure that you describe (except that I
don't use the qsub "-V" option). I'm using R 3.0.2, Rmpi 0.6.3, and
Open MPI 1.6.5 on a RHEL 6.2 cluster, however, we didn't use the nopsm
option when building Open MPI. (Note that I eventually installed Rmpi
using the "--no-test-load" option to avoid the "error obtaining unique
transport key" problem.)

We configured Open MPI 1.6.5 using the options:

  --enable-shared --enable-static --with-tm --with-openib --with-hwloc=internal

Since you appear to be using a PBS-derived system, you might want to
try using "--with-tm" (if you're not already) to see if that makes a
difference. That option does relate to remote execution, so it seems
worth trying.

In any case, I'd be very interested to hear if and how you solve the problem.

- Steve

On Thu, Jun 26, 2014 at 10:30 AM, Russell Pierce
<russell.s.pierce at gmail.com> wrote: