Skip to content
Prev 122 / 2152 Next

Rmpi with PBSPro and OpenMPI

Thank you. I was using version 0.5-5. It seems that upgrading to version
0.5-7 seems to have worked, mostly. If I try saving my workspace with
mpi.quit("yes"), I get the following:

[n026:09730] *** Process received signal ***
[n026:09730] Signal: Segmentation fault (11)
[n026:09730] Signal code:  (128)
[n026:09730] Failing at address: (nil)
[n026:09730] [ 0] /lib64/tls/libc.so.6 [0x2a95c84500]
[n026:09730] [ 1] /lib64/ld-linux-x86-64.so.2 [0x2a9555d334]
[n026:09730] [ 2] /lib64/ld-linux-x86-64.so.2 [0x2a9555d724]
[n026:09730] [ 3] /lib64/ld-linux-x86-64.so.2 [0x2a9556119f]
[n026:09730] [ 4] /lib64/ld-linux-x86-64.so.2 [0x2a95560ef2]
[n026:09730] [ 5] /usr/lib64/libvapi.so(vipul_cleanup+0x50)
[0x2a9965a4c0]
[n026:09730] *** End of error message ***
mpirun noticed that job rank 0 with PID 9730 on node n026c exited on
signal 11 (Segmentation fault).

Mark Lyman


-----Original Message-----
From: Hao Yu [mailto:hyu at stats.uwo.ca] 
Sent: Tuesday, March 10, 2009 11:47 AM
To: Lyman, Mark
Cc: r-sig-hpc at r-project.org
Subject: Re: [R-sig-hpc] Rmpi with PBSPro and OpenMPI

Hi Mark,

What is the version of Rmpi you are using? Version 0.5-5 or older had a
bug in Rprofile but it was solved since 0.5-6.

.Last never intends to be a way to close R slaves. It is only used when
some one doesn't close R salves and master properly. Here is what I
normally do
{karl:58}orterun -n 4 R --no-save -q
master (rank 0, comm 1) of size 4 is running on: karl
slave1 (rank 1, comm 1) of size 4 is running on: karl
slave2 (rank 2, comm 1) of size 4 is running on: karl
slave3 (rank 3, comm 1) of size 4 is running on: karl
mpi.close.Rslaves()
[1] 1
mpi.quit()

Please note that master and slaves are created from one communicator.
They
live or die together, unlike spawning where master can live even slaves
quit.

Hao
Lyman, Mark wrote:
question
could
error
fine,
more
to