Skip to content
Prev 1175 / 2152 Next

socket cluster/gotoblas2 configuration confusion

Dear Steve,
It's getting even more weird - I don't seem to be able to reproduce the 
behaviour ...

I was able (twice, but not reproducibly) to get it working as I want:

I opened terminal (xfce4-term) on my desktop and log into the server 
with ssh -X -C claudia at 172.17.42.86
and start an R session there,
all worked well: 2 workers using 6 cores each for the multiplication.

proc status output:

 > cat (readLines(sprintf('/proc/%d/status', Sys.getpid())), sep = "\n")
Name:	R
State:	R (running)
SleepAVG:	98%
Tgid:	31571
Pid:	31571
PPid:	7664
TracerPid:	0
Uid:	508	508	508	508
Gid:	509	509	509	509
FDSize:	256
Groups:	509
VmPeak:	  560904 kB
VmSize:	  560904 kB
VmLck:	       0 kB
VmHWM:	  276500 kB
VmRSS:	  276496 kB
VmData:	  405528 kB
VmStk:	     140 kB
VmExe:	    2856 kB
VmLib:	   18248 kB
VmPTE:	     916 kB
StaBrk:	19587000 kB
Brk:	22682000 kB
StaStk:	7fffabd50130 kB
Threads:	6
SigQ:	1/79872
SigPnd:	0000000000000000
ShdPnd:	0000000000000000
SigBlk:	0000000000000000
SigIgn:	0000000000000000
SigCgt:	0000000180001e4a
CapInh:	0000000000000000
CapPrm:	0000000000000000
CapEff:	0000000000000000
Cpus_allowed: 
00000000,00000000,00000000,00000000,00000000,00000000,00000000,00ffffff
Mems_allowed:	00000000,00000001

For the other processes,
either via
* terminal -> ssh -> R
* terminal -> emacs -> ess -> R
* terminal -> ssh -> xfce4-panel -> terminal -> R
* terminal -> ssh -> xfce4-panel -> emacs -> ess -> R
I sometimes get 2 cores working in parallel as shown by the cpu graph 
applet, sometimes the applet indicates only one core and the snow timing 
plot indicates both workers worked at the same time, but took twice as 
long as system.time of the matrix multiplication

The proc status is different for those:


 > cat (readLines(sprintf('/proc/%d/status', Sys.getpid())), sep = "\n")
Name:   R
State:  R (running)
SleepAVG:       98%
Tgid:   2983
Pid:    2983
PPid:   2956
TracerPid:      0
Uid:    508     508     508     508
Gid:    509     509     509     509
FDSize: 256
Groups: 509
VmPeak:   560768 kB
VmSize:   545148 kB
VmLck:         0 kB
VmHWM:    342960 kB
VmRSS:    327336 kB
VmData:   389768 kB
VmStk:       144 kB
VmExe:      2856 kB
VmLib:     18248 kB
VmPTE:      1012 kB
StaBrk: 121ab000 kB
Brk:    1a342000 kB
StaStk: 7fffa9d4aa10 kB
Threads:        6
SigQ:   2/79872
SigPnd: 0000000000000000
ShdPnd: 0000000000000000
SigBlk: 0000000000000000
SigIgn: 0000000000000004
SigCgt: 0000000180001e4a
CapInh: 0000000000000000
CapPrm: 0000000000000000
CapEff: 0000000000000000
Cpus_allowed: 
00000000,00000000,00000000,00000000,00000000,00000000,00000000,00000001
Mems_allowed:   00000000,00000001

So the last part of cpus_allowed is 00000001 instead of 00ffffff.

What exactly does that tell me? The man page was not particularly 
enlightning...

How can I change that restriction?

Thanks a lot for your help,

Claudia

PS: I have to leave soon for a seminar over the weekend, so I won't be 
able to try out things again before Monday.