Running jobs on a linux cluster: ways to communicate finished jobs, so fill in with new jobs - R-SIG-HPC

Laura S · 2010-08-10T15:15:03Z

An embedded and charset-unspecified text was scrubbed... Name: not available URL:

Tue, Aug 10, 2010 9:57 AM #

On Tue, Aug 10, 2010 at 10:15 AM, Laura S <leslaura at gmail.com> wrote:

You don't give us much to go on. What scheduler does your cluster use?
for example.

Here's what I'd do. Write a shell script that runs all of the programs
one after the other.  Without knowing more about the scheduling scheme
on your cluster, I can't say exactly how I would go about it.

If you have access to a BASH shell, for example, it should be as simple as

#!/bin/bash

R --vanilla -f yourRprogram1.R

R --vanilla -f yourRprogram2.R

=====================

and so forth. If you rewrite the first line of your R code to use
Rscript or littler, then you don't even need to bother with the "R
--vanilla -f" part, as each R program will become self aware (and take
over the world, like in Terminator).

If you run exactly the same R program over and over again, make a for loop.

As long as you have the details worked out on each individual run of
the model, the rest of it is not even really a "cluster" problem. You
have to run one after the other.

FYI, I've been uploading practical working examples for our Rocks
Linux cluster using the Torque/OpenPBS scheduling system.  Maybe some
will help you.

http://pj.freefaculty.org/cgi-bin/mw/index.php?title=Cluster:Main

I think I could work out an example of the sort you describe if you
tell us a bit more about how the separate simulation runs talk to each
other.

Or, I should add, if the runs go one after the other, why don't you
put them all in 1 R program.  ??

Paul E. Johnson
Professor, Political Science
1541 Lilac Lane, Room 504
University of Kansas