Skip to content
Prev 7786 / 21312 Next

[Bioc-devel] Memory issues with BiocParallel::SnowParam()

I have had (potentially transient and environment-related) problems with
bplapply
in gQTLstats.   I substituted the foreach abstractions and the code
worked.  I still
have difficulty seeing how to diagnose the trouble I ran into.

I'd suggest that you code so that you can easily substitute parallel- or
foreach- or
BatchJobs-based cluster control.  This can help crudely isolate the source
of trouble.

It would be very nice to have a way of measuring resource usage in cluster
settings,
both for diagnosis and strategy selection.  For jobs that succeed,
BatchJobs records
memory used in its registry database, based on gc().  I would hope that
there are
tools that could be used to help one figure out how to factor a task so
that it is feasible
given some view of environment constraints.

It might be useful for you to build an AMI and then a cluster that allows
replication of
the condition you are seeing on EC2.  This could help with diagnosis and
might be
a basis for defining better instrumentation tools for both diagnosis and
planning.

On Fri, Jul 10, 2015 at 12:23 AM, Leonardo Collado Torres <lcollado at jhu.edu>
wrote: