
[Bioc-devel] GenomicRanges, streaming and Tabix

I agree that supporting out-of-core representations is the way to go.
Implementations should push the iteration as far down as possible, so
that the user does not have to worry about it for common operations.
The DelayedArray package is a good example of this. The GenomicRanges
data structures are already engineered to support alternative
implementations. If both GRanges and DataFrame had out-of-core
implementations, then along with DelayedArray we would have an
out-of-core SummarizedExperiment. There is a question of granularity,
i.e., would SummarizedExperiment be composed of individual out-of-core
components, or would SummarizedExperiment itself have an out-of-core
implementation? Maybe we would want both, depending on the underlying
system(s)?
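
To make "pushing the iteration down" concrete, here is a minimal sketch in Python rather than R. It is not the DelayedArray API or any Bioconductor interface; ChunkedVector and demo_chunks are hypothetical names invented for illustration. The idea is that the loop over chunks lives in one private method, so the caller asks for a sum or a mean and never handles chunks directly.

```python
from typing import Callable, Iterator, List


class ChunkedVector:
    """Hypothetical out-of-core vector whose values arrive chunk by chunk."""

    def __init__(self, chunk_source: Callable[[], Iterator[List[float]]]):
        # chunk_source returns a fresh iterator over chunks on every call,
        # standing in for re-reading a file or querying a remote store.
        self._chunk_source = chunk_source

    def _reduce(self, step, init):
        # The only place where iteration over chunks happens.
        acc = init
        for chunk in self._chunk_source():
            acc = step(acc, chunk)
        return acc

    def sum(self) -> float:
        return self._reduce(lambda acc, chunk: acc + sum(chunk), 0.0)

    def length(self) -> int:
        return self._reduce(lambda acc, chunk: acc + len(chunk), 0)

    def mean(self) -> float:
        return self.sum() / self.length()


def demo_chunks() -> Iterator[List[float]]:
    # Stand-in for reading successive blocks from disk (assumed data).
    yield [1.0, 2.0, 3.0]
    yield [4.0, 5.0]
    yield [6.0]


coverage = ChunkedVector(demo_chunks)
total = coverage.sum()      # streams all chunks once
average = coverage.mean()   # streams twice: once for sum, once for length
```

The granularity question shows up even here: mean() is composed of two independent streaming passes, whereas a container-level implementation could compute both in a single pass over the underlying store.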

Michael




On Thu, Jul 21, 2016 at 11:12 AM, Simon Anders <simon.anders at fimm.fi> wrote: