[Bioc-devel] SummarizedExperiment vs ExpressionSet

One thing that?s become apparent working on epivizr is that it may be useful to think about ?rowData? in a SummarizedExperiment as having two distinct components: row coordinates and row metadata. In the current class rowData is a ?GenomicRanges? which contains both coordinates (the ranges) and metadata (mcols(rowData)). In metagenomics (the other application my group works a lot with), we think of the taxonomy as providing coordinates. The distinction is worthwhile thinking about since there are certain operations we do on coordinates that we don?t do with metadata (and conversely).

Thinking about it this way, the ?ExpressionSet? object would be data without coordinates. So, I would avoid making ?GenomicRanges? behave like ?DataFrame? since this distinction between coordinates and metadata is lost. The ?emptyRanges? proposal gets closer to this since this corresponds to ?no coordinates?, but it may be worth thinking in the long term on making the coordinate/metadata distinction more general.

Hector

On Wed, Nov 26, 2014 at 12:38 PM, Tim Triche, Jr. <tim.triche at gmail.com>
wrote:

[Bioc-devel] SummarizedExperiment vs ExpressionSet

Thread (10 messages)