Skip to content
Prev 997 / 21312 Next

[Bioc-devel] RFC: eSet with two color data

Wolfgang Huber <huber at ebi.ac.uk> writes:
What are typical actions with such an object?  I'm particularly
interested in access patterns for subsetting.  Is getting a matrix for
each color a common thing to do?

I think the data organization of the expression values in option B
(congruent matrices in assayData, one for each color) has some
advantages in terms of accessing a given color in an efficient manner.
Ratios of colors is vectorized easily and fast.  With option A neigher
operation is quite as straight forward I think.

It is true that option B would require some amount of coding.  Martin
Morgan and I discussed this a bit we realized that one could have
phenoData exactly the same as in option A.  The phenoData table would
have a special column (label/dye/color/colour) and values would
correspond to named matrices in assayData.  The eSet extension would
then handle subsetting (this is the infrastructure that would need
coding).

I suspect that the efficiency difference in obtaining an expression
matrix for a particular dye will make option B worth the effort.

+ seth