[Bioc-devel] Data Package Size Issues (.idat and .rda)

On 11/07/2013 09:26 PM, Nicolas De Jay wrote:
Hi Nicolas -- Some things to bear in mind.

Files are compressed in package tarballs, so your IDAT files may have a 
considerably smaller effective size.
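
One way to check this before submitting is to compare the raw size of the data directory with the size of a gzip-compressed tar archive of it. The following is only a rough sketch in Python, not part of the Bioconductor tooling: the function name `raw_and_compressed_size` is made up for illustration, and it assumes the package tarball is gzip-compressed at the default level. The actual ratio depends entirely on the data.

```python
import io
import os
import tarfile


def raw_and_compressed_size(directory):
    """Return (raw_bytes, tar_gz_bytes) for all regular files under directory.

    Hypothetical helper: builds a gzip-compressed tar archive in memory,
    which only approximates what a built package tarball would contain.
    """
    raw = 0
    buffer = io.BytesIO()
    with tarfile.open(fileobj=buffer, mode="w:gz") as tar:
        for root, _dirs, files in os.walk(directory):
            for name in files:
                path = os.path.join(root, name)
                raw += os.path.getsize(path)
                # Store paths relative to the directory being measured.
                tar.add(path, arcname=os.path.relpath(path, directory))
    # The archive is fully written once the tarfile context has closed.
    return raw, len(buffer.getvalue())
```

Calling `raw_and_compressed_size("inst/extdata")` (the directory name is an assumption about where the IDAT files live) returns both sizes in bytes, so the effective size can be compared with the raw one. Because the archive is held in memory, this is only practical for data sets that fit comfortably in RAM.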

Generally, original text files are a much better way to store external data than 
Rda files. For instance, Rda files require updating when / if the class 
definition changes, whereas the provenance and content of the original files 
are unambiguous.

Experiment data packages are meant to provide reusable examples for pedagogic 
purposes. One would hope that minfiData fulfills this requirement. If not, then 
it would be better to continue the current discussion with Kasper and others in 
the community to identify an appropriately comprehensive data set for use across 
many relevant packages.

There is no formal statement about the maximum size of experiment data packages, 
but one would need to make a strong argument for why a gigabyte of experiment 
data is necessary (including why existing experiment data packages are 
fundamentally inadequate), especially if it is to support a single package.

Martin