I recently heard about GenomicsDB from Intel: https://github.com/Intel-HLS/GenomicsDB/wiki

My understanding is that the upcoming GATK (version 4?) will target it directly for storing genotypes. It's based on the array-oriented database TileDB, which I think distinguishes itself from e.g. SciDB by its special support for sparse data.

TileDB seems like it might be a good candidate for a DelayedArray backend. But I guess we would want to represent it as an entire SummarizedExperiment/VCF.

Thoughts?

Michael
[Bioc-devel] GenomicsDB
1 message · Michael Lawrence