Skip to content

[Bioc-devel] range-directed metadata management

4 messages · Vincent Carey, Steve Lianoglou, Michael Lawrence

#
Hi,

On Thu, Jul 10, 2014 at 1:52 PM, Vincent Carey
<stvjc at channing.harvard.edu> wrote:
You might consider just stuffing it all in the database.

SQLite supports RTrees, which is a spatial index, so you could in
theory get the fast overlap stuff baked in w/o a need to have a
parallel GRanges object to index into the database:
http://www.sqlite.org/rtree.html

Before the reboot of the GenomicFeatures package (we're talking around
2008/2009?) I was doing something like that for genomic annotations.

The way that Hadley has abstracted db access in dplyr to make a
database look like a data.frame and respond to all the "data
manipulation verbs" in the same way gives me inspiration to believe
that we can do the same and make the database look essentially like a
GRanges / VRanges object and get cooking that way.

Hopefully this answer was at least minimally aligned in the direction
of what you were asking ;-)

-steve