Skip to content

[Bioc-devel] VariantAnnotation: Same locus, multiple samples

2 messages · Michael Lawrence, Valerie Obenchain

#
I don't see how this can be fixed. The two data structures are semantically
incompatible; they encode different types of information, so information is
lost in both directions. Even if we collapsed the alts, there is no way (as
far as I know) to say that data for one individual + alt combination is
absent. We could put NA (".") for every value concerning that alt, but it
seems too big of an assumption to say that all(is.na())) implies omission
of the VRanges element. In other words, VCF is rectangular and VRanges is
ragged, and there is no established way to encode the raggedness in the VCF.



On Mon, Dec 8, 2014 at 11:27 AM, Valerie Obenchain <vobencha at fredhutch.org>
wrote:

  
  
#
(Resending - the last message didn't post to the list.)

I was thinking the absence of a header in VRanges would make collapsing 
difficult and with your comments it's clear this isn't a good idea.

I like the description you gave of the differences in class content and 
geometry and have added them to the VRanges man page.

Valerie
On 12/08/14 13:25, Michael Lawrence wrote: