Skip to content
Prev 5110 / 21312 Next

[Bioc-devel] readGAlignmentPairs function broken?

Hi Leonard,

What happened between GenomicAlignments 0.99.8 and
GenomicAlignments 0.99.10 is that I modified
readGAlignmentPairsFromBam() to use the new pairing algo
(implemented in scanBam()) instead of the pairing algo
implemented by findMateAlignment():

------------------------------------------------------------------------
r84106 | hpages at fhcrc.org | 2013-12-09 23:49:43 -0800 (Mon, 09 Dec 2013) 
| 3 lines

Modify readGAlignmentPairsFromBam() to use 
scanBam(BamFile(asMates=TRUE), ...)
instead of findMateAlignment() for the pairing.

------------------------------------------------------------------------

The new pairing algo (implemented by Val and Martin) is fast and memory
efficient, and, last but not least, supports 'yieldSize' for reading the
BAM file by chunk. The same pairing algo is also used by
readGAlignmentsListFromBam().

readGAlignmentPairsFromBam() actually calls readGAlignmentsListFromBam()
and then turns the returned GAlignmentsList object into a
GAlignmentPairs object. I need to do some timings but I suspect this
transformation is taking a little bit too long and there might be room
for optimization (like e.g. avoiding the GAlignmentsList intermediate
representation).

I'll keep you updated.

Thanks,
H.
On 12/19/2013 12:58 PM, Leonard Goldstein wrote: