Skip to content

[Bioc-devel] Slow performance on scanBam

5 messages · Martin Morgan, James Bullard

#
Hi James -- I don't think there's anything in existence to make this easier, but 
I'll expose something in the next 24 hours; is your data shareable? There might 
be deeper things to be done for processing this small-but-numerous style data.

Martin
On 05/12/2014 05:32 PM, James Bullard wrote:

  
    
7 days later
#
On 05/13/2014 08:17 AM, James Bullard wrote:
Hi Jim -- I updated the seqinfo,BamFile-method to do more work in C, and for 
scanBamHeader to optionally parse only the targets|text part of the header. I 
also reverted a change to seqinfo,BamFile-method, introduced in Rsamtools 
version 1.15.28, to try to place seqlevels into 'natural' order; now they are 
returned in the order they appear in the file.

Together these should make for much faster code, for your sim.bam about 3.5 (vs 
185) seconds for seqinfo, and ~7s for scanBam.

This is in Rsamtools version 1.17.16, which is in svn now but won't make it to 
biocLite until tomorrow, all being well...

Martin