
[Bioc-devel] naming of TxDb packages

Hi Michael,
On 11-11-03 06:36 PM, Michael Lawrence wrote:
This suggests that you have a Mart called "hg19" (see below for why).
The naming scheme when 'Data source' is "BioMart" seems to be a little
bit different. For example, if I use makeTranscriptDbFromBiomart() with
biomart="ensembl" and dataset="hsapiens_gene_ensembl", then I get:

   > GenomicFeatures:::.makePackageName(txdb)
   [1] "TxDb.Hsapiens.BioMart.ensembl.GRCh37.p5"

Token #4 ("ensembl") is the name of the Mart. I'm a little bit
surprised by token #5 though. I would have expected it to be
the Ensembl version (possibly followed by the reference genome)
because one can always infer the reference genome from the Ensembl
version but not the other way around. In other words, if Ensembl
makes 2 or more releases based on the same reference genome, our
current naming scheme won't differentiate the resulting TxDb packages.
Wouldn't it be better if we had something like:

   TxDb.Hsapiens.BioMart.ensembl.63
   TxDb.Hsapiens.BioMart.ensembl.64
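
To make the proposal concrete, here is a minimal sketch of how such names
could be built. The helper makeVersionedTxDbName() is hypothetical (it is
not part of GenomicFeatures and is not how .makePackageName() works); it
only assumes the organism name, the Mart name and the Ensembl release
number are known:

```r
# Hypothetical helper, NOT part of GenomicFeatures: builds a TxDb package
# name whose last token is the Ensembl release instead of the genome build.
makeVersionedTxDbName <- function(organism, mart, release) {
    # Abbreviate the organism, e.g. "Homo sapiens" -> "Hsapiens"
    parts <- strsplit(organism, " ", fixed = TRUE)[[1]]
    abbrev <- paste0(substr(parts[1], 1, 1), parts[2])
    # Join the tokens with dots, following the TxDb.<organism>.<source>... pattern
    paste("TxDb", abbrev, "BioMart", mart, release, sep = ".")
}

name63 <- makeVersionedTxDbName("Homo sapiens", "ensembl", 63)
name64 <- makeVersionedTxDbName("Homo sapiens", "ensembl", 64)
```

With this kind of scheme, two releases get two different package names
even when they share a reference genome.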

Anyway, back to your problem. Yes, in your case the technical mode
doesn't really matter so it's really up to you. Maybe being explicit
about the reference genome (with *.UCSC.hg19.*) is more important
than the technical mode?

H.