Skip to content

finding most highly transcribed genes - ranking, sorting and subsets?

3 messages · alison waller, Martin Morgan

#
Thanks so much Martin,

This method is definitely more straightforward.  And you are right I don't
think I was doing anything wrong before. However, I thought that rank, would
rank the highest 1st, however after looking at the results using your
methods, I realized it ranks the lowest number 1.  So I modified it for
rank>18500.  And now I'm getting 300 rows for which the intensity is
consistenly high.

However, I am still laking some information.  For the results I can get a
matrix of 300 rows and the corresponding intensities (from m) or rank (from
h), but what I really want is the name of the original row, which
corresponds to a specific spot on the array).

I did msubset<-m[hrows,] and as mentioned I just get the rows numbered
1-300, while I want to essentially pickout the 300 rows from the original
19,000 rows maintaing the original row designation as it corresponds to a
specific gene.

Thanks again for any suggestions,

Alison

-----Original Message-----
From: Martin Morgan [mailto:mtmorgan at fhcrc.org] 
Sent: Thursday, December 06, 2007 4:06 PM
To: alison waller
Subject: Re: [R] finding most highly transcribed genes - ranking, sorting
and subsets?

Hi Alison --

I'm not sure where your problem is coming from, but R can help you to
more efficiently do your task. Skipping the bioc terminology and data
structures, you have a matrix
you'd like to determine the rank of values in each column
identfiy those with high rank
and find the rows for which the rank is always high
you can then use hrows to subset your original matrix (m[hrows,]) or
otherwise, e.g., how many rows with high rank
[1] 0

or perhaps the distribution of the number of columns in which high
ranking genes occur.
0    1    2    3    4 
5996 3132  765  100    7 

Martin

"alison waller" <alison.waller at utoronto.ca> writes:
up-
missed
as
I
http://www.R-project.org/posting-guide.html

  
    
#
Hi Alison --

It's a funny twist of terminology, isn't it? high rank (we're #1!)
corresponds to low value. Maybe a wimpy stats joke? Anyway, (a) if m
is assigned rownames (e.g., from the appropriate column of the 'genes'
data frame in the limma object, rownames(m) <- maList$genes$GeneName)
they'll be caried through the analysis and (b) if you've extracted m
from a limma MAList, then subsetting the MAList with hrow
(maList[hrow,]) will give you a new MAList with all the info carrying
through. This would be the better way to go.

Martin

"alison waller" <alison.waller at utoronto.ca> writes:

  
    
#
Thanks - great, should have thought of option b)


-----Original Message-----
From: Martin Morgan [mailto:mtmorgan at fhcrc.org] 
Sent: Friday, December 07, 2007 12:52 PM
To: alison waller
Cc: R-help at stat.math.ethz.ch
Subject: Re: [R] finding most highly transcribed genes - ranking, sorting
and subsets?

Hi Alison --

It's a funny twist of terminology, isn't it? high rank (we're #1!)
corresponds to low value. Maybe a wimpy stats joke? Anyway, (a) if m
is assigned rownames (e.g., from the appropriate column of the 'genes'
data frame in the limma object, rownames(m) <- maList$genes$GeneName)
they'll be caried through the analysis and (b) if you've extracted m
from a limma MAList, then subsetting the MAList with hrow
(maList[hrow,]) will give you a new MAList with all the info carrying
through. This would be the better way to go.

Martin

"alison waller" <alison.waller at utoronto.ca> writes:
would
(from
consistently
as
were
However,