for loop and if problem

Richard Cotton · 2009-01-06T16:33:53Z

> I'm having difficulties with a dataset containing gene names and positions
> of those genes.
> Not such a big problem, but each gene has multiple exons so it's hard to say
> where the gene starts and where it ends. I want the starting and ending
> position of each gene in my dataset.
> Attached is the dataset:
> http://www.nabble.com/file/p21312449/genlistchrompos.csv genlistchrompos.csv
> Column 'B' is the gene name, 'G' is the starting position and 'H' is the
> stop position.
> You can l

which(diff(as.numeric(data$Gene))!=0)

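will give you a vector of the last row in each gene, except for the final
gene, whose last row is the last row of the data.  This assumes data$Gene is
a factor and that the rows of each gene are contiguous.  The start position
is the next row after the previous end.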
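
As a rough sketch of the idea above, the change points can be turned into one row per gene. The toy data and the column names Gene, Start and Stop are assumptions for illustration (the real file refers to spreadsheet columns 'B', 'G' and 'H'), and the sketch also assumes the exons within each gene are sorted by position.

```r
# Toy data; the column names Gene, Start and Stop are assumed for illustration.
data <- data.frame(
  Gene  = factor(c("A", "A", "A", "B", "B", "C")),
  Start = c(100, 250, 400, 900, 1200, 3000),
  Stop  = c(200, 300, 500, 1000, 1300, 3500)
)

# Last row of every gene: the change points, plus the final row of the data.
last_row <- c(which(diff(as.numeric(data$Gene)) != 0), nrow(data))

# First row of every gene: row 1, then the row after each previous end.
first_row <- c(1, head(last_row, -1) + 1)

# One row per gene: start of its first exon, stop of its last exon.
result <- data.frame(
  Gene  = data$Gene[first_row],
  Start = data$Start[first_row],
  Stop  = data$Stop[last_row]
)
```

If Gene has been read in as character rather than factor, wrapping it in factor() before as.numeric() should be enough for the same approach to work.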

Also take a look at 

split(data, data$Gene)
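
One possible way to use split() here is to take the smallest start and largest stop within each piece. Again, the toy data and the column names Gene, Start and Stop are assumptions for illustration, not the names in the attached file.

```r
# Toy data; the column names Gene, Start and Stop are assumed for illustration.
# The exons of gene "A" are deliberately out of order.
data <- data.frame(
  Gene  = c("A", "A", "A", "B", "B", "C"),
  Start = c(250, 100, 400, 900, 1200, 3000),
  Stop  = c(300, 200, 500, 1000, 1300, 3500)
)

# One data frame per gene.
by_gene <- split(data, data$Gene)

# For each gene, keep the smallest start and the largest stop,
# then bind the one-row results back into a single data frame.
gene_range <- do.call(rbind, lapply(by_gene, function(d) {
  data.frame(Gene = d$Gene[1], Start = min(d$Start), Stop = max(d$Stop))
}))
```

Because min() and max() are taken within each gene, this version does not depend on the rows being sorted or contiguous.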

Regards,
Richie.

Mathematical Sciences Unit
HSL


