help with loop over data frame
On Mon, 2007-02-19 at 13:38 +1100, Dr Remo Sammassimo wrote:
Dear List, This may be the fifth time Ive tried to send this to the list so apologies if there are multiple emails. I need some help getting started with this problem. I have a data frame containing a year of daily stock prices in the following format: Date Open High Low Close 1/15/2000 10 11 8 10 1/16/2000 12 12 10 11 etc.. I want to create a new data frame which shows only the rows where the column value "Open" for 'today' is higher than the column value "High" for the previous day (previous row). How do I loop over each day accessing values from different rows and columns, as is needed here? I have tried 'if' statements but none have worked. Any help appreciated. Regards, Alf Sammassimo Melbourne,Australia
I think that this should do it.
Presuming that your data frame is called 'DF':
Rows <- which(sapply(seq(along = rownames(DF))[-1],
function(x) DF[x, "Open"] > DF[x - 1, "High"])) + 1
DF.New <- DF[Rows, ]
The first line sets up a sequence from 2:nrows(DF) and then loops over
those indices. The indices are passed as 'x' to the function, which
compares the current row (x) "Open" value with the prior row (x - 1)
"High" value. This returns TRUE or FALSE for each row compared.
If TRUE, which() then returns the index of the row plus 1, since we do
not want the first row. Those indices are assigned to 'Rows', which is
then used to subset 'DF' and create 'DF.New'.
Just using the data you have above:
DF.New
Date Open High Low Close 2 1/16/2000 12 12 10 11 See ?which, ?sapply and ?seq HTH, Marc Schwartz