Skip to content
Prev 5233 / 15274 Next

Discretising intra-day data -- how to get by with less memory?

On Fri, Nov 27, 2009 at 07:37:03AM -0500, Gabor Grothendieck wrote:
The object I'm dealing with has 13,667,891 rows and a lot of columns.

I thought it might make sense to:

  thinz <- z[,1]
  figure out the row numbers for the aggregate(blah, tail, 1) operation in thinz
  discretised <- z[therownums,]

So instead of doing an aggregate(blah,tail,1), we'd analyse thinz and
come up with an integer vector therownums, and use that to make the
discretised object.

This would be memory efficient since thinz has only one column.