SLOW split() function

As another followup, given that you are doing numerous regression
models and (I presume) working with finance/stock data that is
strictly numeric (no need for special contrast coding, etc.), you can
substantially reduce the time spent estimating the coefficients.  A
simple way is to use lm.fit directly instead of lm.  For lm.fit, you
pass the y and x (design) matrices directly.  This skips a good deal
of overhead.  Here is one naive way, I imagine more speedups could be
gained by incorporating the intercept (1 vector) into d instead of
cbind()ing it.  The catch it that lm.fit requires matrices, not data
tables, so what you gain may be lost in having to do an extra
conversion.  In any case, here are the times on my system for the two
options (note I used N = 1000 * 100 because I am presently on a
glorified netbook).

SLOW split() function

Thread (11 messages)