Skip to content
Prev 2051 / 2152 Next

48K csv files, 1000 lines each. How to redesign? (big picture)

In fact 40GB is not that much data to play with. And fortunately you have
multiple solutions. :

1) Make R great again by adopting 64 bit of R. This would dramatically
increase your memory limit (
https://stat.ethz.ch/R-manual/R-devel/library/base/html/Memory-limits.html)

2) Make streaming or out of memory computation using packages like
bigmemory (
ftp://cran.r-project.org/pub/R/web/packages/bigmemory/bigmemory.pdf)

3) If you are willing to adopt 3rd party platforms, you can also try
importing them into Elasticsearch and play with Kibana (catch: it looks
good on resume). (https://www.elastic.co/products/kibana)
On Wed, Mar 1, 2017 at 2:50 PM, Paul Johnson <pauljohn32 at gmail.com> wrote: