High performance computing with R
Yes, there are some problems with RStudio on multicore or computer cluster systems. I hope they will fix that in one of the next releases!

If no computer cluster is available, you should try cloudnumbers.com. You will get access to a computer cluster in the cloud in less than 10 minutes. Everything is pre-configured, including R, all R packages and parallel computing technologies. The beta phase is currently running, so you will get free resources!

Best
Markus

Date: Sun, 26 Jun 2011 21:39:38 -0400
From: Zachary Mayer <zach.mayer at gmail.com>
To: r-sig-finance at r-project.org
Subject: Re: [R-SIG-Finance] High performance computing with R
Message-ID: <BANLkTi=Az-bXKujR=_wawz0sy2U3Nzo4qA at mail.gmail.com>
Content-Type: text/plain; charset=windows-1252

Hello,

I'm not 100% sure how to respond to an individual message from the daily digest, so I apologize if I am violating protocol here.

Ben, I suggest you do a little research into the foreach package for R, as well as the various foreach backends, which include doMC, doSMP, doSNOW, doMPI and doRedis. foreach is a generalized framework for parallelizing for loops in R. The various backends enable that parallelism using different technologies: doMC uses the "fork" system call on Linux, doSNOW uses a snow cluster, and doRedis uses a Redis server. Each backend has its own pros and cons.

As stated before, doMC (and its sister package multicore) are probably the best solution for a single machine: with multicore you can use the function 'mclapply' to replace the vanilla function 'lapply' and have instant parallelism with almost no extra work, but neither package works on Windows or with RStudio. doRedis is my current favorite solution for clusters of multiple machines on Amazon EC2, but it takes a small amount of extra work to set up a Redis server.

The answer to your question really depends on your operating system, how many machines you have, and what technologies you are comfortable with. Do some research before you commit to hardware, and rewrite your code to make use of the 'foreach' looping structure. Good luck!
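
To make the advice above concrete, here is a minimal sketch of the same loop written three ways: with plain lapply, with mclapply, and with foreach on the doMC backend. It makes some assumptions that are not in the message: slow_square() and the input 1:8 are hypothetical stand-ins for a real workload, two cores are assumed, and mclapply is loaded from the 'parallel' package, which absorbed multicore in later R releases. As noted above, the fork-based approaches do not work on Windows.

```r
# Sketch only: slow_square() and the input 1:8 are hypothetical stand-ins
# for a real workload. Intended for Linux/macOS, since both parallel
# variants rely on forking.
library(parallel)  # provides mclapply(); successor to the multicore package
library(foreach)
library(doMC)      # the fork-based foreach backend named in the message

slow_square <- function(x) {
  Sys.sleep(0.1)   # pretend each task is expensive
  x^2
}

# 1. Sequential baseline
seq_res <- lapply(1:8, slow_square)

# 2. Drop-in parallel replacement for lapply()
mc_res <- mclapply(1:8, slow_square, mc.cores = 2)

# 3. The same loop in foreach form, run on the registered backend
registerDoMC(cores = 2)
fe_res <- foreach(i = 1:8, .combine = c) %dopar% slow_square(i)

# All three variants should agree
stopifnot(identical(seq_res, mc_res))
stopifnot(isTRUE(all.equal(unlist(seq_res), fe_res)))
```

Only the registerDoMC() line is specific to the backend. Registering a different foreach backend instead should leave the foreach loop itself unchanged, which is the main reason to rewrite loops in this form before choosing hardware.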
Dr. rer. nat. Markus Schmidberger
Senior Community Manager
Cloudnumbers.com GmbH
Chausseestraße 6
10119 Berlin
www.cloudnumbers.com
E-Mail: markus.schmidberger at cloudnumbers.com