An embedded and charset-unspecified text was scrubbed... Name: not available Url: https://stat.ethz.ch/pipermail/r-help/attachments/20080303/f8ec369e/attachment.pl
simulation study using R
6 messages · Davood Tofighi, jim holtman, Paul Gilbert
What is the format of the data you are storing (single value, multivalued vector, matrix, dataframe, ...)? This will help formulate a solution. What do you plan to do with the data? Are you going to do further analysis, write it to flat files, store it in a data base, etc.? How big are the data objects you are manipulating?
On Mon, Mar 3, 2008 at 7:05 PM, Davood Tofighi <dtofighi at asu.edu> wrote:
Dear All,
I am running a Monte Carlo simulation study and have some questions on how
to manage data storage efficiently at the end of each 1000 replication loop.
I have three conditions coded using the FOR {} loops and a FOR loop that
generates data for each condition, performs analysis, and computes a
statistic 1000 times. Therefore, for each condition, I will have 1000
statistic values. My question is what's the best way to store the 1000
statistic for each condition. Any suggestion on how to manage such
simulation studies is greatly appreciated.
Thanks,
--
Davood Tofighi
Department of Psychology
Arizona State University
[[alternative HTML version deleted]]
______________________________________________ R-help at r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Jim Holtman Cincinnati, OH +1 513 646 9390 What is the problem you are trying to solve?
An embedded and charset-unspecified text was scrubbed... Name: not available Url: https://stat.ethz.ch/pipermail/r-help/attachments/20080303/ba1642f1/attachment.pl
One of the things you might take a look at is the 'filehash' package. It is an easy way of storing/retrieving R objects. I have an application where my objects are matrices of about the same size and I can quickly store the data and then come back later with a different script to do further analysis.
On 3/3/08, Davood Tofighi <dtofighi at asu.edu> wrote:
Thanks for your reply. For each condition, I will have a matrix or data frames of 1000 rows and 4 columns. I also have a total of 64 conditions for now. So, in total, I will have 64 matrices or data frames of 1000 rows and 4 columns. The format of data I would like to store would be data frames or matrices. I also would like to store the data for later use, e.g., a plot of the empirical distribution of the chi^2, or to compute the power of Chi^2 across 1000 reps for each condition. On Mon, Mar 3, 2008 at 7:03 PM, jim holtman <jholtman at gmail.com> wrote:
What is the format of the data you are storing (single value, multivalued vector, matrix, dataframe, ...)? This will help formulate a solution. What do you plan to do with the data? Are you going to do further analysis, write it to flat files, store it in a data base, etc.? How big are the data objects you are manipulating? On Mon, Mar 3, 2008 at 7:05 PM, Davood Tofighi <dtofighi at asu.edu> wrote:
Dear All, I am running a Monte Carlo simulation study and have some questions on
how
to manage data storage efficiently at the end of each 1000 replication
loop.
I have three conditions coded using the FOR {} loops and a FOR loop that
generates data for each condition, performs analysis, and computes a
statistic 1000 times. Therefore, for each condition, I will have 1000
statistic values. My question is what's the best way to store the 1000
statistic for each condition. Any suggestion on how to manage such
simulation studies is greatly appreciated.
Thanks,
--
Davood Tofighi
Department of Psychology
Arizona State University
[[alternative HTML version deleted]]
______________________________________________ R-help at r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide
http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. -- Jim Holtman Cincinnati, OH +1 513 646 9390 What is the problem you are trying to solve? -- Davood Tofighi Department of Psychology Arizona State University P.O. BOX 871104 Tempe, AZ 85287-1104 Tel.:480-727-7884
Jim Holtman Cincinnati, OH +1 513 646 9390 What is the problem you are trying to solve?
1 day later
Davood Tofighi wrote:
Thanks for your reply. For each condition, I will have a matrix or data frames of 1000 rows and 4 columns. I also have a total of 64 conditions for now. So, in total, I will have 64 matrices or data frames of 1000 rows and 4 columns. The format of data I would like to store would be data frames or matrices. I also would like to store the data for later use,
I generally find it is better to store the seed and other data you need to reproduce the experiment, rather than trying to store the data (see, for example, package setRNG). If you save only the summary statistics you need, then you can usually do it in memory. (Be sure to assign variables for the statistics to their full size first and then populate them, rather than extending them at each step.) If you write things to files then it will slow down your simulation a lot. In fact, in most cases it will be quicker to re-run the experiment than it will be to read the data from disk. Paul
e.g., a plot of the empirical distribution of the chi^2, or to compute the power of Chi^2 across 1000 reps for each condition. On Mon, Mar 3, 2008 at 7:03 PM, jim holtman <jholtman at gmail.com> wrote:
What is the format of the data you are storing (single value,
multivalued vector, matrix, dataframe, ...)? This will help formulate
a solution. What do you plan to do with the data? Are you going to
do further analysis, write it to flat files, store it in a data base,
etc.? How big are the data objects you are manipulating?
On Mon, Mar 3, 2008 at 7:05 PM, Davood Tofighi <dtofighi at asu.edu> wrote:
Dear All,
I am running a Monte Carlo simulation study and have some questions on
how
to manage data storage efficiently at the end of each 1000 replication
loop.
I have three conditions coded using the FOR {} loops and a FOR loop that
generates data for each condition, performs analysis, and computes a
statistic 1000 times. Therefore, for each condition, I will have 1000
statistic values. My question is what's the best way to store the 1000
statistic for each condition. Any suggestion on how to manage such
simulation studies is greatly appreciated.
Thanks,
--
Davood Tofighi
Department of Psychology
Arizona State University
[[alternative HTML version deleted]]
______________________________________________ R-help at r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide
http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. -- Jim Holtman Cincinnati, OH +1 513 646 9390 What is the problem you are trying to solve?
====================================================================================
La version fran?aise suit le texte anglais.
------------------------------------------------------------------------------------
This email may contain privileged and/or confidential in...{{dropped:26}}
An embedded and charset-unspecified text was scrubbed... Name: not available Url: https://stat.ethz.ch/pipermail/r-help/attachments/20080305/211a235b/attachment.pl