simulation study using R

An embedded and charset-unspecified text was scrubbed...
Name: not available
Url: https://stat.ethz.ch/pipermail/r-help/attachments/20080303/f8ec369e/attachment.pl
What is the format of the data you are storing (single value,
multivalued vector, matrix, dataframe, ...)?  This will help formulate
a solution.  What do you plan to do with the data?  Are you going to
do further analysis, write it to flat files, store it in a data base,
etc.?  How big are the data objects you are manipulating?
Dear All,

I am running a Monte Carlo simulation study and have some questions on how
to manage data storage efficiently at the end of each 1000 replication loop.
I have three conditions coded using the FOR {} loops and a FOR loop that
generates data for each condition, performs analysis, and computes a
statistic 1000 times. Therefore, for each condition, I will have 1000
statistic values. My question is what's the best way to store the 1000
statistic for each condition. Any suggestion on how to manage such
simulation studies is greatly appreciated.
Thanks,

--
Davood Tofighi
Department of Psychology
Arizona State University

       [[alternative HTML version deleted]]

______________________________________________
R-help at r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Jim Holtman
Cincinnati, OH
+1 513 646 9390

What is the problem you are trying to solve?
An embedded and charset-unspecified text was scrubbed...
Name: not available
Url: https://stat.ethz.ch/pipermail/r-help/attachments/20080303/ba1642f1/attachment.pl
One of the things you might take a look at is the 'filehash' package.
It is an easy way of storing/retrieving R objects.  I have an
application where my objects are matrices of about the same size and I
can quickly store the data and then come back later with a different
script to do further analysis.
Thanks for your reply. For each condition, I will have a matrix or data
frames of 1000 rows and 4 columns. I also have a total of 64 conditions for
now. So, in total, I will have 64 matrices or data frames of 1000 rows and 4
columns. The format of data I would like to store would be data frames or
matrices. I also would like to store the data for later use, e.g., a plot of
the empirical distribution of the chi^2, or to compute the power of Chi^2
across 1000 reps for each condition.

On Mon, Mar 3, 2008 at 7:03 PM, jim holtman <jholtman at gmail.com> wrote:
What is the format of the data you are storing (single value,
multivalued vector, matrix, dataframe, ...)?  This will help formulate
a solution.  What do you plan to do with the data?  Are you going to
do further analysis, write it to flat files, store it in a data base,
etc.?  How big are the data objects you are manipulating?

On Mon, Mar 3, 2008 at 7:05 PM, Davood Tofighi <dtofighi at asu.edu> wrote:
Dear All,

I am running a Monte Carlo simulation study and have some questions on
how
to manage data storage efficiently at the end of each 1000 replication
loop.
I have three conditions coded using the FOR {} loops and a FOR loop that
generates data for each condition, performs analysis, and computes a
statistic 1000 times. Therefore, for each condition, I will have 1000
statistic values. My question is what's the best way to store the 1000
statistic for each condition. Any suggestion on how to manage such
simulation studies is greatly appreciated.
Thanks,

--
Davood Tofighi
Department of Psychology
Arizona State University

       [[alternative HTML version deleted]]

______________________________________________
R-help at r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide
http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

--
Jim Holtman
Cincinnati, OH
+1 513 646 9390

What is the problem you are trying to solve?

--
Davood Tofighi
Department of Psychology
Arizona State University
P.O. BOX 871104
Tempe, AZ 85287-1104
Tel.:480-727-7884

Jim Holtman
Cincinnati, OH
+1 513 646 9390

What is the problem you are trying to solve?
Thanks for your reply. For each condition, I will have a matrix or data
frames of 1000 rows and 4 columns. I also have a total of 64 conditions for
now. So, in total, I will have 64 matrices or data frames of 1000 rows and 4
columns. The format of data I would like to store would be data frames or
matrices. I also would like to store the data for later use, 
I generally find it is better to store the seed and other data you need 
to reproduce the experiment, rather than trying to store the data (see, 
for example, package setRNG). If you save only the summary statistics 
you need, then you can usually do it in memory. (Be sure to assign 
variables for the statistics to their full size first and then populate 
them, rather than extending them at each step.) If you write things to 
files then it will slow down your simulation a lot. In fact, in most 
cases it will be quicker to re-run the experiment than it will be to 
read the data from disk.

Paul
e.g., a plot of
the empirical distribution of the chi^2, or to compute the power of Chi^2
across 1000 reps for each condition.

On Mon, Mar 3, 2008 at 7:03 PM, jim holtman <jholtman at gmail.com> wrote:

What is the format of the data you are storing (single value,
multivalued vector, matrix, dataframe, ...)?  This will help formulate
a solution.  What do you plan to do with the data?  Are you going to
do further analysis, write it to flat files, store it in a data base,
etc.?  How big are the data objects you are manipulating?

On Mon, Mar 3, 2008 at 7:05 PM, Davood Tofighi <dtofighi at asu.edu> wrote:

Dear All,

I am running a Monte Carlo simulation study and have some questions on

how

to manage data storage efficiently at the end of each 1000 replication

loop.

I have three conditions coded using the FOR {} loops and a FOR loop that
generates data for each condition, performs analysis, and computes a
statistic 1000 times. Therefore, for each condition, I will have 1000
statistic values. My question is what's the best way to store the 1000
statistic for each condition. Any suggestion on how to manage such
simulation studies is greatly appreciated.
Thanks,

--
Davood Tofighi
Department of Psychology
Arizona State University

       [[alternative HTML version deleted]]

______________________________________________
R-help at r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide

http://www.R-project.org/posting-guide.html

and provide commented, minimal, self-contained, reproducible code.

--
Jim Holtman
Cincinnati, OH
+1 513 646 9390

What is the problem you are trying to solve?

====================================================================================

La version fran?aise suit le texte anglais.

------------------------------------------------------------------------------------

This email may contain privileged and/or confidential in...{{dropped:26}}
An embedded and charset-unspecified text was scrubbed...
Name: not available
Url: https://stat.ethz.ch/pipermail/r-help/attachments/20080305/211a235b/attachment.pl