Creating data
Scott, Have you considered looking at phonetics data? I refer for instance to f0 (fundamental pitch) as a time series, based on recordings of speech. I think I heard that (for instance) data from recordings of native Unangax speakers (Unangan/Unangas, also called Aleut) are posted on the Web somewhere. Jake Jacob A. Wegelin Assistant Professor Department of Biostatistics Virginia Commonwealth University 730 East Broad Street Room 3006 P. O. Box 980032 Richmond VA 23298-0032 U.S.A. E-mail: jwegelin at vcu.edu URL: http://www.people.vcu.edu/~jwegelin
On Tue, 24 Mar 2009, Scott F. Kiesling wrote:
Hi everyone- I'm currently teaching a graduate course in statistics for linguistics using R. I have used up most of the 'authentic' data I have been able to collect for homework and demonstrations. I can think of plenty more possible data sets, but I am finding the creation of them challenging, and my creations are often somewhat unlealistic (generally, too 'neat' and obvious). So, I was wondering if anyone had any tips on creating 'realistic' data sets, or links/books that describe it. For a simple example, let's say I want to create a dataset with students from different countries and academic departments who took an English test. I want to make some differences (significant and not) and possibly even interactions among the scores by country and department. I have been doing this through various iterations of sample() and rnorm(), and jitter() to get some randomness, but things are still coming out pretty neatly. Is this the right (or a good) method? Advice? Thanks in advance- SFK -- Scott F. Kiesling, PhD Associate Professor Department Chair Department of Linguistics University of Pittsburgh, 2816 CL Pittsburgh, PA 15260 http://www.linguistics.pitt.edu Office: +1 412-624-5916
_______________________________________________ R-sig-teaching at r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-sig-teaching