Creating data
I'm currently teaching a graduate course in statistics for linguistics using R. I have used up most of the 'authentic' data I have been able to collect for homework and demonstrations. I can think of plenty more possible data sets, but I find creating them challenging, and my creations are often somewhat unrealistic (generally too 'neat' and obvious).
Why don't you give us some parameters for the types of data you are looking for, so we can suggest possible sources of new data? You can see a few links I've collected at http://delicious.com/hadley/data. If you want something really big, there's also the data expo challenge: http://stat-computing.org/dataexpo/2009/

Regards,
Hadley