Fixed! Thanks all: RE: scatterplot to boxplot translation?

Kelly:

Glad you got what you were looking for, but this whole thread begs the
question: (Why) should you do this? You lose information in binning
the continuous data, of course. Perhaps your answer is that the point
scatter in the data is too noisy to clearly discern what's going on, a
legitimate response. One might then -- or in general -- consider
overlaying a fitted smooth (nonparametric) curve on the data to
reveal the "trend." There are a zillion ways to do this in R: both
lattice and ggplot have built-in capabilities to do this easily, as
does base R with ?scatter.smooth. If that's too easy, you can do it by
hand via ?lowess (or its more flexible cousin, ?loess),
?smooth.spline, etc. In actuality, your binning strategy is a crude,
non-smooth version of such smoothing, so it's not that far-fetched. Or
as some of the choicer R-Help pages say, cutting and boxplotting is to
smoothing as histograms are to nonparametric density estimates.
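
For what it's worth, here is a minimal base R sketch of the options
above. The data are simulated (an assumption, since the data from this
thread are not shown here), and the span values and number of bins are
arbitrary illustrative choices, not recommendations.

```r
# Simulated stand-in data; replace x and y with your own vectors
set.seed(1)
x <- runif(200, 0, 10)
y <- sin(x) + rnorm(200, sd = 0.5)

# Easiest route: scatterplot with a loess curve drawn on top
scatter.smooth(x, y, span = 0.5, col = "grey40")

# By hand: lowess() returns the smoothed points sorted by x,
# so the result can be passed straight to lines()
plot(x, y, col = "grey40")
fit_lowess <- lowess(x, y, f = 0.5)
lines(fit_lowess, col = "red", lwd = 2)

# loess() returns a model object, so predict over a grid of x values
fit_loess <- loess(y ~ x, span = 0.5)
grid <- data.frame(x = seq(min(x), max(x), length.out = 100))
lines(grid$x, predict(fit_loess, newdata = grid), col = "blue", lwd = 2)

# The binning approach, for comparison: cut x into intervals
# and boxplot y within each interval
bins <- cut(x, breaks = 5)
boxplot(y ~ bins, xlab = "x (binned)", ylab = "y")
```

Comparing the last plot with the smoothed curves should show the
boxplot medians tracing a stepwise version of the same trend.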

Cheers,
Bert


On Fri, Dec 9, 2011 at 12:05 PM, Vining, Kelly
<Kelly.Vining at oregonstate.edu> wrote: