Skip to content
Prev 17014 / 63424 Next

boxplot() defaults {was "boxplot in extreme cases"}

Hi Martin,

I'm not sure this innovation (medlwd=3 default) is a good idea. Boxplots 
are designed to display many samples simultaneously on a graph, and it is 
important they be as clean and as simple as possible. To my eye, and to 
everyone in my lab, the thickened median line is rather distracting and 
makes the boxplots look more cluttered ("ugly" one of my postdocs said). 
The thickened line also goes against Tufte's principle of using minimum ink 
to represent the message.

Yours and Erich's point about distinguishing the median==1st quartile case 
from the median==3rd quartile case is well taken. How about making medlwd=3 
(or medlwd=2) the default behaviour only when the median coincides with one 
of the quartiles? That might satisfy everyone?

I notice that there wasn't any follow up discusssion of this post of the 
r-devel list. Did this suggestion get any support? The boxplots have been 
so well accepted in their current form for many, many years, decades even, 
so one should be especially cautious of making changes without some sort of 
consensus.

Best
Gordon
---------------------------------------------------------------------------------------
Dr Gordon K Smyth, Senior Research Scientist, Bioinformatics,
Walter and Eliza Hall Institute of Medical Research,
1G Royal Parade, Parkville, Vic 3050, Australia
Tel: (03) 9345 2326, Fax (03) 9347 0852,
Email: smyth at wehi.edu.au, www: http://www.statsci.org