Skip to content

OLS Regression diagnostic measures check list - what to consider?

3 messages · Tal Galili, Greg Snow, Liviu Andronic

#
First a note, while that is a nice list, I think it needs a disclaimer about only running tests that answer a meaningful question for the data/problem being studied.  If all those tests are run on datasets, I would be most suspicious of those datasets which passed all the tests.  Also, failing some of those tests does not mean that there is a problem with the regression model or its inferences.

This leads to what I think needs to be included on such lists (or replace such lists):  The methods described in the paper:

Buja, A., Cook, D. Hofmann, H., Lawrence, M. Lee, E.-K., Swayne,
     D.F and Wickham, H. (2009) Statistical Inference for exploratory
     data analysis and model diagnostics Phil. Trans. R. Soc. A 2009
     367, 4361-4383 doi: 10.1098/rsta.2009.0120

Which in short says to create several plots, one is the residual (or other) plot from the real data, the rest are based on simulated data that fulfills all the assumptions.  If you cannot tell which plot is "real", then any violations of the assumptions are not practically significant.

The vis.test function in the TeachingDemos package implements a version of this test.
#
On 5/5/10, Tal Galili <tal.galili at gmail.com> wrote:
Two on-line resources would be "REGRESSION DIAGNOSTICS" by John Fox
[1] and ?Practical Regression and Anova using R? by Julian Faraway
[2].

Regards
Liviu

[1] http://socserv.socsci.mcmaster.ca/jfox/Courses/Brazil-2009/index.html
[2] http://cran.r-project.org/doc/contrib/Faraway-PRA.pdf