Skip to content

multiple regression

3 messages · Kingsford Jones, David Hewitt

#
I completely agree.
Gelman and Hill (multilevel modeling), the book referred to by Gustaf
in another thread, provides some nice examples where rescaling input
variables leads to easier-to-interpret coefficients. I agree with KJ
that doing so is not a cure-all, but it can help. Mostly it just
relates to having a "smart" idea of the effects of interest and what
they mean. Of course, rescaling or standardizing is often not an easy
fix, as KJ notes.

...SNIP...
It is indeed tough, but I don't think partial correlations/SSEs are a
good route. What methods are you referring to in particular? I can't
see how this would help except in the simplest linear models.
Agreed. And thus the search for one "best" (or "useful", although this
is a much fuzzier idea) model is a bad idea. Multimodel inference and
model averaging using some formal selection criterion has not been
mentioned, but protects you in such situations to some extent. Far
better than focusing on one model from a selection method of any type.
#
On Tue, Feb 16, 2010 at 11:15 PM, David Hewitt <dhewitt37 at gmail.com> wrote:
Hi David,

My aim wasn't to hold up those metrics as improved measures of
importance, but rather to mention the idea calculating a metric over
all possible orderings of the model.  E.g, see the Gromping paper I
cited earlier in the thread, or for more seminal work:

@article{1987,
title = {Relative Importance by Averaging Over Orderings},
author = {Kruskal, William},
journal = {The American Statistician},
volume = {41},
number = {1},
jstor_formatteddate = {Feb., 1987},
pages = {6--10},
abstract = {Many ways have been suggested for explicating the
ambiguous concept of relative importance for independent variables in
a multiple regression setting. There are drawbacks to all the
explications, but a relatively acceptable one is available when the
independent variables have a relevant, known ordering: consider the
proportion of variance of the dependent variable linearly accounted
for by the first independent variable; then consider the proportion of
remaining variance linearly accounted for by the second independent
variable; and so on. When, however, the independent variables do not
have a relevant ordering, that approach fails. The primary suggestion
of this article is to rescue the idea by averaging relative importance
over all orderings of the independent variables. Variations and
extensions of the idea are described.},
year = {1987},
publisher = {American Statistical Association}
}

Kingsford
#
Gotcha. I think we are on the same track. If I understand things
correctly, the new methods coming out for multimodel inference and
model-averaging are the grown-up versions of this older idea. I think
Chatfield really pushed all this in the mid-1990s with his work on
model selection uncertainty.