Skip to content

Effect size in GLIM models

1 message · Reader Tom

#
Dear All

Thanks very much for the rapid reply Prof Ripley. I had been looking at
this anaysis for my colleague (Prof Behnke) and suggested that he
contact the R mailing list because I couldn't answer his question. I
think some of the detail got lost in translation (he grew up with the
GLIM package!). So here are some more details:

We are indeed following your guidelines in the MASS book, and using
glm.nb to analyse some data on the abundance of several parasite species
in mice. We proceeded with model selection as suggested, and we are
reasonably happy that we end up with decent models for our several
parasite species. 

The question that Prof Behnke asked is: if we fit similar models
(initial full models have the same factors and covariates) with
different response variables (abundances of different species of
parasite), is there a way of comparing the relative effect sizes of the
key explanatory variables across different models? For example, if we
find that the best models for two different species include the term
"sex", is there a way of determining if sex explains more of the
variance in parasite abundance in species A than in species B? 

In a simple ANOVA with Guassian errors, we might compare the percentage
variance explained. We could also look at the overall r^2 for the models
and determine how well (relatively) our different models perform. We
might end up concluding that for species A, we have found the most
important biolgoical factors explaining parasite abundance, but that for
species B we have yet to explain a large proportion of the variance.

Is there something similar we can do with our glm.nb models? Clearly the
coefficients will tell us about relative effect sizes WITHIN a given
model, but what can we do when comparing completely different response
variables?!

Regards

Tom Reader

-----Original Message-----
From: Prof Brian Ripley [mailto:ripley at stats.ox.ac.uk] 
Sent: 17 January 2007 14:02
To: Behnke Jerzy
Cc: r-help at stat.math.ethz.ch; Reader Tom
Subject: Re: [R] Effect size in GLIM models
On Wed, 17 Jan 2007, Behnke Jerzy wrote:

            
I think there is consensus that effect sizes are not measured by
significance tests.  If you have a log link (you did not say), the model
coefficients have a direct interpretation via multiplicative increases
in rates.
What exactly do you mean by 'GLIM models in R with negative binomial
errors'?  Negative binomial regression is within the GLM framework only
for fixed shape theta. Package MASS has glm.nb() which extends the
framework and you may be using without telling us.  (AFAIK GLIM is a
software package, not a class of models.)

I suspect you are using the code from MASS without reference to the book
it supports, which has a worked example of model selection.
'probability' of what?
If theta is not held fixed, anova() is probably not appropriate: see the
help for anova.negbin.
I don't know anything to suggest percentages of LR statistics are
reasonable summary measures.  There are extensions of R^2 to these
models, but AFAIK they share the well-attested drawbacks of R^2.
We do ask for more information in the posting guide and the footer of
every message.  I have had to guess uncomfortably much in formulating my
answers.