Skip to content
Prev 10296 / 20628 Next

Model validation for Presence / Absence, (binomial) GLMs

Highland Statistics Ltd <highstat at ...> writes:
That's an interesting document: I think the bottom line is:
  
  * if the Bernoulli data can be grouped, i.e. if there are
in general multiple observations with the same set of covariates,
then overdispersion can be identified, because the data are
really equivalent to a binomial response within the groups.

  For example, the trivial example

grp  resp
A    1
A    0
A    1
B    0
B    0
B    1

is equivalent to:

grp  successes total
A    2         3
B    1         3

this sort of aggregation can be done with plyr::ddply, or
in lots of different ways -- it's generally faster and 
more efficient to fit the data in the grouped form than
in the binary form.

It might have been more convenient for readers to be able to
go directly to the link:

http://www.highstat.com/BGS/GLMGLMM/pdfs/
 HILBE-Can_binary_logistic_models_be_overdispersed2Jul2013.pdf
{URL broken to make Gmane  happy}

rather than having to click 5 times to get through the
Highland Statistics material to the PDF of interest ...