Skip to content
Prev 6709 / 20628 Next

Binomial vs. logistic regression & the consequences of aggregation

On 21 September 2011 23:56, Jeremy Koster <helixed2 at yahoo.com> wrote:
As long as you stay with the same aggregation of the data there are no
implications for model selection using AIC as Ben explained. The
really interesting question (to me at least) is how to deal with BIC
where the penalty term depends on the number of observations. BIC can
actually choose different models depending on whether you aggregate
the data or not. That brings up the question of whether to count the
number of observations as the number of binomial observations (rows in
X) as most people seem to be doing it, or as the number of Bernoulli
trials (sum of case weights). Since data carries the same information
whether they are aggregated or not, it is somewhat unfortunate that
BIC can depends on it.

Cheers
Rune