Skip to content
Prev 258144 / 398502 Next

logistic regression: wls and unbalanced samples

Many thanks for your messages.

I will take a look at the survey package.
I was concerned with the issues raised by Cramer (1999) in "Predictive
performance of the binary logit model in unbalanced samples".

In this particular case, misclassification costs are much higher for
the smaller group (defaults) than for the larger group (non-defaults).
However, I have no specific guidelines for how much higher. If I
understood correctly, using sampling weights would help improve
accuracy on the smaller group and, at least, I would be able to
explain the rationale for the different weights.

To cite properly, I was referring to lrm in the Design package
(Harrel, 2008). Sorry to have intruded the list with such question,
but - once again - thank you for your answers.

On Wed, Apr 27, 2011 at 7:29 AM, Prof Brian Ripley
<ripley at stats.ox.ac.uk> wrote: