Skip to content

Logistic Regression: variable selection based on p value?

3 messages · pufftissue pufftissue, Erik Iverson, Frank E Harrell Jr

#
Puff -

There are many strategies, ideas, and literature on this topic.  A great 
introduction that leads to many of the references that are interesting 
is Frank Harrell's book, "Regression Modeling Strategies".  I would 
highly recommend it.
pufftissue pufftissue wrote:
#
pufftissue pufftissue wrote:
It depends on your goals, but in general problems caused by stepwise 
regression arise from using P-value cutoffs that are too small rather 
than cutoffs that are too large.  There are many reasons not to remove 
any variables, if you want valid confidence intervals and P-values and 
discrimination indexes.  Note that AUC is not a great objective 
function; that's why we have the log likelihood.

Frank