Message-ID: <CAAb4XW18x1SSHihrtZnwLFtLkOOLX+nLsQMJjMfFdyFxojje4A@mail.gmail.com>
Date: 2015-04-23T17:23:24Z
From: Mark Drummond
Subject: Predictions on training set shorter than training set

Hi all,

I fitted a simple logistic regression to a training data set using glm,
but the number of predicted values is less than the number of
observations in the training set:

> fit.train.pred <- predict(fit, type = "response")
> nrow(train)
[1] 62660
> length(fit.train.pred)
[1] 58152

I'm a relative newcomer. I've run lots of simple models (glm, CART,
etc.), but this is the first time I have seen this happen.

Is this a common issue and is there a fix? An option to predict() perhaps?
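
For reference, here is a minimal sketch that reproduces this kind of length mismatch. It assumes the cause is missing values in the model variables, which the post does not confirm. The data frame `toy` and its columns are made up for illustration and are not the training set above. With R's default `na.action` option (`na.omit`), glm drops incomplete rows before fitting, so predict() without `newdata` returns one value per row actually used.

```r
# Hypothetical toy data (NOT the poster's training set).
set.seed(1)
toy <- data.frame(x = rnorm(100), y = rbinom(100, 1, 0.5))
toy$x[1:10] <- NA  # inject 10 missing predictor values

# With the default na.action option (na.omit), incomplete rows are dropped.
fit_omit <- glm(y ~ x, data = toy, family = binomial)
pred_omit <- predict(fit_omit, type = "response")
length(pred_omit)      # 90, not 100

# na.exclude also drops those rows for fitting, but pads the
# predictions back to the original length with NA.
fit_excl <- glm(y ~ x, data = toy, family = binomial,
                na.action = na.exclude)
pred_excl <- predict(fit_excl, type = "response")
length(pred_excl)      # 100
sum(is.na(pred_excl))  # 10

# Passing the data explicitly as newdata also gives one value per row
# (NA where the predictor is missing).
pred_new <- predict(fit_omit, newdata = toy, type = "response")
length(pred_new)       # 100
```

If this is what is happening, comparing `sum(!complete.cases(...))` on the columns used in the model against the shortfall (62660 - 58152 = 4508 rows here) would be one way to check.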

-- 
Cheers, Mark

Mark Drummond
mark at markdrummond.ca

When I get sad, I stop being sad and be Awesome instead. TRUE STORY.