logistic regression and spatial autocorrelation

Hi,
to weigh in on this:
@Aitor, Harrell's rules of thumb are assuming independent predictors
without
any fancy covariance function. To model the covariance of the residuals
you are now estimating extra 
2nd order parameters from the data, so even more data is needed to
stabilize the parameter estimates.
The good news is that in the residual space it is the numbers of
adjacent 0's or 1's that matter. 

However, if the goal is prediction of species occurrence at unoccupied
sites, than
you may want to think about the problem differently and use either
indicator kriging,
kind of a spatial tobit model to predict probabilities of occurrence
based on Gaussian random
fields, or, you might want to look at geoRglm, for geostatistics in the
glm framework. The problem here
is, as another poster mentioned, is you may have more of a network than
a continuous random field, you may
get around that by using an anisotropic variogram.

Otherwise, in a prediction model in a regression context, over fitting
is going to be more of an issue
than autocorrelation of the residuals. Putting the spatial coordinates,
or the principal components of the spatial
weight matrix as one of the predictors may be good enough. Spatial
autocorrelation really effects the estimates
of the variance, and comes into play if you want to do inference, or
estimate confidence intervals/prediction intervals.

Again, all this assumes you are more interested in prediction than
modeling mechanism.

Nicholas

logistic regression and spatial autocorrelation

Thread (2 messages)