Skip to content
Prev 366914 / 398506 Next

GLM HELP NEEDED!

Learning to read R error messages carefully and for full meaning is an essential step toward full mastery of this wonderful gift from the R Core.

The business about "negative values" is a complete distraction. That was the consequent of an if statement and was only in there to show you where to look in the function if you were so inclined . The error is actually being thrown much earlier in parsing that statement by the "<" operator inside the `if` statement. The "real" error message that says:
Or in English:

 missing value where TRUE/FALSE needed


Examine this code:
Error in if (any(y < 0)) stop("negative values not allowed for the 'Poisson' family") : 
  missing value where TRUE/FALSE needed
In addition: Warning message:
In Ops.factor(y, 0) : ?<? not meaningful for factors

Because the "<" operator is not defined for factors the result that is passed to `if` is of length 0. Setting the factor variable on the RHS and using the integer values on hte LHS succeeds.
Call:  glm(formula = y ~ x, family = "poisson")

Coefficients:
(Intercept)           x2           x3           x4           x5  
  4.676e-11    6.931e-01    1.099e+00    1.386e+00    1.609e+00  

Degrees of Freedom: 4 Total (i.e. Null);  0 Residual
Null Deviance:	    3.591 
Residual Deviance: 6.661e-16 	AIC: 24.35

Duncan Murdoch points out that fractional values in the LHS of a formula for Poisson regression will not be accepted (since the poisson distribution is discrete), and if you do in fact need Poisson regression that you would need to use the quasi-binomial family.

On the other hand ... If those were counts in the thousands and needed to be converted to "whole numbers", you might need to convert the factor values to numeric with:

a$transinduc <- as.numeric( gsub( "[,]", "", a$transinduc) )