Skip to content

R-sig-ecology Digest, Vol 23, Issue 2

1 message · Highland Statistics Ltd

#
1. Low counts (Tore Chr Michaelsen)
Tore,

I'm actually trying to write a paper on exactly the same (low numbers) 
problem. But it doesn't go very fast. The first thing you have to ask 
yourself is whether the fact that there are no zeros is because you 
cannot have zeros...or is it just by chance? In the first case, consider 
zero truncated GLMs. The problem that I face myself with clutch size 
data with values between 1 and 5 is underdispersion. 
Hence....underdispersed zero truncated GLMs. And that brings you to 
generalized Poisson GLMs. Yes....there is always more shit. Now...I 
noticed that the zero truncation is not a real problem (i.e. similar 
SEs) as long as the fitted values are around 4 or 5 (or higher). In the 
snake carcasses data in  Chapter 11 of our mixed modelling book, the 
mean was between 1 and 2..and in that case differences between SEs of 
Poisson GLM and trunctated Poisson GLMs were about a factor 3.

As to your diagonal lines...those are due to your discrete values... In 
fact...those "lines" are always present..also in linear regression..but 
then you don't notice them. The extreme case is binary data.

So...summarising...think first about truncation....then check for 
underdispersion because you have a small range of observed values.

Alain