lme4, cloglog vs. binomial link (peter dalgaard)
Hi Peter and List, I confess that I have difficulty in seeing the connection with Poisson processes. When we are fishing indefinitely we can assume an endless supply of fish capture events so a Poisson process seems reasonable. In my case of an insect leaving a habitation each 'process' will just be a single departure event at some future time which is not exactly observed. Perhaps I'm thinking about this in the wrong way? Murray
On 8/06/2012 10:52 p.m., peter dalgaard wrote:
Hi Murray, I think this is pretty strongly related to proportional hazards modelling. If you are looking at it from a Poisson (*) process point of view, the rate of event per unit of time when observing a number of independent processes should be proportional to time and the number of processes, and the probability of at least one event in a fixed length of time T is then 1 - exp(- n T lambda) (or 1 - exp (-n Lambda(T)) if you have a time-varying intensity, Lambda being the integrated intensity). -pd (*) Could be fun if this was actually about fish... On Jun 6, 2012, at 23:21 , Murray Jorgensen wrote:
*Hi Peter, Tibor et al.
I came across an ecological situation recently where a cloglog link seemed
to be called for. I won't remove the context to the following explanatory
note that I wrote but I'm sure the same kind of situation could be
reasonably common:
We wish to explore the probability of one or more females departing a
cavity between two site visits as a function of the habitation state of the
cavity at the first visit. More strictly we study the probability of a
decrease in the number of females inhabiting the cavity between the two
visits. Clearly this probability will be zero if no females inhabit the
cavity at the first visit. More generally the probability will be larger as
the number of female inhabitants increase as each has the opportunity to
depart.
Although this dependance on the initial number of females is part of what
we want to study we are more interested in questions such as the influence
of the initial number of males on the probability of female decrease. We
are indeed also interested in the effect of the initial numbers of females
on the probability of female decrease, but more in the sense that we would
like to know whether this is greater than, less than or equal to what would
be predicted by a simple model.
One naive model that could be considered is that the decrease probability
would be proportional to the number of females. This might work if the
decrease probability was very low but for larger decrease probabilities
would predict decrease probabilities greater than one. A less naive model
would assume that each female departs with the same probability,
independently of the other females.
Then if the probability of a single female departing is $p$ and there are
$x$ females in the cavity the probability of 1 or more departing is $p_x =
1 - (1-p)^x$.
The link function for the complementary log-log link is $\eta =
\log(-\log(1-p))$. To examine the effect of multiple initial females we
evaluate this at $p_x$.
1-p_x = (1-p)^x
\log(1-p_x) = x\log(1-p)
\log(-\log(1-p_x)) = \log(x)+\log(-\log(1-p))
Thus the effect of an initial habitation of $x$ females is a shift of
$\log(x)$ on the linear predictor scale if a complementary log-log link is
used in a GLM or GLMM for the probability of female decrease. This means
that the naive model can be accommodated by including $\log(x)$ as an
offset. If $x$ were also included as a covariate, a significant
coefficient would indicate a departure from the naive model.
Regards, Murray
*
Message: 5 Date: Wed, 6 Jun 2012 22:54:16 +0200 From: peter dalgaard<pdalgd at gmail.com> To: Tibor Kiss<tibor at linguistics.rub.de> Cc: r-sig-mixed-models at r-project.org Subject: Re: [R-sig-ME] lme4, cloglog vs. binomial link Message-ID:<7658F572-AEE2-4127-AB77-321B5B6C3D69 at gmail.com> Content-Type: text/plain; charset=us-ascii On Jun 4, 2012, at 13:07 , Tibor Kiss wrote:
[...snippage...] My questions are as follows: 1. Is it correct to assume that given a cloglog link, the less frequent
response should be considered the success? No, cloglog is asymmetric, so it will make a difference which outcome is considered success, but there is no mathematical reason to choose between them. In survival data, the cloglog comes out of the proportional hazards model when you have death within a fixed time period as the response (exact date of death not recorded). In that case, death is "success" (!); hopefully, it is the least likely outcome, but it might not be. If cloglog is just used as a generic link function, then no such logic applies.
2. Is it correct to conclude that the changes in the model have led to
less influence of the random factor? No. The scales are different. At the very least, you need to somehow compare it to the fixed effects on the same scale.
3. How shall I react to the increase in AIC?
(Or, equivalently, the deviance). The cloglog link model seems to give the worse fit to data.
A final question, which may not have an answer at all: I am most curious
to learn about possible modifications of the model so that an observed random effect can be minimized (while its presence cannot be denied). First, is that desirable, and why? The only logic, that I can think of, is that you want to get the fixed-effect part of the model right, so that the error is not mistakenly taken as part of the random variation. -- Peter Dalgaard, Professor, Center for Statistics, Copenhagen Business School Solbjerg Plads 3, 2000 Frederiksberg, Denmark Phone: (+45)38153501 Email: pd.mes at cbs.dk Priv: PDalgd at gmail.com ------------------------------
_______________________________________________ R-sig-mixed-models mailing list R-sig-mixed-models at r-project.org https://stat.ethz.ch/mailman/listinfo/r-sig-mixed-models End of R-sig-mixed-models Digest, Vol 66, Issue 10 **************************************************
-- Dr Murray Jorgensen http://www.stats.waikato.ac.nz/Staff/maj.html Department of Statistics, University of Waikato, Hamilton, New Zealand Email: maj at waikato.ac.nz majorgensen at ihug.co.nz Fax 7 838 4155 Phone +64 7 838 4773 wk Home +64 7 825 0441 Mobile 021 0200 8350 [[alternative HTML version deleted]]
_______________________________________________ R-sig-mixed-models at r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-sig-mixed-models
Dr Murray Jorgensen http://www.stats.waikato.ac.nz/Staff/maj.html Department of Statistics, University of Waikato, Hamilton, New Zealand Email: maj at waikato.ac.nz majmurr at gmail.com Fax 7 838 4155 Phone +64 7 838 4773 wk Home +64 7 825 0441 Mobile 021 0200 8350