errors with lme4
[cc'ing to r-sig-mixed-models list]
On 11-11-24 03:25 PM, Alessio Unisi wrote:
Hi Ben, thanks for answer! sorry but i'm a new R-user and i'm not so skilled...also in statistic! : )) just few answer and question to yours...
Knowing *neither* R *nor* statistics can be a fairly dangerous combination. If you ask politely, people on the R help lists will often help with statistical questions, but they are technically off-topic. There are other places (such as http://stats.stackexchange.com/ ) for asking statistics questions ... and it would really be very best to see if you can get some local help (classes or helpful colleagues/fellow students/professors/consultants).
I can't prove it, but I strongly suspect that some of your coefficients are perfectly multicollinear.
some they are...yes..but not all...hatch and lay surely they are...what does it change i have to multiplicate instead of sum?
By "perfectly multicollinear" I don't mean that they are strongly collinear (which ecologists often worry about, correctly, but sometimes more than they need to) but rather "perfectly". For example, suppose you ran a 2x2 factorial design (e.g. effects of temperature and light) but ended up with a missing "corner" (e.g. no samples in the high-light/high-temperature combination). You would then be unable to estimate an interaction term, because you would be trying to estimate 4 parameters (intercept/grand mean, light effect, temperature effect, interaction) from only three independent sets of data. This is the same idea: some of your predictors probably line up *perfectly* with combinations of other predictors.
the idea of using glmer() or lmer() is because i had to deal with random factor (1|territory)...glm can't handle this? i don't think...
You're correct that you will need to get back to glmer() eventually, but I wanted you try out glm() because the presence of NAs in your coefficient vector will confirm that collinearity is the problem, not some other issue with glmer ...
How many observations are left after na.omit(fledge)?
sorry..i don't understand...
When you run the analysis, R will drop rows from your data set that have NAs in any of the predictors. It looks like you have a total of 152 observations, but I wonder how many there are with complete records. nrow(na.omit(fledge)) will tell you this.
What is the difference between your 'S1' and 'S2' temperature records?
those are temperature recorded in different time....S1 is during egg laying and incubation and S2 is during hatching and rearing of the chicks
Can we please see the results of summary(fledge)? It would be good if you were willing to post your whole data set somewhere for download (or at a pinch e-mail it to me). Ben Bolker
thank you alessio Ben Bolker ha scritto:
Alessio Unisi <franceschi6 <at> unisi.it> writes:
Dear R-users, i need help for this topic! I'm trying to determine if the reproductive success (0=fail, 1=success) of a species of bird is related to a list of covariates. These are the covariates: ? elev: elevation of nest (meters) ? seadist: distance from the sea (meters) ? meanterranova: records of temperature ? minpengS1: records of temperature ? wchillpengS1: records of temperature ? minpengS2: records of temperature ? wchillpengS2: records of temperature ? nnd: nearest neighbour distance ? npd: nearest penguin distance ? eggs: numbers of eggs ? lay: laying date (julian calendar) ? hatch: hatching date (julian calendar) I have some NAs in the data. I want to test the model with all the variable then i want to remove some, but the ideal model: GLM.1 <-lmer(fledgesucc ~ +lay +hatch +elev +seadist +nnd +npd +meanterranova +minpengS1 +minpengS2 +wchillpengS1 +wchillpengS2 +(1|territory), family=binomial(logit), data=fledge) doesn't work because of these errors: 'Warning message: In mer_finalize(ans) : gr cannot be computed at initial par (65)'. "matrix is not symmetric [1,2]" If i delete one or more of the T records (i.e. minpengS2 +wchillpengS2) the model works...below and example: GLM.16 <-lmer(fledgesucc ~ lay +hatch +elev +seadist +nnd +npd +meanterranova +minpengS1 +(1|territory), family=binomial(logit), data=fledge)
> summary(GLM.16)
Generalized linear mixed model fit by the Laplace approximation
Formula: fledgesucc ~ lay + hatch + elev + seadist + nnd + npd +
meanterranova + minpengS1 + (1 | territory)
Data: fledge
AIC BIC logLik deviance
174 204.2 -77 154
Random effects:
Groups Name Variance Std.Dev.
territory (Intercept) 0.54308 0.73694
Number of obs: 152, groups: territory, 96
I can't prove it, but I strongly suspect that some of your coefficients are perfectly multicollinear. Try running your model as a regular GLM: g1 <- glm(fledgesucc ~ +lay +hatch +elev +seadist +nnd +npd +meanterranova +minpengS1 +minpengS2 +wchillpengS1 +wchillpengS2 and see if some of the coefficients are NA. coef(g1) lm() and glm() can handle this sort of "rank-deficient" or multicollinear input, (g)lmer can't, as of now. I suspect that you may be overfitting your model anyway: you should aim for not more than 10 observations per parameter (in your case, since all your predictors appear to be continuous, How many observations are left after na.omit(fledge)? What is the difference between your 'S1' and 'S2' temperature records?
______________________________________________ R-help at r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code. ----- Nessun virus nel messaggio. Controllato da AVG - www.avg.com Versione: 2012.0.1873 / Database dei virus: 2101/4636 - Data di rilascio: 24/11/2011