
lme for data that is not normally distributed

Ben Bolker

Wed, Aug 3, 2016 1:14 PM

For what it's worth, this graph is assessing
linearity/heteroscedasticity rather than Normality (you would want a Q-Q
plot, not a fitted vs residuals plot, for that).  This doesn't look too
terrible, but there does seem to be a bit of 'flare' at the
large-fitted-value end, which supports Paul's suggestion that you try a
log transformation ...
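
The two diagnostics mentioned above can be drawn side by side. What follows is only a minimal sketch on simulated data, since the full data set was not posted: the data frame `d`, the effect sizes and the log-normal errors are all assumptions made for illustration. It fits the model on the raw and on the log scale, then draws a fitted-vs-residuals plot and a Q-Q plot for each.

```r
library(lme4)

set.seed(1)
# Simulated stand-in data: 10 animals, 2 periods, 15 distances each.
# All numbers here are illustrative assumptions, not the poster's data.
d <- expand.grid(rep = 1:15,
                 Time = c("Pre_dry", "Post_dry"),
                 ID = factor(1:10))
id_eff <- rnorm(10, sd = 0.3)
d$Distance <- exp(1.5 + 0.2 * (d$Time == "Post_dry") +
                  id_eff[as.integer(d$ID)] + rnorm(nrow(d), sd = 0.5))

m_raw <- lmer(Distance ~ Time + (1 | ID), data = d)
m_log <- lmer(log(Distance) ~ Time + (1 | ID), data = d)

op <- par(mfrow = c(2, 2))
# Fitted vs residuals: checks linearity and constant variance ("flare")
plot(fitted(m_raw), resid(m_raw), main = "Raw: fitted vs residuals")
abline(h = 0, lty = 2)
# Q-Q plot: checks Normality of the residuals
qqnorm(resid(m_raw), main = "Raw: Q-Q plot"); qqline(resid(m_raw))
plot(fitted(m_log), resid(m_log), main = "Log: fitted vs residuals")
abline(h = 0, lty = 2)
qqnorm(resid(m_log), main = "Log: Q-Q plot"); qqline(resid(m_log))
par(op)
```

With data generated this way, the raw-scale residuals would be expected to fan out at large fitted values, while the log-scale residuals should look more even.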

On 16-08-03 03:58 PM, moses selebatso via R-sig-mixed-models wrote:

Thank you both, Paul and Alain, for your help. You both point out that
I shouldn't test for normality before running a model. I appreciate
that. Paul, I have tried your new script and I guess you were right
about experience in visually assessing normality: it is not
straightforward. Below is the plot, for your appreciation.

library(lme4)
install.packages("devtools")
library(devtools)
devtools::install_github("pcdjohnson/GLMMmisc")
library(GLMMmisc)
data <- read.csv("clipboard", sep = "\t")
m <- lmer(Distance ~ Time + (1 | ID), data = data)
sim.residplot(m)

Regards,
Moses SELEBATSO
Home: (+267) 318 5219 (H)  Mobile: (+267) 716 39370 or (+267) 738 39370
"Those who will ALWAYS agree with you may be oppressed by you"

On Wednesday, 3 August 2016, 15:54, Paul Johnson
<paul.johnson at glasgow.ac.uk> wrote:



Hi Moses,

I wouldn't test normality of residuals; it is better to assess them by
eye. I know this sounds ad hoc, but given that almost no real
distribution in nature is perfectly normal, the question should be
"how non-normal can the residuals be before seriously harming my
inferences?". This is a more difficult question to answer and
basically requires experience. A test conflates the degree of
non-normality and sample size, so a significant result can mean
"quite normal but high n" while a non-significant result can mean
"very non-normal but low n":

set.seed(1)
x <- rpois(1000, 50)
hist(x)                     # looks beautifully normal
shapiro.test(x)             # significantly non-normal
hist(log(x[1:20]))          # looks pretty bad
shapiro.test(log(x[1:20]))  # passes the test

Given that your distance response measure is (probably) constrained
to be positive, there's a good chance that it's right-skewed and
potentially made more normal by log-transformation (if there are no
zero distances).
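
To make the point about skew concrete, here is a small sketch on simulated values. The log-normal "distances" and the helper `skewness()` are assumptions for illustration, not part of any package used in this thread. It checks that all values are strictly positive before taking logs, then compares the sample skewness on the two scales.

```r
set.seed(1)
# Illustrative right-skewed, strictly positive "distances" (assumed log-normal)
distance <- rlnorm(500, meanlog = 1.5, sdlog = 0.6)

# Simple moment-based sample skewness (near 0 for a symmetric distribution)
skewness <- function(x) mean((x - mean(x))^3) / sd(x)^3

# log() is only usable if every value is strictly positive
stopifnot(all(distance > 0))

skew_raw <- skewness(distance)
skew_log <- skewness(log(distance))
c(raw = skew_raw, log = skew_log)
```

If zeros were present, the positivity check would fail and a different approach (for example a different response distribution) would have to be considered instead of a plain log transformation.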

A good way to visually assess residuals is to plot them against the
fitted values, then compare these to residuals simulated from the
fitted model. They should look similar, give or take sampling
variation. You can do this with a function I recently wrote called
sim.residplot (available here:
https://github.com/pcdjohnson/GLMMmisc), although you'll have to
refit your model using lmer in the lme4 package:

library(lme4)
library(GLMMmisc)
m <- lmer(Distance ~ Time + (1 | ID), data = data)
sim.residplot(m)  # repeat a few times to allow for sampling variation
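
For readers without GLMMmisc installed, the general idea of comparing observed residuals with residuals simulated from the fitted model can be sketched with lme4's own `simulate()` and `refit()`. This is not the source of `sim.residplot`, only an assumed approximation of the idea, and the data frame `d` below is simulated for illustration.

```r
library(lme4)

set.seed(2)
# Illustrative stand-in data (assumed, not the poster's)
d <- expand.grid(rep = 1:15,
                 Time = c("Pre_dry", "Post_dry"),
                 ID = factor(1:10))
d$Distance <- 5 + 1 * (d$Time == "Post_dry") +
  rnorm(10, sd = 1)[as.integer(d$ID)] + rnorm(nrow(d), sd = 2)

m <- lmer(Distance ~ Time + (1 | ID), data = d)

# Simulate one new response vector from the fitted model and refit to it
y_sim <- simulate(m, nsim = 1)[[1]]
m_sim <- refit(m, newresp = y_sim)

op <- par(mfrow = c(1, 2))
plot(fitted(m), resid(m), main = "Observed")
abline(h = 0, lty = 2)
plot(fitted(m_sim), resid(m_sim), main = "Simulated from the model")
abline(h = 0, lty = 2)
par(op)
```

Rerunning the simulate and refit lines a few times gives a feel for how much the right-hand panel varies by chance alone; an observed panel that looks clearly different from all of them suggests a problem with the model.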

Good luck, Paul

On 3 Aug 2016, at 14:25, moses selebatso via R-sig-mixed-models
<r-sig-mixed-models at r-project.org> wrote:

Thank you very much for your helpful advice. I ran the model and tested
the residuals. They are not normally distributed, and I am still
stuck on how to proceed. I tried to copy the output into the email,
but I get an error message saying that the message format cannot be sent.

Regards, Moses

On Wednesday, 3 August 2016, 12:15, Highland Statistics Ltd
<highstat at highstat.com> wrote:

Date: Wed, 3 Aug 2016 09:40:20 +0000 (UTC)
From: moses selebatso <selebatsom at yahoo.co.uk>
To: R-sig-mixed-models <r-sig-mixed-models at r-project.org>
Subject: [R-sig-ME] lme for data that is not normally distributed
Message-ID: <127496753.15122202.1470217220406.JavaMail.yahoo at mail.yahoo.com>
Content-Type: text/plain; charset="UTF-8"

Hello, I have some data that I would like to analyse with mixed models
(lme). As a standard procedure I tested for the normality of the
data and it is not normal. Any ideas on how to deal with this kind
of data? I have a sample below and the model that I was hoping to
use (if the data were normal):
m <- lme(Distance ~ Time, random = ~1 | ID, data = data)


Checking normality of the response variable before doing the
analysis is a misconception. Why should it be normally distributed?
Fit your model and check your residuals for normality.


Alain




| ID     | Time     | Distance |
| 10187A | Pre_dry  | 4.31287  |
| 10187A | Pre_dry  | 6.867578 |
| 10187A | Pre_dry  | 4.640427 |
| 10187A | Post_dry | 4.497807 |
| 10187A | Post_dry | 9.726069 |
| 10187A | Post_dry | 5.150089 |




Regards, Moses SELEBATSO




_______________________________________________ 
R-sig-mixed-models mailing list R-sig-mixed-models at r-project.org 
https://stat.ethz.ch/mailman/listinfo/r-sig-mixed-models

End of R-sig-mixed-models Digest, Vol 116, Issue 4
