Skip to content
Prev 300485 / 398503 Next

R code for to check outliers

On 18/07/2012 10:14 AM, Bert Gunter wrote:
I think you are assuming that Sajeeka will handle the outliers 
incorrectly.   It happens often enough, but I don't think it's polite to 
make that assumption.

My answer to the question would have been to ask the question, "how do 
you define outliers?"  Certainly it's possible to define outliers in the 
context of a model, and their presence is an indication of problems with 
the model.  The correct response might be to weaken the assumptions of 
your model and use a robust procedure as Michael suggested (which might 
mean throwing away the outliers), or it might be to change the model in 
some other way.  Your advice to consult a subject matter expert is good, 
but in my experience, they often put more faith in their models than 
they should, so as a statistician, I think you should point out 
discrepancies like outliers.  Which means it's good to have a function 
to detect them.

Duncan Murdoch