I have an input CSV with columns of irregular length that contain NAs. I want to strip the NAs out and produce a multiple violin plot, automatically labelled using the column headers. At the moment I do this:
Code:
ds1 = read.csv("http://www.lecturematerials.co.uk/data/spelling.csv")
library(vioplot)
y6<-na.omit(ds1$y6)
y5<-na.omit(ds1$y5)
y4<-na.omit(ds1$y4)
y3<-na.omit(ds1$y3)
y2<-na.omit(ds1$y2)
y1<-na.omit(ds1$y1)
vioplot(y6, y5, y4, y3, y2, y1, horizontal=TRUE, names=c("Y6", "Y5", "Y4", "Y3", "Y2", "Y1"), col="lightblue")
Two queries:
1. Is there a more elegant way of automatically stripping the NAs and passing the columns to the function along with the header names?
ds2 <- lapply( ds1, na.omit)
2. Can I easily add the sample size to each violin plotted?
No documentation for ‘violplot’ in specified packages and libraries:
you could try ‘??violplot’
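One possible approach to both queries, as a rough sketch rather than a definitive answer: strip the NAs from every column with lapply(), count what is left in each column, and hand the resulting list to vioplot() through do.call(). The toy data frame below is made up purely so the example runs on its own (the real data would come from read.csv() as in the code above), and putting the sample size into the axis label is just one way of showing it. This assumes vioplot() accepts the data vectors as positional arguments, which is how it is called in the original code.

```r
# Toy stand-in for the spelling data (hypothetical values, for illustration only).
# In practice: ds1 <- read.csv("http://www.lecturematerials.co.uk/data/spelling.csv")
set.seed(1)
ds1 <- data.frame(
  y1 = c(rnorm(20, mean = 10, sd = 2), rep(NA, 10)),
  y2 = c(rnorm(25, mean = 12, sd = 2), rep(NA, 5)),
  y3 = rnorm(30, mean = 14, sd = 2)
)

# Drop the NAs from every column at once; the result is a named list
# whose elements may have different lengths
ds2 <- lapply(ds1, function(x) x[!is.na(x)])

# Sample size per column, then axis labels built from the headers
n <- sapply(ds2, length)
labels <- paste0(toupper(names(ds2)), " (n=", n, ")")

# Only plot if the vioplot package is installed
if (requireNamespace("vioplot", quietly = TRUE)) {
  # unname() so each column is passed positionally, as in vioplot(y1, y2, ...),
  # rather than as a named argument
  do.call(vioplot::vioplot,
          c(unname(ds2),
            list(names = labels, horizontal = TRUE, col = "lightblue")))
}
```

The violins are drawn in the order the columns appear in the file; wrapping the list and the labels in rev() would reverse that order if needed.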