Handling data with thousands of variables
H?vard Wahl Kongsg?rd wrote:
In machine learning settings it's not uncommon that the data has
thousands of variables. The same is also the case with genetic
studies.
In R what is the best approach for handling such data? Any personal
experience with handling such data in R?
For my case the raw data is a response variable and a unstructured
tuple with string keywords.
1341,{"Harry","Larry","Kline"}
54232,{"Mary","Kline","Larry"}
54232,{"David","Line","Lars"}
- H?vard
_______________________________________________ R-sig-hpc mailing list R-sig-hpc at r-project.org https://stat.ethz.ch/mailman/listinfo/r-sig-hpc
Did you have a look to Bioconductor : http://www.bioconductor.org/ http://manuals.bioinformatics.ucr.edu/home/ht-seq#R_BACK ? IHTH. Kinds, Mauricio
======================================================= Linux user #454569 -- Ubuntu user #17469 ======================================================= "Don't wish for less problems, wish for more skills. Don't wish it were easier, wish you were better." (Jim Rohn)