An embedded and charset-unspecified text was scrubbed... Name: not available URL: <https://stat.ethz.ch/pipermail/r-help/attachments/20110123/77d9720d/attachment.pl>
Extracting information from text data
2 messages · Deb Midya, Peter Ehlers
On 2011-01-23 19:28, Deb Midya wrote:
Hi R-Users, Thanks in advance. I am using R-2.12.0 on Windows XP. I am trying to produce an n X m matrix from text data stored in different files. Where n = number of words (say w1, w2, ?, wn). M is the number of documents (say d1, d2, ?, dm) A. Using package tm I am using package tm to do the job. I have provided the code below:
my.corpus<- Corpus(DirSource(my.path), readerControl = list (reader=readPlain))
In readLines(y, encoding = x$Encoding) : incomplete final line found on 'M:\textmine/slr.txt'
So it looks like your slr.txt file has a problem. Inspect it with your editor.
x<- TermDocMatrix(my.corpus)
Error: could not find function "TermDocMatrix"
Where did you get the idea that package tm has this function? I see a function TermDocumentMatrix(). As you can see, R provides a very helpful reminder that you should check the name of the function. Peter Ehlers
B. Using package(s) other than tm Once again, thank you very much for the time you have given. Regards, Deb The code: library(tm) my.path<- 'M:\\textmine' my.corpus<- Corpus(DirSource(my.path), readerControl = list (reader=readPlain)) x<- TermDocMatrix(my.corpus) x [[alternative HTML version deleted]]