Text Pattern Recognition - Model
Wrong list. This list is about R, not about statistics/statistical learning. Post to a stats list like stats.stackexchange.com for methods issues. Once you figure out what you want to do, R almost certainly can do it -- search to find out what fuctions/packages to use. Cheers, Bert Bert Gunter "Data is not information. Information is not knowledge. And knowledge is certainly not wisdom." -- Clifford Stoll On Mon, Aug 17, 2015 at 7:27 AM, Anshuk Pal Chaudhuri
<anshuk.p at motivitylabs.com> wrote:
Hi,
I have a training dataset which has two columns which has around 70 values.
1. "PNRNo" whose values like UT768G, CXKA, 4IOI59, 4BV7TW...(typical PNR number patterns)
2. I have created one more factor variable mentioning (IsPNR) - so all the values are 1 (true)
My first objective is to create a model on this training set which would recognize the text pattern.
Second objective: The model would then be used to predict IsPNR with new set of test values like "Anshuk", "4EL58S"...as 0 and 1...
Which model would be best for recognizing such kind of pattern and having decent accuracy? I tried naiveBayes, but I don't think it is all doing a good job. Its predicting all the test values as true. I suppose bayes is not meant for this.
Regards,
Anshuk Pal Chaudhuri
[[alternative HTML version deleted]]
______________________________________________ R-help at r-project.org mailing list -- To UNSUBSCRIBE and more, see https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.