Message-ID: <CAGxFJbTHKsUdfSe6z4xwwm2SbY74Dp2vq4=etrUCSDfTk=4c5Q@mail.gmail.com>
Date: 2015-08-17T16:20:30Z
From: Bert Gunter
Subject: Text Pattern Recognition - Model
In-Reply-To: <SG2PR02MB063496A4D43CBB478903E5FBF0790@SG2PR02MB0634.apcprd02.prod.outlook.com>
Wrong list. This list is about R, not about statistics/statistical
learning. Post to a stats list like stats.stackexchange.com for
methods issues. Once you figure out what you want to do, R almost
certainly can do it -- search to find out what functions/packages to
use.
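
As a rough illustration of that last point, and not a statement about which method is appropriate: if the sample values in the quoted message are representative, the pattern might be as simple as 4 to 6 upper-case letters and digits, which base R's grepl() can check directly. The rule and the name is_pnr are assumptions made up for this sketch, inferred only from the four examples shown. Note also that a training set in which every IsPNR value is 1 gives a classifier no negative examples to learn from, which would be consistent with naiveBayes predicting everything as true.

```r
# Hypothetical rule, inferred only from the four sample values in the
# quoted message: 4 to 6 characters, upper-case letters and digits only.
is_pnr <- function(x) {
  # grepl() is vectorised over x; convert TRUE/FALSE to 1/0 to mirror IsPNR
  as.integer(grepl("^[A-Z0-9]{4,6}$", x))
}

# Sample values copied from the quoted message
train <- c("UT768G", "CXKA", "4IOI59", "4BV7TW")
test  <- c("Anshuk", "4EL58S")

# Every training value should satisfy the assumed rule
stopifnot(all(is_pnr(train) == 1L))

is_pnr(test)
```

Under that assumed rule, "Anshuk" is rejected because it contains lower-case letters, while "4EL58S" is accepted. Real PNR data may well need a different or looser pattern.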
Cheers,
Bert
Bert Gunter
"Data is not information. Information is not knowledge. And knowledge
is certainly not wisdom."
-- Clifford Stoll
On Mon, Aug 17, 2015 at 7:27 AM, Anshuk Pal Chaudhuri
<anshuk.p at motivitylabs.com> wrote:
> Hi,
>
> I have a training dataset with two columns, each holding around 70 values.
>
> 1. "PNRNo", with values like UT768G, CXKA, 4IOI59, 4BV7TW... (typical PNR number patterns)
>
> 2. "IsPNR", a factor variable I created to flag PNRs; all of its values are 1 (true)
>
> My first objective is to create a model on this training set which would recognize the text pattern.
>
> Second objective: The model would then be used to predict IsPNR for a new set of test values like "Anshuk", "4EL58S"... as 0 and 1.
>
> Which model would be best for recognizing this kind of pattern with decent accuracy? I tried naiveBayes, but I don't think it is doing a good job at all. It's predicting all the test values as true. I suppose Bayes is not meant for this.
>
>
> Regards,
> Anshuk Pal Chaudhuri