Message-ID: <22321.33369.653084.726426@stat.math.ethz.ch>
Date: 2016-05-10T06:40:25Z
From: Martin Maechler
Subject: Regression in match() in R 3.3.0 when matching strings with different character encodings
In-Reply-To: <CAGh0NYpi3fEc70szuKFRMhbuhx=TrGMt0QfCcgWBFHVDz9GbcQ@mail.gmail.com>
>>>>> Peter Haverty <haverty.peter at gene.com>
>>>>> on Mon, 9 May 2016 09:47:48 -0700 writes:
> Dear Kirill,
> You are correct, that is a new bug introduced in PR16491. The appropriate
> fix and regression tests have been added via PR16885, which has been merged
> into trunk. I believe that means the fix will be released with R 3.3.1.
Yes, definitely.
Kirill, as you seem to use code which does trigger the bug, you may want to
switch to using 'R-patched', i.e.,
> R.version.string
[1] "R version 3.3.0 Patched (2016-05-09 r70591)"
(where the Subversion revision must be >= 70591)
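If you want to check this from within a script rather than by eye, something
along the following lines should work. It is only a sketch: the helper name
has_min_svn_rev() is made up for illustration and is not part of R, and it
relies on the "svn rev" component of R.version, which can be "unknown" for
builds not made from a Subversion checkout.

```r
# Hypothetical helper (not part of R): TRUE if the running build reports a
# Subversion revision of at least `min_rev`. The default 70591 is the
# revision mentioned above for R-patched.
has_min_svn_rev <- function(min_rev = 70591L, rev = R.version[["svn rev"]]) {
  # "svn rev" is stored as a character string; non-numeric values such as
  # "unknown" become NA and are treated as "not new enough".
  rev_num <- suppressWarnings(as.integer(rev))
  !is.na(rev_num) && rev_num >= min_rev
}

R.version.string     # human-readable version, as shown above
has_min_svn_rev()    # TRUE if this build is at revision 70591 or later
```

Note that the revision alone says nothing about which branch was built, so
look at R.version.string as well.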
> I checked your example and the second "match" now properly returns 1 with
> the patched code.
> Please have a look at
> https://bugs.r-project.org/bugzilla3/show_bug.cgi?id=16885
> http://developer.r-project.org/blosxom.cgi/R-devel/NEWS
> Thank you for your report. I hope the benefits of this speedup will
> eventually outweigh this unfortunate bug in my PR16491.
I'm pretty sure that your hope will be fulfilled.
> Regards,
> Pete
> ____________________
> Peter M. Haverty, Ph.D.
Martin Maechler, ETH Zurich