Skip to content
Back to formatted view

Raw Message

Message-ID: <03c7bca0-1e73-a9b0-7fd3-431d5abff7ea@effectivedefense.org>
Date: 2022-07-19T17:54:35Z
From: Spencer Graves
Subject: [R-pkg-devel]  Warning... unable to translate 'Ekstr<f8>m' to a wide string; Error... input string 1 is invalid
In-Reply-To: <CAHqSRuTcLCu7wzQMQniFQ_dAi2FdBRuXqKX1rL6SifRGwJPq9A@mail.gmail.com>

On 7/19/22 12:42 PM, Bill Dunlap wrote:
> Adding the initial zeroes is a bit safer, as would be \u{df}; either 
> protects against the next character being a hex digit.? There are 6 byte 
> utf-8 'characters', but I don't think R's parser accepts more than 4.


	  Thanks.  Tomas' blog was good in documenting the need and some of the 
pitfalls, but I don't know the difference between "\ua0", "\u00a0", 
"\u{a0}" or anything else, and I don't know how to find documentation 
that would explain that.  As I wrote years ago, it's hard to RTFM if I 
don't know which FMTR ;-)


	  Most important, I think for my current issue:  How can I find the 
correct development version of help('iconv')?


	  Since I copied the example used in subNonStandardCharacters.Rd from 
help('iconv'), I should be fine if I do what the R Core Team did with 
help('iconv').  Or if I guess and guess wrong, I could get another email 
from Prof Brian Ripley, ordering me to fix something.  I could search 
myself for the current development version of the base package, but I'm 
not sure I'd know if I got the correct version and not some other 
experiment that is different from the actual official development version.


	  ???
	  Spencer

> 
> -Bill
> 
> On Tue, Jul 19, 2022 at 10:32 AM Spencer Graves 
> <spencer.graves at effectivedefense.org 
> <mailto:spencer.graves at effectivedefense.org>> wrote:
> 
>     Hi, Bill, Tomas, et al.:
> 
> 
>     On 7/19/22 12:10 PM, Bill Dunlap wrote:
>      > Have you tried changing the \x's in that file with \u's?
>      >
>      >? > qx <- c("\xf6", "\xf8", "\xdf", "\xfc")
>      >? > Encoding(qx) <- "latin1"
>      >? > qu <- c("\uf6", "\uf8", "\udf", "\ufc")
>      >? > Encoding(qu)
>      > [1] "UTF-8" "UTF-8" "UTF-8" "UTF-8"
>      >? > qx == qu
>      > [1] TRUE TRUE TRUE TRUE
> 
> 
>     I have not tried anything yet for three reasons:
> 
> 
>      ? ? ? ? ? 1.? I don't know that I have access to anything that can
>     do the
>     proper test that's required, so I can know if I've fixed it or not.
> 
> 
>      ? ? ? ? ? 2.? Tomas' blog included examples that seemed to say to
>     replace
>     "\xa0" with "\u00a0", NOT "\ua0", and I don't know if this difference
>     matters or not.
> 
> 
>      ? ? ? ? ? 3.? Can someone provide me with a link to the correct
>     development
>     version of help('iconv')?? The current version includes the exact
>     offending "\x" strings that I have.? If I know the fix in the correct
>     development version of help('iconv'), I can copy that.? Without that,
>     I'm being asked to correct something that may not have been
>     corrected in
>     the development version of the base package.
> 
> 
>      ? ? ? ? ? Thanks,
>      ? ? ? ? ? Spencer
> 
>      >
>      > (charToRaw shows that qu and qx are not byte-for-byte identical:
>     '=='
>      > coerces the latin1 strings to utf-8.)
>      >
>      > -Bill
>      >
>      > On Tue, Jul 19, 2022 at 9:38 AM Spencer Graves
>      > <spencer.graves at effectivedefense.org
>     <mailto:spencer.graves at effectivedefense.org>
>      > <mailto:spencer.graves at effectivedefense.org
>     <mailto:spencer.graves at effectivedefense.org>>> wrote:
>      >
>      >? ? ?Hi, Tomas:
>      >
>      >
>      >? ? ?On 7/19/22 2:20 AM, Tomas Kalibera wrote:
>      >? ? ? >
>      >? ? ? > On 7/19/22 08:37, Spencer Graves wrote:
>      >? ? ? >> Hello:
>      >? ? ? >>
>      >? ? ? >>
>      >? ? ? >> ????? What's the recommended fix for "?Warning in
>      >? ? ?gsub(gsLi$pattern,
>      >? ? ? >> gsLi$replacement, xo) : unable to translate 'Ekstr<f8>m'
>     to a wide
>      >? ? ? >> string; Error in gsub(gsLi$pattern, gsLi$replacement, xo)
>     : input
>      >? ? ? >> string 1 is invalid"?
>      >? ? ? >>
>      >? ? ? >>
>      >? ? ? >> ????? This is in:
>      >? ? ? >>
>      >? ? ? >>
>      >? ? ? >>
>      >
>     https://github.com/sbgraves237/Ecfun/blob/master/man/subNonStandardCharacters.Rd
>     <https://github.com/sbgraves237/Ecfun/blob/master/man/subNonStandardCharacters.Rd>
>      >   
>      ?<https://github.com/sbgraves237/Ecfun/blob/master/man/subNonStandardCharacters.Rd <https://github.com/sbgraves237/Ecfun/blob/master/man/subNonStandardCharacters.Rd>>
>      >
>      >? ? ? >>
>      >? ? ? >>
>      >? ? ? >>
>      >? ? ? >> ????? R-devel is now rejecting some non-ASCII characters
>     that it
>      >? ? ? >> previously accepted;? see below.
>      >? ? ? >
>      >? ? ? > Please see
>      >? ? ? >
>      >
>     https://blog.r-project.org/2022/06/27/why-to-avoid-%5Cx-in-regular-expressions
>     <https://blog.r-project.org/2022/06/27/why-to-avoid-%5Cx-in-regular-expressions>
>      >   
>      ?<https://blog.r-project.org/2022/06/27/why-to-avoid-%5Cx-in-regular-expressions <https://blog.r-project.org/2022/06/27/why-to-avoid-%5Cx-in-regular-expressions>>
>      >
>      >? ? ? >
>      >? ? ? >
>      >? ? ? > Looking at the code I guess you should change the strings
>     in icx
>      >? ? ?to use
>      >? ? ? > \u escapes instead of \x. The use of \x as it is there was
>     probably
>      >? ? ? > correct when the code was ran in Latin-1 encoding, but not
>     in other
>      >? ? ? > encodings. Using \u would make it portable. Feel free to
>     ask more
>      >? ? ?if my
>      >? ? ? > guess is wrong and reading the blog post doesn't help.
>      >
>      >
>      >? ? ? ? ? ? ? ? "subNonStandardCharacters.Rd" copies examples from:
>      >
>      >
>      >
>     https://www.rdocumentation.org/packages/base/versions/3.6.2/topics/iconv
>     <https://www.rdocumentation.org/packages/base/versions/3.6.2/topics/iconv>
>      >   
>      ?<https://www.rdocumentation.org/packages/base/versions/3.6.2/topics/iconv <https://www.rdocumentation.org/packages/base/versions/3.6.2/topics/iconv>>
>      >
>      >
>      >? ? ? ? ? ? ? ? This file still contains "\x" in 5 places.? What's the
>      >? ? ?recommended
>      >? ? ?fix?? Replace "\x" with "\u00" everyplace?
>      >
>      >
>      >? ? ? ? ? ? ? ? I could try that, but I don't know if I have access to
>      >? ? ?platforms that
>      >? ? ?would tell me if I fixed it or not ;-)
>      >
>      >
>      >? ? ? ? ? ? ? ? Thanks very much.
>      >? ? ? ? ? ? ? ? Spencer Graves
>      >
>      >? ? ? >
>      >? ? ? > Best
>      >? ? ? > Tomas
>      >? ? ? >
>      >? ? ? >
>      >? ? ? >
>      >? ? ? >>
>      >? ? ? >>
>      >? ? ? >> ????? Thanks,
>      >? ? ? >> ????? Spencer Graves
>      >? ? ? >>
>      >? ? ? >>
>      >? ? ? >> -------- Forwarded Message --------
>      >? ? ? >> Subject: CRAN package Ecfun and its reverse dependencies
>      >? ? ? >> Date: Wed, 13 Jul 2022 06:34:24 +0100
>      >? ? ? >> From: Prof Brian Ripley <ripley at stats.ox.ac.uk
>     <mailto:ripley at stats.ox.ac.uk>
>      >? ? ?<mailto:ripley at stats.ox.ac.uk <mailto:ripley at stats.ox.ac.uk>>>
>      >? ? ? >> Reply-To: CRAN at R-project.org
>      >? ? ? >> To: veronica.vinciotti at brunel.ac.uk
>     <mailto:veronica.vinciotti at brunel.ac.uk>
>      >? ? ?<mailto:veronica.vinciotti at brunel.ac.uk
>     <mailto:veronica.vinciotti at brunel.ac.uk>>,
>      >? ? ? >> spencer.graves at effectivedefense.org
>     <mailto:spencer.graves at effectivedefense.org>
>      >? ? ?<mailto:spencer.graves at effectivedefense.org
>     <mailto:spencer.graves at effectivedefense.org>>, hamedhaseli at gmail.com
>     <mailto:hamedhaseli at gmail.com>
>      >? ? ?<mailto:hamedhaseli at gmail.com <mailto:hamedhaseli at gmail.com>>,
>      >? ? ? >> dennis.prangle at gmail.com
>     <mailto:dennis.prangle at gmail.com> <mailto:dennis.prangle at gmail.com
>     <mailto:dennis.prangle at gmail.com>>
>      >? ? ? >> CC: CRAN at R-project.org
>      >? ? ? >>
>      >? ? ? >> Dear maintainers,
>      >? ? ? >>
>      >? ? ? >> This concerns the CRAN packages
>      >? ? ? >>
>      >? ? ? >> ? BDWreg DWreg Ecdat Ecfun gk
>      >? ? ? >>
>      >? ? ? >> maintained by one of you:
>      >? ? ? >>
>      >? ? ? >> ? Dennis Prangle <dennis.prangle at gmail.com
>     <mailto:dennis.prangle at gmail.com>
>      >? ? ?<mailto:dennis.prangle at gmail.com
>     <mailto:dennis.prangle at gmail.com>>>: gk
>      >? ? ? >> ? Hamed Haselimashhadi <hamedhaseli at gmail.com
>     <mailto:hamedhaseli at gmail.com>
>      >? ? ?<mailto:hamedhaseli at gmail.com
>     <mailto:hamedhaseli at gmail.com>>>: BDWreg
>      >? ? ? >> ? Spencer Graves <spencer.graves at effectivedefense.org
>     <mailto:spencer.graves at effectivedefense.org>
>      >? ? ?<mailto:spencer.graves at effectivedefense.org
>     <mailto:spencer.graves at effectivedefense.org>>>: Ecfun Ecdat
>      >? ? ? >> ? Veronica Vinciotti<veronica.vinciotti at brunel.ac.uk
>     <mailto:veronica.vinciotti at brunel.ac.uk>
>      >? ? ?<mailto:veronica.vinciotti at brunel.ac.uk
>     <mailto:veronica.vinciotti at brunel.ac.uk>>>: DWreg
>      >? ? ? >>
>      >? ? ? >> We have asked for an update fixing the check problems
>     shown at
>      >? ? ? >>
>     <https://cran.r-project.org/web/checks/check_results_Ecfun.html
>     <https://cran.r-project.org/web/checks/check_results_Ecfun.html>
>      >   
>      ?<https://cran.r-project.org/web/checks/check_results_Ecfun.html
>     <https://cran.r-project.org/web/checks/check_results_Ecfun.html>>>
>      >? ? ? >> with no update from the maintainer thus far.
>      >? ? ? >>
>      >? ? ? >> Thus, package Ecfun is now scheduled for archival on
>     2022-08-08, and
>      >? ? ? >> archiving this will necessitate also archiving its CRAN
>     strong
>      >? ? ?reverse
>      >? ? ? >> dependencies.
>      >? ? ? >>
>      >? ? ? >> Please negotiate the necessary actions.
>      >? ? ? >>
>      >? ? ? >> The CRAN Team
>      >? ? ? >>
>      >? ? ? >> ______________________________________________
>      >? ? ? >> R-package-devel at r-project.org
>     <mailto:R-package-devel at r-project.org>
>      >? ? ?<mailto:R-package-devel at r-project.org
>     <mailto:R-package-devel at r-project.org>> mailing list
>      >? ? ? >> https://stat.ethz.ch/mailman/listinfo/r-package-devel
>     <https://stat.ethz.ch/mailman/listinfo/r-package-devel>
>      >? ? ?<https://stat.ethz.ch/mailman/listinfo/r-package-devel
>     <https://stat.ethz.ch/mailman/listinfo/r-package-devel>>
>      >
>      >? ? ?______________________________________________
>      > R-package-devel at r-project.org
>     <mailto:R-package-devel at r-project.org>
>     <mailto:R-package-devel at r-project.org
>     <mailto:R-package-devel at r-project.org>>
>      >? ? ?mailing list
>      > https://stat.ethz.ch/mailman/listinfo/r-package-devel
>     <https://stat.ethz.ch/mailman/listinfo/r-package-devel>
>      >? ? ?<https://stat.ethz.ch/mailman/listinfo/r-package-devel
>     <https://stat.ethz.ch/mailman/listinfo/r-package-devel>>
>      >
>