Skip to content
Prev 61121 / 63424 Next

Installation failure in non-UTF-8 MBCS locale

On 10/16/22 19:35, G?bor Cs?rdi wrote:
Right, one should use UTF-8 (on all platforms) as the locale encoding.

For historical reasons, one can still parse UTF-8 when R is running e.g. 
in Latin 1 locale. This is still supported as older Windows systems 
don't use UTF-8 as the native encoding, yet.

When R runs in a non-UTF-8 multi-byte locale, it cannot parse UTF-8 R 
input files. This is due to how the parser works and supporting that 
would require a major rewrite which would not be worth the effort 
(instead effort has been spent on supporting UTF-8 as the native 
encoding on Windows).

Best
Tomas