Skip to content
Prev 47180 / 63421 Next

Support writing UTF-8 output in Windows

On 13-11-10 7:31 AM, Sverre Stausland wrote:
R does support UTF-8.  It all works smoothly in a UTF-8 locale, not so 
smoothly if you have your computer set up to use a different 8 bit encoding.
You got it right for the first 4.  Regarding (2) in your second list, 
that's right, and in fact UTF-8 is not supported as a native encoding.
And point (3) is optional, though UTF-8 is the dominant encoding nowadays.

The easiest solution is for you to switch to a Unix variant and set it 
up to use UTF-8 as the native encoding.

Next easiest would be for Microsoft to add UTF-8 as a code page.

Most difficult would be for R to handle UTF-8 properly on systems with 
limited support for it.

We probably will add small changes that let you work around the Windows 
problems, but they won't be very satisfactory to anyone.  I don't think 
we will make the big changes that would make R look like "a software 
program in 2013", since it would be so much work, and there's such an 
easy workaround.

Duncan Murdoch