Query about Text Preprocessing (Encoding)

Sun, May 29, 2016 7:45 AM

On 29/05/2016 3:20 AM, Khadija Shakeel wrote:

R doesn't currently have a translation team (see 
translation.r-project.org) for Urdu, so it may be hard for you to get 
Urdu-specific support.  However, I would guess the problems you are 
having are common to other languages that use non-Roman alphabets, and 
you may get some advice from the translation teams for one of them.

The general issues that I know of are:

  - R needs to know your encoding.  On Unix-alikes the best support is 
for UTF-8; Windows support is weaker, because Windows tends to use 
UTF-16 or other multibyte encodings, and R's support for those is mixed.

  - You need to make sure your graphics device supports your alphabet. 
Not all graphics devices have character support for all languages.

Duncan Murdoch

Query about Text Preprocessing (Encoding)

Thread (3 messages)