Skip to content
Prev 52961 / 63421 Next

issue with unz()?

If you use check.names=FALSE in your call to read.csv you can see that
the first column name starts with the 3 bytes ef bb bf, which is the
UTF-8 "byte-order mark" that Microsoft applications like to put at the
start of a text file stored in UTF-8.
[1] "???Accident_Index"
[1] ef bb bf 41 63 63 69 64 65 6e 74 5f 49 6e 64 65 78

I thought that adding fileEncoding="UTF-8-BOM" or perhaps
encoding="UTF-8-BOM" would take care of the issue, but it does not do
it for me.  You can remove them by hand with substring()
[1] "Accident_Index"
Bill Dunlap
TIBCO Software
wdunlap tibco.com
On Thu, Feb 9, 2017 at 4:13 PM, jing hua zhao <jinghuazhao at hotmail.com> wrote:
Message-ID: <CAF8bMcbqtTvLREZGJUcAgMXrZQZL+4JVOYDuWQ3y4t7DYk2ACg@mail.gmail.com>
In-Reply-To: <LOXP123MB11586751F324220E12A5F3B6A5440@LOXP123MB1158.GBRP123.PROD.OUTLOOK.COM>