Problem with scan() from UTF-8 encoded URL
Thank you for trying. Strange. I am using R version 2.6.0 Patched (2007-11-09 r43408) on OS X and it is not working. I guess it has something to do with the language settings. However.

Regards
Marc Schwenzer

john seers (IFR) wrote:
Hello

Works fine for me:

    data <- scan(file='http://en.wikipedia.org/wiki/Special:Recentchanges', what='character')
    Read 3581 items

So I don't think it is the Wikipedia end.

Regards
John Seers

-----Original Message-----
From: r-help-bounces at r-project.org [mailto:r-help-bounces at r-project.org] On Behalf Of EUROPOL
Sent: 03 December 2007 16:51
To: r-help at stat.math.ethz.ch
Subject: [R] Problem with scan() from UTF-8 encoded URL

Hello,

I am trying to import a website and structure it from within R. The following code:

    data <- scan(file='http://en.wikipedia.org/wiki/Special:Recentchanges', what='character')

results in the error:

    Error in file(file, "r") : unable to open connection
    In addition: Warning message:
    cannot open: HTTP status was '403 Forbidden' in: file(file, "r")

It seems that the error is connected to the UTF-8 format of Wikipedia, since the following line works:

    data <- scan(file='http://www.google.de', what='character')

I am looking forward to your answers.

Greetings
Marc Schwenzer
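
Since the same scan() call succeeds on one machine and fails on another, it may help to collect the failure details and the local settings in one place before comparing notes. The following is only a minimal sketch: try_scan() is a hypothetical helper name, not something from this thread or from base R, and it does not attempt to explain the 403. It wraps the call quoted above so that the error and the warning text (which carried the HTTP status) are returned as data instead of stopping the session.

```r
# Hypothetical helper (not from the thread): run scan() on a source and
# report the outcome instead of stopping at the first error.
try_scan <- function(src) {
  warnings_seen <- character(0)
  result <- tryCatch(
    withCallingHandlers(
      scan(file = src, what = "character", quiet = TRUE),
      warning = function(w) {
        # Keep the warning text; in the report above it held the HTTP status.
        warnings_seen <<- c(warnings_seen, conditionMessage(w))
        invokeRestart("muffleWarning")
      }
    ),
    error = function(e) e  # return the condition object rather than failing
  )
  failed <- inherits(result, "error")
  list(
    source   = src,
    ok       = !failed,
    n_items  = if (failed) NA_integer_ else length(result),
    error    = if (failed) conditionMessage(result) else NA_character_,
    warnings = warnings_seen
  )
}

# Network access is only attempted in an interactive session.
if (interactive()) {
  # Settings worth quoting in a report, given the guess about language settings.
  print(R.version.string)
  print(Sys.getlocale())

  urls <- c("http://en.wikipedia.org/wiki/Special:Recentchanges",
            "http://www.google.de")
  for (u in urls) str(try_scan(u))
}
```

Running the interactive part on both machines and posting the two outputs side by side would show whether the locale actually differs between the working and the failing setup.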
______________________________________________ R-help at r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.