
readHTMLTable (XML package)

Ista,

Thank you. That more or less did the trick. I got the data, though it is in a weird format compared to how it appears on the page and needs a lot of cleanup. But I was kind of expecting that.
Dan


-----Original Message-----
From: Ista Zahn [mailto:istazahn at gmail.com] 
Sent: Tuesday, January 15, 2013 3:18 PM
To: Lopez, Dan
Cc: R help (r-help at r-project.org)
Subject: Re: [R] readHTMLTable (XML package)

Hi Dan,
On Tue, Jan 15, 2013 at 5:31 PM, Lopez, Dan <lopez235 at llnl.gov> wrote:
Ah, good.
Well, that error is not because RCurl doesn't work with the https protocol.
In my original example I meant to show:

tabs <- readHTMLTable(getURL("https://en.wikipedia.org/wiki/List_of_countries_by_population"))

i.e., getURL() does work with https. (Well, maybe depending on your version of libcurl. See the getURL help page for details.)
Yes, I do :)
See below
This is an RCurl FAQ (see http://www.omegahat.org/RCurl/FAQ.html). The quick and dirty way is

getURL("https://hr-workforce-analytics.llnl.gov/wf_pi_pop.html",
       ssl.verifypeer = FALSE)
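
Putting the two steps together (fetch the page with getURL(), then hand the text to readHTMLTable()) might look roughly like the sketch below. It is only a sketch under a few assumptions: parse_tables() and fetch_tables() are hypothetical helper names, not functions from RCurl or XML, and the inline HTML is a made-up stand-in so the parsing step can be tried without network access. Keep in mind that ssl.verifypeer = FALSE turns off certificate checking, so it is only reasonable for a host you already trust.

```r
library(XML)  # provides htmlParse() and readHTMLTable()

# Hypothetical helper: parse every <table> found in a string of HTML.
parse_tables <- function(html) {
  doc <- htmlParse(html, asText = TRUE)
  readHTMLTable(doc, stringsAsFactors = FALSE)
}

# Hypothetical helper: download a page over https, then parse its tables.
# Needs the RCurl package; ssl.verifypeer = FALSE skips certificate checks.
fetch_tables <- function(url) {
  parse_tables(RCurl::getURL(url, ssl.verifypeer = FALSE))
}

# Made-up stand-in page, so no network access is needed here.
html <- paste0(
  "<html><body><table>",
  "<tr><th>Country</th><th>Population</th></tr>",
  "<tr><td>Examplia</td><td>1,234</td></tr>",
  "</table></body></html>"
)
tabs <- parse_tables(html)
str(tabs[[1]])
```

The result is a list with one data frame per table on the page. With stringsAsFactors = FALSE the columns come back as character, so values such as "1,234" still need converting afterwards, for example with as.numeric(gsub(",", "", x)).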

Best,
Ista