Skip to content
Prev 315380 / 398506 Next

readHTMLTable (XML package)

Hi Ista,

It does exist. It?s a page in our company intranet.

It is https so it looks like I can't use RCurl either. I tried RCurl BTW and got the below error.

Do you have experience with pulling a table of an https site? If so how do I do that?
Error in readHTMLTable(getURL("https://hr-workforce-analytics.llnl.gov/wf_pi_pop.html")) : 
  error in evaluating the argument 'doc' in selecting a method for function 'readHTMLTable': Error in function (type, msg, asError = TRUE)  : 
  SSL certificate problem, verify that the CA cert is OK. Details:
error:14090086:SSL routines:SSL3_GET_SERVER_CERTIFICATE:certificate verify failed


Thanks.
Dan

-----Original Message-----
From: Ista Zahn [mailto:istazahn at gmail.com] 
Sent: Tuesday, January 15, 2013 12:22 PM
To: Lopez, Dan
Cc: R help (r-help at r-project.org)
Subject: Re: [R] readHTMLTable (XML package)

Hi Dan,

A couple of things: first, I think that file really does not exist (at least I can't open it in my web browser). Second, even if it did,
url() cannot download from https, according to the details section of ?url, which points you to RCurl. So, once you verify that you url actually exists you can do something like

library(XML)
library(RCurl)
tabs <- readHTMLTable(getURL("http://en.wikipedia.org/wiki/List_of_countries_by_population"))

Best,
Ista
On Tue, Jan 15, 2013 at 2:59 PM, Lopez, Dan <lopez235 at llnl.gov> wrote: