On Feb 25, 2022, at 10:58 AM, James Pustejovsky <jepusto at gmail.com> wrote:
The pdftools package might be helpful:
https://github.com/ropensci/pdftools <https://github.com/ropensci/pdftools>
It has very low-level utilities for extracting text from pdf. You'd still have to do some data clean-up to get the correlations into the form needed for analysis.
The tabulizer package is meant to provide tools customized for working with pdf tables:
https://github.com/ropensci/tabulizer <https://github.com/ropensci/tabulizer>
But it requires Java and it appears to be archived on CRAN. I'm not sure what its development status is. Caveat emptor, I guess.
James
On Fri, Feb 25, 2022 at 12:20 PM Kiet Huynh <kietduchuynh at gmail.com <mailto:kietduchuynh at gmail.com>> wrote:
Hello,
I was wondering if anyone knows of a way to automate in R (or any software) the process of importing correlation values from PDF to usable data in a table format that can be used in meta-analysis? My process has been to copy the correlations manually one-by-one from the PDF to excel (which takes a lifetime!), and then import the excel data into R. I'm sure there must be a better, faster, and less error-prone way to do this.
Thank you,
Kiet
----
Kiet D. Huynh, Ph.D.
Pronouns: he/him
CLEAR Goldblum-Carr Postdoctoral Fellow
Palo Alto University
1791 Arastradero Rd.
Palo Alto, CA 94304
[[alternative HTML version deleted]]