Message-ID: <06BA472A-FDA4-4F9B-B3FB-87BFA7090D28@me.com>
Date: 2011-12-28T00:00:37Z
From: Marc Schwartz
Subject: importing NPDB
In-Reply-To: <1325026552.96570.YahooMailNeo@web112316.mail.gq1.yahoo.com>
On Dec 27, 2011, at 4:55 PM, alison korte wrote:
> Does anyone have information about using the National Practitioner Data Bank (NPDB) in R? It's public use data available from the U.S. Department of Health and Human Services that can be downloaded as a .dat or .por file. I can't even figure out how to open the file in R.
The files that are available from:
http://www.npdb-hipdb.hrsa.gov/resources/publicData.jsp
are either tab-delimited ASCII files (.DAT) or SPSS portable (.POR) files.
For R, you would want to use the former and import the files using ?read.delim, which defaults to using a '\t' (TAB) character as the delimiter. See the R Data Import/Export manual for more information:
http://cran.r-project.org/manuals.html
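
As a rough sketch of what that import might look like: the code below writes a tiny tab-delimited stand-in file and reads it back with read.delim(). The column names and values are made up for illustration and are not the real NPDB layout. With the actual download, you would point read.delim() at the .DAT file instead. I am also assuming the file has a header row; if it does not, add header = FALSE.

```r
# Stand-in for the downloaded file: a tiny tab-delimited file with
# made-up column names and values (NOT the real NPDB layout).
demo_file <- tempfile(fileext = ".DAT")
writeLines(c("ID\tYEAR\tAMOUNT",
             "1\t2009\t12500",
             "2\t2010\t"),
           demo_file)

# read.delim() defaults to sep = "\t" and header = TRUE.
# stringsAsFactors = FALSE keeps text columns as character vectors.
npdb <- read.delim(demo_file, stringsAsFactors = FALSE)

# Inspect the column types and the first few rows.
# Blank fields in numeric columns come through as NA.
str(npdb)
head(npdb)
```

For the real file, replace demo_file with the path to the .DAT file you downloaded, then use str() to confirm the columns came in with the types you expect.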
Technical details about the file structure are available in:
http://www.npdb-hipdb.hrsa.gov/resources/reports/PublicUseDataFile-DAT-Format.pdf
BTW, that took about two minutes of searching using Google.
HTH,
Marc Schwartz