An embedded and charset-unspecified text was scrubbed... Name: not available URL: <https://stat.ethz.ch/pipermail/r-help/attachments/20081222/156750dc/attachment.pl>
question about read datafile
2 messages · Lu, Zheng, jim holtman
Read in the data using readLines to read the complete line. Use grep/regexpr to scan for valid lines and then convert them to numeric by using strsplit/as.numeric.
On Mon, Dec 22, 2008 at 2:13 PM, Lu, Zheng <Zheng.Lu at mpi.com> wrote:
Dear all:
I have been thinking to import below one data file (.txt)into R by
read.table(..,skip=1, header=T). But How can I deal with the repeated
rows of TABLE NO.1 and names of data variables in the middle of this
data file. The similar block will be repeated 100 times, here only show
4 of them and within each block, data records also can vary, here only
paste 4 rows for example. I appreciate your consideration and help in
this holiday season. Happy Holiday!
TABLE NO. 1
ID GID TIME OBS AMT EVID
RATE ADDL II CMT WT IPRE
3.1000E+01 1.0000E+00 0.0000E+00 0.0000E+00 1.0000E+00 1.0000E+00
0.0000E+00 0.0000E+00 0.0000E+00 4.0000E+00 2.4000E-02 3.3918E+02
3.1000E+01 1.0000E+00 0.0000E+00 2.0500E+02 0.0000E+00 0.0000E+00
0.0000E+00 0.0000E+00 0.0000E+00 4.0000E+00 2.4000E-02 2.6267E+02
3.1000E+01 1.0000E+00 9.6000E+01 4.2100E+02 0.0000E+00 0.0000E+00
0.0000E+00 0.0000E+00 0.0000E+00 4.0000E+00 2.4000E-02 3.1781E+02
TABLE NO. 1
ID GID TIME OBS AMT EVID
RATE ADDL II CMT WT IPRE
3.1000E+01 1.0000E+00 0.0000E+00 0.0000E+00 1.0000E+00 1.0000E+00
0.0000E+00 0.0000E+00 0.0000E+00 4.0000E+00 2.4000E-02 5.7557E+01
3.1000E+01 1.0000E+00 0.0000E+00 2.0500E+02 0.0000E+00 0.0000E+00
0.0000E+00 0.0000E+00 0.0000E+00 4.0000E+00 2.4000E-02 8.8583E+01
3.1000E+01 1.0000E+00 9.6000E+01 4.2100E+02 0.0000E+00 0.0000E+00
0.0000E+00 0.0000E+00 0.0000E+00 4.0000E+00 2.4000E-02 1.7342E+02
3.1000E+01 1.0000E+00 1.6800E+02 5.3100E+02 0.0000E+00 0.0000E+00
0.0000E+00 0.0000E+00 0.0000E+00 4.0000E+00 2.4000E-02 2.0179E+02
TABLE NO. 1
ID GID TIME OBS AMT EVID
RATE ADDL II CMT WT IPRE
3.1000E+01 1.0000E+00 0.0000E+00 0.0000E+00 1.0000E+00 1.0000E+00
0.0000E+00 0.0000E+00 0.0000E+00 4.0000E+00 2.4000E-02 1.4389E+02
3.1000E+01 1.0000E+00 0.0000E+00 2.0500E+02 0.0000E+00 0.0000E+00
0.0000E+00 0.0000E+00 0.0000E+00 4.0000E+00 2.4000E-02 2.6147E+02
3.1000E+01 1.0000E+00 9.6000E+01 4.2100E+02 0.0000E+00 0.0000E+00
0.0000E+00 0.0000E+00 0.0000E+00 4.0000E+00 2.4000E-02 2.2634E+02
3.1000E+01 1.0000E+00 1.6800E+02 5.3100E+02 0.0000E+00 0.0000E+00
0.0000E+00 0.0000E+00 0.0000E+00 4.0000E+00 2.4000E-02 4.0733E+02
TABLE NO. 1
ID GID TIME OBS AMT EVID
RATE ADDL II CMT WT IPRE
3.1000E+01 1.0000E+00 0.0000E+00 0.0000E+00 1.0000E+00 1.0000E+00
0.0000E+00 0.0000E+00 0.0000E+00 4.0000E+00 2.4000E-02 2.2003E+02
3.1000E+01 1.0000E+00 0.0000E+00 2.0500E+02 0.0000E+00 0.0000E+00
0.0000E+00 0.0000E+00 0.0000E+00 4.0000E+00 2.4000E-02 3.2116E+02
3.1000E+01 1.0000E+00 9.6000E+01 4.2100E+02 0.0000E+00 0.0000E+00
0.0000E+00 0.0000E+00 0.0000E+00 4.0000E+00 2.4000E-02 3.3642E+02
3.1000E+01 1.0000E+00 1.6800E+02 5.3100E+02 0.0000E+00 0.0000E+00
0.0000E+00 0.0000E+00 0.0000E+00 4.0000E+00 2.4000E-02 4.7881E+02
...
...
...
zheng
This e-mail, including any attachments, is a confidential business communication, and may contain information that is confidential, proprietary and/or privileged. This e-mail is intended only for the individual(s) to whom it is addressed, and may not be saved, copied, printed, disclosed or used by anyone else. If you are not the(an) intended recipient, please immediately delete this e-mail from your computer system and notify the sender. Thank you.
[[alternative HTML version deleted]]
______________________________________________ R-help at r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Jim Holtman Cincinnati, OH +1 513 646 9390 What is the problem that you are trying to solve?