Skip to content
Back to formatted view

Raw Message

Message-ID: <425155EC.2060906@statistik.uni-dortmund.de>
Date: 2005-04-04T14:57:48Z
From: Uwe Ligges
Subject: scan html: sep = "<td>"
In-Reply-To: <4251543B.8020707@gmx.ch>

Christoph Lehmann wrote:

> Hi
> I try to import html text and I need to split the fields at each <td> or 
> </td> entry
> 
> How can I succeed? sep = '<td>' doens't yield the right result

If it fits pairwise together, use
   sep=c("<td>", "</td>")

if not, you can read the whole lot with readLines and strsplit for both 
pattern after that, for example.

Uwe Ligges



> thanks for hints
> 
> ______________________________________________
> R-help at stat.math.ethz.ch mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide! 
> http://www.R-project.org/posting-guide.html