VB6 HTML Help

bwhit · April 28th, 2006, 02:33 PM

I am new to HTML parsing with VB6. I am using a Webbrowser control to access a web page. I have used the following code to get the NAME field.

text1.text = WebBrowser1.Document.getElementsByTagName("input") .Item(j).Name

I am trying to extract the DOE,JOHN from the following HTML line:

</TD><TD CLASS="bckGray"> </TD><TD CLASS="bckGray">DOE, JOHN</TD><TD CLASS="bckGray">DOEJ</TD><TD CLASS="bckGray" align="CENTER">08 - 2006<INPUT VALUE="0" NAME="isCorrection464319" TYPE="HIDDEN">

All help would be greatly appreciated for this first time poster.
Thanks,
bwhit

Chintue · May 13th, 2006, 03:04 AM

I don't have any solid code to post for you, but I ran into a similar problem when I was trying to extract text records from their HTML output. Basically, what I ended up doing was the following:
Read things in lines.
In each line, look for the first ">"
This is the first place that you're likely to find data, if you read the next character. If the space between > and the next < is only 1, then you know that you're dealing with tags that run together. </TD><TD CLASS="bckGray"> for example.

What I did was go through and check to see where there were spaces greater than 1 between the close and opening of tags. <B>Here is text</B>. In your case, it looks like you're going to need to add a little extra checking to take out all of the nbsp garbage. I hope this idea makes sense, feel free to e-mail me.