Wrox Home  
Search P2P Archive for: Go

  Return to Index  

aspx thread: ?scraping


Message #1 by "subramanian" <subramanian@t...> on Sun, 29 Oct 2000 07:20:31 -0000
hi all,



what do we mean by website scraping?

how is that implemented using asp+



Regards,



subramanian



Message #2 by "Ben Prater" <bprater@p...> on Mon, 30 Oct 2000 09:59:13 -0500
Sub,



Scraping usually refers to simply sucking up a webpage, say news.com, then

using a search utility, like perl's regex and pulling out pieces of data,

say headlines and teasers and pumping them into a data structure that can be

used as xml or fed into a database.



It's what xml will prevent in future years. Overhead for scraping is much

higher than necessary -- and just plain yucky.



Ben



> -----Original Message-----

> From: subramanian [mailto:subramanian@t...]

> Sent: Sunday, October 29, 2000 2:21 AM

> To: ASP+

> Subject: [aspx] ?scraping

>

>

> hi all,

>

> what do we mean by website scraping?

> how is that implemented using asp+

>

> Regards,

>

> subramanian

>

>

Message #3 by Rob Howard <rhoward@m...> on Mon, 30 Oct 2000 07:26:14 -0800
Scraping - parsing HTML to derive information. 



The classic example is a site (such as investor.com) that provides stock

quote information. Here's a snip from that site:



<TR><TD ALIGN=RIGHT><A

HREF="http://moneycentral.msn.com/scripts/webquote.dll?iPage=qd&Symbol=MSFT&

FYI=True#Alerts" class=fyistyle>fyi</A> </TD><TD>Last</TD><TD

ALIGN=RIGHT NOWRAP><B> 69 7/16</B></TD><TD WIDTH

ROWSPAN=6> </TD><TD ALIGN=RIGHT> </TD><TD>Open</TD><TD ALIGN=RIGHT

NOWRAP> 67 1/2</TD></TR>



Scraping is logic that requests this html page and parses it to return only

the value '67 1/2'; the current value.



ASP+ Web Services supports a regular expression syntax to allow you to

easily parse an html page using regular expression matching to pull that

value out and return it to the caller.



Thanks,

Rob



---



-----Original Message-----

From: subramanian [mailto:subramanian@t...]

Sent: Sunday, October 29, 2000 12:21 AM

To: ASP+

Subject: [aspx] ?scraping





hi all,



what do we mean by website scraping?

how is that implemented using asp+



Regards,



subramanian



Message #4 by subramanian <subramanian@t...> on Tue, 31 Oct 2000 08:55:43 -0800
thanks a lot.sweet explanation



Regards,

Subramanian.





-----Original Message-----

From: Rob Howard [mailto:rhoward@m...]

Sent: Monday, October 30, 2000 7:26 AM

To: ASP+

Subject: [aspx] RE: ?scraping





Scraping - parsing HTML to derive information.



The classic example is a site (such as investor.com) that provides stock

quote information. Here's a snip from that site:



<TR><TD ALIGN=RIGHT><A

HREF="http://moneycentral.msn.com/scripts/webquote.dll?iPage=qd&Symbol=MSFT&

FYI=True#Alerts" class=fyistyle>fyi</A> </TD><TD>Last</TD><TD

ALIGN=RIGHT NOWRAP><B> 69 7/16</B></TD><TD WIDTH

ROWSPAN=6> </TD><TD ALIGN=RIGHT> </TD><TD>Open</TD><TD ALIGN=RIGHT

NOWRAP> 67 1/2</TD></TR>



Scraping is logic that requests this html page and parses it to return only

the value '67 1/2'; the current value.



ASP+ Web Services supports a regular expression syntax to allow you to

easily parse an html page using regular expression matching to pull that

value out and return it to the caller.



Thanks,

Rob



---



-----Original Message-----

From: subramanian [mailto:subramanian@t...]

Sent: Sunday, October 29, 2000 12:21 AM

To: ASP+

Subject: [aspx] ?scraping





hi all,



what do we mean by website scraping?

how is that implemented using asp+



Regards,



subramanian





---

Wrox Professional Web Developer Conference II

http://www.wroxconferences.com/WebDevEurope

November 29th - December 1st 2000, Amsterdam, Netherlands

Create powerful distributed web applications serving you now and

in the future. From ASP+ and VB.NET to XML and SQL Server 2000





  Return to Index