aspx thread: ?scraping
Message #1 by "subramanian" <subramanian@t...> on Sun, 29 Oct 2000 07:20:31 -0000
Hi all,
What do we mean by website scraping?
How is it implemented using ASP+?
Regards,
subramanian
Message #2 by "Ben Prater" <bprater@p...> on Mon, 30 Oct 2000 09:59:13 -0500
Sub,
Scraping usually refers to fetching a web page, say news.com, then using a
search tool such as Perl's regular expressions to pull out pieces of data,
say headlines and teasers, and loading them into a data structure that can be
emitted as XML or fed into a database.
It's what XML will prevent in future years. The overhead of scraping is much
higher than necessary, and it's just plain yucky.
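As a rough illustration of that fetch-then-extract idea, here is a minimal sketch in Python (not ASP+ or Perl). The page markup, the `story`/`teaser` class names, and the `scrape_stories` function are all made up for the example; a real page would need its own pattern, and the HTML would come from an HTTP request rather than a string constant.

```python
import re

# Hypothetical markup standing in for a downloaded news page.
SAMPLE_PAGE = """
<div class="story"><h2>Chip prices fall</h2>
<p class="teaser">Memory costs drop again.</p></div>
<div class="story"><h2>New browser ships</h2>
<p class="teaser">The update adds XML support.</p></div>
"""

# One headline followed by its teaser; named groups become the record's keys.
STORY_RE = re.compile(
    r'<h2>(?P<headline>.*?)</h2>\s*<p class="teaser">(?P<teaser>.*?)</p>',
    re.IGNORECASE | re.DOTALL,
)


def scrape_stories(html):
    """Return a list of {'headline': ..., 'teaser': ...} dicts found in html."""
    return [match.groupdict() for match in STORY_RE.finditer(html)]


stories = scrape_stories(SAMPLE_PAGE)
for story in stories:
    print(story["headline"], "-", story["teaser"])
```

The resulting list of dictionaries is the "data structure" step: from there it could be serialized as XML or inserted into a database. The pattern breaks as soon as the site changes its markup, which is the overhead complained about above.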
Ben
Message #3 by Rob Howard <rhoward@m...> on Mon, 30 Oct 2000 07:26:14 -0800
Scraping - parsing HTML to derive information.
The classic example is a site (such as investor.com) that provides stock
quote information. Here's a snip from that site:
<TR><TD ALIGN=RIGHT><A
HREF="http://moneycentral.msn.com/scripts/webquote.dll?iPage=qd&Symbol=MSFT&
FYI=True#Alerts" class=fyistyle>fyi</A> </TD><TD>Last</TD><TD
ALIGN=RIGHT NOWRAP><B> 69 7/16</B></TD><TD WIDTH
ROWSPAN=6> </TD><TD ALIGN=RIGHT> </TD><TD>Open</TD><TD ALIGN=RIGHT
NOWRAP> 67 1/2</TD></TR>
Scraping is logic that requests this HTML page and parses it to return only
the value '69 7/16', the last (current) price.
ASP+ Web Services support a regular expression syntax that lets you parse
an HTML page with regular expression matching, pull that value out, and
return it to the caller.
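To make the matching step concrete, here is a small sketch in Python rather than ASP+; it does not show the ASP+ Web Services syntax itself. It runs against the snippet quoted above (joined onto one line), and the helper names `quote_field` and `to_decimal` are invented for the example. It assumes each label cell is immediately followed by its value cell, which holds for "Last" and "Open" in this snippet but may not hold for the whole page.

```python
import re
from fractions import Fraction

# The table row quoted above, with the email line wraps joined back together.
SNIPPET = (
    '<TR><TD ALIGN=RIGHT><A HREF="http://moneycentral.msn.com/scripts/'
    'webquote.dll?iPage=qd&Symbol=MSFT&FYI=True#Alerts" class=fyistyle>'
    'fyi</A> </TD><TD>Last</TD><TD ALIGN=RIGHT NOWRAP><B> 69 7/16</B></TD>'
    '<TD WIDTH ROWSPAN=6> </TD><TD ALIGN=RIGHT> </TD><TD>Open</TD>'
    '<TD ALIGN=RIGHT NOWRAP> 67 1/2</TD></TR>'
)


def quote_field(html, label):
    """Return the text of the cell that follows the <TD>label</TD> cell."""
    pattern = (
        r"<TD>" + re.escape(label) + r"</TD>"  # the label cell, e.g. Last
        r"\s*<TD[^>]*>"                        # opening tag of the value cell
        r"(?:\s*<B>)?\s*"                      # optional bold wrapper
        r"([^<]+?)\s*<"                        # the value, up to the next tag
    )
    match = re.search(pattern, html, re.IGNORECASE)
    return match.group(1) if match else None


def to_decimal(price_text):
    """Convert a fractional quote such as '69 7/16' to a float."""
    whole, _, frac = price_text.partition(" ")
    return float(int(whole) + (Fraction(frac) if frac else 0))


last = quote_field(SNIPPET, "Last")
open_ = quote_field(SNIPPET, "Open")
print("Last:", last, "=", to_decimal(last))
print("Open:", open_, "=", to_decimal(open_))
```

Keying the pattern on the label cell ("Last", "Open") rather than on position in the row makes it a little more tolerant of layout changes, but it is still tied to this particular markup.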
Thanks,
Rob
Message #4 by subramanian <subramanian@t...> on Tue, 31 Oct 2000 08:55:43 -0800
Thanks a lot. Sweet explanation.
Regards,
Subramanian.