<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>
<channel>
	<title>Comments on: Should scrAPIs be open source?</title>
	<atom:link href="http://thormuller.com/netpositive/2006/03/03/should-scrapis-be-open-source/feed/" rel="self" type="application/rss+xml" />
	<link>http://thormuller.com/netpositive/2006/03/03/should-scrapis-be-open-source/</link>
	<description>In defense of irrational exuberance</description>
	<pubDate>Thu, 21 Aug 2008 17:17:08 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.6</generator>
		<item>
		<title>By: Thor</title>
		<link>http://thormuller.com/netpositive/2006/03/03/should-scrapis-be-open-source/#comment-148</link>
		<dc:creator>Thor</dc:creator>
		<pubDate>Thu, 23 Mar 2006 15:32:54 +0000</pubDate>
		<guid isPermaLink="false">http://thormuller.com/netpositive/2006/03/04/should-scrapis-be-open-source/#comment-148</guid>
		<description>Sean,
Good scrapers should have error checking just as you describe, but given how quick-and-dirty so many scrapers are, I'd guess that most don't. One of the advantages of a scrAPI initiative would be to popularize these best practices and make reusable code readily available. We need to remove all barriers to doing this.</description>
		<content:encoded><![CDATA[<p>Sean,<br />
Good scrapers should have error checking just as you describe, but given how quick-and-dirty so many scrapers are, I&#8217;d guess that most don&#8217;t. One of the advantages of a scrAPI initiative would be to popularize these best practices and make reusable code readily available. We need to remove all barriers to doing this.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: seanohagan</title>
		<link>http://thormuller.com/netpositive/2006/03/03/should-scrapis-be-open-source/#comment-147</link>
		<dc:creator>seanohagan</dc:creator>
		<pubDate>Thu, 23 Mar 2006 02:01:46 +0000</pubDate>
		<guid isPermaLink="false">http://thormuller.com/netpositive/2006/03/04/should-scrapis-be-open-source/#comment-147</guid>
		<description>I've always wanted to implement an automated way for a scraper to catch site changes and to notify the developer.

Each time a site is scraped, certain static features of the site should always be checked. These could include text or images that always occur in the same place; text that always has the same format (i.e., a date, a time, a price, etc.); other static HTML markers; etc.

As soon as the scraper detects that the site's "skeleton" has changed, it alerts the developer so that the proper modifications can be made.

Besides being run each time a scrape is executed, this check could be made hourly or at some other regular interval.

Perhaps this is already a standard feature of most scrapers?</description>
		<content:encoded><![CDATA[<p>I&#8217;ve always wanted to implement an automated way for a scraper to catch site changes and to notify the developer.</p>
<p>Each time a site is scraped, certain static features of the site should always be checked. These could include text or images that always occur in the same place; text that always has the same format (i.e., a date, a time, a price, etc.); other static HTML markers; etc.</p>
<p>As soon as the scraper detects that the site&#8217;s &#8220;skeleton&#8221; has changed, it alerts the developer so that the proper modifications can be made.</p>
<p>Besides being run each time a scrape is executed, this check could be made hourly or at some other regular interval.</p>
<p>Perhaps this is already a standard feature of most scrapers?</p>
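<p>A minimal sketch of the check described above, assuming the static features can be expressed as regular expressions. The check names, the patterns, the sample page text, and the notify_developer placeholder are all hypothetical, chosen only for illustration.</p>

```python
import re
from dataclasses import dataclass


@dataclass
class Check:
    name: str     # label reported to the developer when the check fails
    pattern: str  # regular expression expected to match somewhere in the page


def skeleton_failures(page_text, checks):
    """Return the names of the checks whose pattern no longer matches."""
    return [c.name for c in checks if re.search(c.pattern, page_text) is None]


# Hypothetical static features of an imaginary product page.
CHECKS = [
    Check("heading text", r"Product details"),        # text in a fixed place
    Check("price format", r"\$\d+\.\d{2}"),           # e.g. $19.99
    Check("date format", r"\b\d{4}-\d{2}-\d{2}\b"),   # e.g. 2006-03-23
]


def notify_developer(failures):
    # Placeholder: a real scraper might send an email or write to a log.
    print("Site skeleton changed; failed checks: " + ", ".join(failures))


if __name__ == "__main__":
    page = "Product details | Price: $19.99 | Updated: 2006-03-23"
    failures = skeleton_failures(page, CHECKS)
    if failures:
        notify_developer(failures)
```

<p>The same function could be called on every scrape and also from a scheduler (hourly, say), so that a layout change is noticed even when no scrape is running.</p>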
]]></content:encoded>
	</item>
	<item>
		<title>By: Labnotes &#187; Blog Archive &#187; What exactly is a scrAPI?</title>
		<link>http://thormuller.com/netpositive/2006/03/03/should-scrapis-be-open-source/#comment-146</link>
		<dc:creator>Labnotes &#187; Blog Archive &#187; What exactly is a scrAPI?</dc:creator>
		<pubDate>Sun, 05 Mar 2006 02:01:10 +0000</pubDate>
		<guid isPermaLink="false">http://thormuller.com/netpositive/2006/03/04/should-scrapis-be-open-source/#comment-146</guid>
		<description>[...] &#8217;s also raising the point that there could be an ecosystem of scrAPIs, including unifying services and open source lib [...]</description>
		<content:encoded><![CDATA[<p>[...] &#8217;s also raising the point that there could be an ecosystem of scrAPIs, including unifying services and open source lib [...]</p>
]]></content:encoded>
	</item>
</channel>
</rss>
