<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: False Reports About Yahoo! Blocking Googlebot On del.icio.us</title>
	<atom:link href="http://www.accuracast.com/search-daily-news/seo-7471/false-reports-about-yahoo-blocking-googlebot-on-delicious/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.accuracast.com/search-daily-news/seo-7471/false-reports-about-yahoo-blocking-googlebot-on-delicious/</link>
	<description>Daily news from the world of Internet &#38; mobile search</description>
	<lastBuildDate>Fri, 19 Mar 2010 20:15:05 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Colin Cochrane</title>
		<link>http://www.accuracast.com/search-daily-news/seo-7471/false-reports-about-yahoo-blocking-googlebot-on-delicious/comment-page-1/#comment-3828</link>
		<dc:creator>Colin Cochrane</dc:creator>
		<pubDate>Fri, 22 Feb 2008 01:36:44 +0000</pubDate>
		<guid isPermaLink="false">http://www.accuracast.com/search-daily-news/seo-7471/false-reports-about-yahoo-blocking-googlebot-on-delicious/#comment-3828</guid>
		<description>I felt it would be prudent to respond myself.

1) The directories being blocked were not the issue.  The robots.txt reference was used solely as a list of user-agents to test against.  

2) If del.icio.us is serving these 404s to prevent spoofing, then it is being done to prevent proxy hijacking, not content-scraping.  A content scraper could just spoof a normal Mozilla user-agent if it was worried about getting caught.


On a final note: I&#039;m not sure at what point I became a &quot;self-proclaimed pundit&quot;.  I simply encountered unusual behaviour from del.icio.us, did a little investigating, and wrote about what I found.   People took from that what they did.</description>
		<content:encoded><![CDATA[<p>I felt it would be prudent to respond myself.</p>
<p>1) The directories being blocked were not the issue.  The robots.txt reference was used solely as a list of user-agents to test against.  </p>
<p>2) If del.icio.us is serving these 404s to prevent spoofing, then it is being done to prevent proxy hijacking, not content-scraping.  A content scraper could just spoof a normal Mozilla user-agent if it was worried about getting caught.</p>
<p>On a final note: I&#8217;m not sure at what point I became a &#8220;self-proclaimed pundit&#8221;.  I simply encountered unusual behaviour from del.icio.us, did a little investigating, and wrote about what I found.   People took from that what they did.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
