<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Notes, links and conversation &#187; Technorati</title>
	<atom:link href="http://pascal.vanhecke.info/category/technorati/feed/" rel="self" type="application/rss+xml" />
	<link>http://pascal.vanhecke.info</link>
	<description></description>
	<lastBuildDate>Thu, 11 Aug 2011 14:38:56 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>If incoming links meant money&#8230;</title>
		<link>http://pascal.vanhecke.info/2005/10/20/if-incoming-links-meant-money/</link>
		<comments>http://pascal.vanhecke.info/2005/10/20/if-incoming-links-meant-money/#comments</comments>
		<pubDate>Thu, 20 Oct 2005 21:15:30 +0000</pubDate>
		<dc:creator>Pascal Van Hecke</dc:creator>
				<category><![CDATA[Conversation]]></category>
		<category><![CDATA[Technorati]]></category>
		<category><![CDATA[WebWatch]]></category>

		<guid isPermaLink="false">http://pascal.vanhecke.info/2005/10/20/if-incoming-links-meant-money/</guid>
		<description><![CDATA[Recipe for instant blog fame: use some Web2.0 api and let your visitors visualize or recombine their own user data, or their own blog&#8217;s data. That was a bit the case for my own really simple &#8220;Top sources of del.icio.us links&#8221; posting, and it is definitely the case for this &#8220;Blog valution calculator&#8221;. It builds [...]<div class="tantan-getcomments"><a href="http://pascal.vanhecke.info/2005/10/20/if-incoming-links-meant-money/#comments"><img src="http://pascal.vanhecke.info/wp-content/plugins/tantan/get-comments.php?p=120" width="100" height="15" style="border:0;" /></a></div>]]></description>
			<content:encoded><![CDATA[<p>Recipe for instant blog fame: use some <a href="http://www.programmableweb.com/apis">Web2.0 api</a> and let your visitors visualize or recombine their own user data, or their own blog&#8217;s data.  That was a bit the case for my own really simple &#8220;<a href="http://pascal.vanhecke.info/2005/10/04/top-sources-of-delicious-links/">Top sources of del.icio.us links</a>&#8221; posting, and it is definitely the case for this <a href="http://www.business-opportunities.biz/projects/how-much-is-your-blog-worth/">&#8220;Blog valution calculator&#8221;</a>. <span id="more-120"></span><br />
It builds on a <a href="http://www.tnl.net/blog/entry/Doing_the_numbers_on_the_AOL-WeblogsInc_deal">provocative analysis</a> of the <a href="http://weblogsinc.com/">Weblogs Inc</a> takeover, uses the <a href="http://technorati.com/developers/tools/libraries.html">Technorati API</a> to count incoming links and renders this nice copy-pastable (another part of the recipe!) snippet:</p>
<div align="center">
<div style="border: 1px solid #cccccc; background-color: white; width: 115px; text-align: center; padding: 0 0 10px 0;">
<p style="margin: 0; text-align: center;"><img src="http://static.flickr.com/23/25822676_789bf55448_t.jpg" style="border:0;"/><br /> 		<span style="font-size: 11px;">My <a href="http://pascal.vanhecke.info">blog</a> is worth <b>$19,194.36</b>.</span><br /><span style="font-size: 10px;"><a href="http://www.business-opportunities.biz/projects/how-much-is-your-blog-worth/">How much is your blog worth?</a></span></p>
</div>
</div>
<p>The backlink will definitely push the author&#8217;s <a href="http://www.business-opportunities.biz">weblog</a> valuation!  (<a href="http://www.weblogzonderhaast.nl/index.php/weblog/deze-blog-is-geld-waard/">via</a>)</p>
<p>Note: you might be disappointed if you haven&#8217;t been posting for quite some time, <a href="http://www.zylstra.org/blog/archives/001804.html">this remark</a> by Ton Zijlstra explains why: </p>
<blockquote><p> &#8220;Instead of counting the total of all linking sites in the now almost 3 years of Technorati&#8217;s existence, they now look at the figures of the past 6 months. That favors dynamics in blogs more than seniority, which figures given the nature of blogs, doesn&#8217;t it?&#8221;</p></blockquote>
<div class="tantan-getcomments"><a href="http://pascal.vanhecke.info/2005/10/20/if-incoming-links-meant-money/#comments"><img src="http://pascal.vanhecke.info/wp-content/plugins/tantan/get-comments.php?p=120" width="100" height="15" style="border:0;" /></a></div>]]></content:encoded>
			<wfw:commentRss>http://pascal.vanhecke.info/2005/10/20/if-incoming-links-meant-money/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Google Blog Search Bookmarklet</title>
		<link>http://pascal.vanhecke.info/2005/09/15/google-blog-search-bookmarklet/</link>
		<comments>http://pascal.vanhecke.info/2005/09/15/google-blog-search-bookmarklet/#comments</comments>
		<pubDate>Thu, 15 Sep 2005 00:03:59 +0000</pubDate>
		<dc:creator>Pascal Van Hecke</dc:creator>
				<category><![CDATA[Google]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[Technorati]]></category>

		<guid isPermaLink="false">http://pascal.vanhecke.info/2005/09/15/google-blog-search-bookmarklet/</guid>
		<description><![CDATA[You can tell the difference between a cherished and nourished product like Technorati&#8217;s blog search from an early version like Google &#8216;s by small things, such as: have they thought of a bookmarklet? I didn&#8217;t find any mention of it, but it&#8217;s only a matter of find-and-replace to adapt Technorati&#8217;s favelet*: Update Aug 27th 2006: [...]<div class="tantan-getcomments"><a href="http://pascal.vanhecke.info/2005/09/15/google-blog-search-bookmarklet/#comments"><img src="http://pascal.vanhecke.info/wp-content/plugins/tantan/get-comments.php?p=107" width="100" height="15" style="border:0;" /></a></div>]]></description>
			<content:encoded><![CDATA[<p>You can tell the difference between a cherished and nourished product like <a href="http://technorati.com/search/">Technorati&#8217;s blog search</a> from an early version like <a href="http://blogsearch.google.com/">Google &#8216;s</a> by small things, such as: have they thought of a <a rel = "tag" href="http://en.wikipedia.org/wiki/Bookmarklet">bookmarklet</a>?   I didn&#8217;t find any <a href="http://blogsearch.google.com/blogsearch?q=google+%22blog+search%22+bookmarklet">mention of it</a>, but it&#8217;s only a matter of find-and-replace to adapt Technorati&#8217;s <a rel = "tag" href="http://en.wikipedia.org/wiki/Favelet">favelet</a>*:<span id="more-107"></span></p>
<p><strong>Update Aug 27th 2006: see <a href="http://pascal.vanhecke.info/2006/08/27/google-blog-search-bookmarklets-updated/">here</a></strong></p>
<h3 id="toc-bookmarklet-installation">Bookmarklet installation**</h3>
<p><strong>Firefox</strong></p>
<ul>
<li>Right-click this link: <a class="bookmarklet" href="javascript:w=window;d=document;var%20u;s='';if%20(d.selection)%20%7Bs=d.selection.createRange().text;%7D%20else%20if%20(d.getSelection!=u)%20%7Bs=d.getSelection();%7D%20else%20if%20(w.getSelection!=u)%20%7Bs='link%3A'+w.getSelection();%7D%20if%20(s.length%3C2)%7Bif(String(w.location).substring(0,6)=='about:')%7Bs=prompt('Google%20Blog%20Search%20for:',s);%7Delse%7Bs='link%3A'+w.location;%7D%7Dif%20(s!=null)%20w.location='http%3A//blogsearch.google.com/blogsearch?q='+escape(s);void(1);" title="Google Blog Search"><strong>Google Blog Search</strong></a> and choose &#8220;Bookmark This Link&#8221;</li>
<li>Click the &#8220;Create in&#8221; dropdown and choose &#8220;Bookmarks Toolbar Folder&#8221;</li>
</ul>
<p><strong>Internet Explorer</strong></p>
<ul>
<li>Right-click this link: <a class="bookmarklet" href="javascript:w=window;d=document;var%20u;s='';if%20(d.selection)%20%7Bs=d.selection.createRange().text;%7D%20else%20if%20(d.getSelection!=u)%20%7Bs=d.getSelection();%7D%20else%20if%20(w.getSelection!=u)%20%7Bs='link%3A'+w.getSelection();%7D%20if%20(s.length%3C2)%7Bif(String(w.location).substring(0,6)=='about:')%7Bs=prompt('Google%20Blog%20Search%20for:',s);%7Delse%7Bs='link%3A'+w.location;%7D%7Dif%20(s!=null)%20w.location='http%3A//blogsearch.google.com/blogsearch?q='+escape(s);void(1);" title="Google Blog Search"><strong>Google Blog Search</strong></a> and choose  &#8220;Add to Favorites&#8221;</li>
<li>Click &#8220;Yes&#8221; if there is a &#8220;May not be safe&#8221; popup</li>
<li>Choose &#8220;Links&#8221; in the &#8220;Create in&#8221; pane and click &#8220;OK&#8221;</li>
</ul>
<p>Now you&#8217;re off keywordsearching (when a few keywords in your browser window are selected) and linksearching (for blog links pointing to your current location).</p>
<p>What you get is the default search&#8230; you can adapt the &#8220;blogsearch?q=&#8221; phrase at the end of the javascript to any of the <a href="http://blogsearch.google.com/blogsearch/advanced_blog_search">advanced options</a> by test-and-replace the url.</p>
<p>BTW, if you use the <a href="http://mycroft.mozdev.org/">Firefox search plugin</a>, there&#8217;s already a <a href="http://mycroft.mozdev.org/quick/google.html">version available</a> for Blog search.</p>
<p>*  <em>which is how Technorati calls <a rel = "tag" href="http://en.wikipedia.org/wiki/Bookmarklets">bookmarklets</a></em><br />
** <em>instructions copied and adapted from <a href="http://del.icio.us/doc/bookmarklets">del.icio.us</a>, Safari and Opera users are probably intelligent enough to do it on their own ;-)</em></p>
<div class="tantan-getcomments"><a href="http://pascal.vanhecke.info/2005/09/15/google-blog-search-bookmarklet/#comments"><img src="http://pascal.vanhecke.info/wp-content/plugins/tantan/get-comments.php?p=107" width="100" height="15" style="border:0;" /></a></div>]]></content:encoded>
			<wfw:commentRss>http://pascal.vanhecke.info/2005/09/15/google-blog-search-bookmarklet/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>Hunting spamblogs continued: splogs and  splogreporter</title>
		<link>http://pascal.vanhecke.info/2005/08/22/hunting-spamblogs-continued-splogs-and-splogreporter/</link>
		<comments>http://pascal.vanhecke.info/2005/08/22/hunting-spamblogs-continued-splogs-and-splogreporter/#comments</comments>
		<pubDate>Mon, 22 Aug 2005 21:18:34 +0000</pubDate>
		<dc:creator>Pascal Van Hecke</dc:creator>
				<category><![CDATA[Spamblogs]]></category>
		<category><![CDATA[Technorati]]></category>
		<category><![CDATA[WebWatch]]></category>
		<category><![CDATA[blogsearch]]></category>
		<category><![CDATA[icerocket]]></category>
		<category><![CDATA[spam]]></category>
		<category><![CDATA[splog]]></category>
		<category><![CDATA[vigilantism]]></category>

		<guid isPermaLink="false">http://pascal.vanhecke.info/2005/08/22/hunting-spamblogs-continued-splogs-and-splogreporter/</guid>
		<description><![CDATA[&#8220;Spamblog&#8221; got rebranded as splog. Mark Cuban (from IceRocket, a worthy competitor for Technorati) suggested email-verification. &#8220;Somewhat Frank&#8221; came up with SplogReporter.com, a centralised public blacklist of spamblogs, similar to my (failed) suggestion of tagging spamblogs using services like furl or del.icio.us. I still don&#8217;t think it will work however. Apart from the fact that [...]<div class="tantan-getcomments"><a href="http://pascal.vanhecke.info/2005/08/22/hunting-spamblogs-continued-splogs-and-splogreporter/#comments"><img src="http://pascal.vanhecke.info/wp-content/plugins/tantan/get-comments.php?p=86" width="100" height="15" style="border:0;" /></a></div>]]></description>
			<content:encoded><![CDATA[<p>&#8220;Spamblog&#8221; got rebranded as <a href="http://en.wikipedia.org/wiki/Splog" rel = "tag">splog</a>.  Mark Cuban (from <a href="http://www.icerocket.com/">IceRocket</a>, a worthy competitor for <a href="http://www.technorati.com/">Technorati</a>) <a href="http://www.blogmaverick.com/entry/1234000870054492/">suggested email-verification</a>. &#8220;Somewhat Frank&#8221; <a href="http://www.somewhatfrank.com/2005/08/what_is_splog_a.html">came up with</a> <a href="http://www.splogreporter.com/">SplogReporter.com</a>, a centralised public blacklist of spamblogs, similar to my (failed) <a href="http://pascal.vanhecke.info/2005/06/01/hunting-spamblogs/">suggestion of tagging spamblogs</a> using services like furl or del.icio.us.  I still don&#8217;t think it will work however.<span id="more-86"></span></p>
<p>Apart from the fact that this will lead to just another arms race with black hat bots &#8220;<a href="http://seoblackhat.com/2005/08/22/splog-splogs/">flagging multiple random blogs as splogs</a>&#8220;, (a counterattack you &#8216;ll have to ward off with <a href="http://pascal.vanhecke.info/2005/06/01/hunting-spamblogs/#comment-22">some intelligent reputation algorithms</a>), I think the idea doesn&#8217;t work for the same reasons tagging didn&#8217;t work:
<ul>
<li>there is no <em>immediate reward</em> for the user denouncing a blog as splog, as long as blog search engines don&#8217;t integrate this in their interface and leave out the splogged results immediately (for that user at least)</li>
<li>Franks <a href="http://www.somewhatfrank.com/2005/08/what_is_splog_a.html">suggestion</a> to the blog search engines to have a link to Splogreporter is nice, but I’m afraid none of the services (Technorati, Feedster, Pubsub, Yahoo Myweb2.0,…) will be motivated to share its results though (the ability to effectively filter out spam being a major competitive advantage!).</li>
</ul>
<div class="tantan-getcomments"><a href="http://pascal.vanhecke.info/2005/08/22/hunting-spamblogs-continued-splogs-and-splogreporter/#comments"><img src="http://pascal.vanhecke.info/wp-content/plugins/tantan/get-comments.php?p=86" width="100" height="15" style="border:0;" /></a></div>]]></content:encoded>
			<wfw:commentRss>http://pascal.vanhecke.info/2005/08/22/hunting-spamblogs-continued-splogs-and-splogreporter/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Hunting spamblogs</title>
		<link>http://pascal.vanhecke.info/2005/06/01/hunting-spamblogs/</link>
		<comments>http://pascal.vanhecke.info/2005/06/01/hunting-spamblogs/#comments</comments>
		<pubDate>Tue, 31 May 2005 23:57:24 +0000</pubDate>
		<dc:creator>Pascal Van Hecke</dc:creator>
				<category><![CDATA[del.icio.us]]></category>
		<category><![CDATA[SocialSoftware]]></category>
		<category><![CDATA[Spamblogs]]></category>
		<category><![CDATA[Technorati]]></category>

		<guid isPermaLink="false">http://pascal.vanhecke.info/2005/05/20/hunting-spamblogs/</guid>
		<description><![CDATA[Several people have been complaining about Spamblogs and suggesting remedies: measures by weblog providers, such as Blogger (&#8220;Incorporated Subversion&#8221;) cyber vigilantism (&#8220;Simple Thoughts&#8221;) reputation-based filtering (Ross Mayfield, he links to an interview with a spammer btw!) search engine countermeasures (&#8220;Micro Persuasion&#8221;) It&#8217;s probably search engines that hold the key here. Spamblogs are annoying, not because [...]<div class="tantan-getcomments"><a href="http://pascal.vanhecke.info/2005/06/01/hunting-spamblogs/#comments"><img src="http://pascal.vanhecke.info/wp-content/plugins/tantan/get-comments.php?p=39" width="100" height="15" style="border:0;" /></a></div>]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.loiclemeur.com/english/2005/05/brands_blog_spa.html">Several</a> <a href="http://www.psfk.com/2005/03/could_spamblogs.html">people</a> have been complaining about <a href="http://pascal.vanhecke.info/2005/05/30/spamblogs/">Spamblogs</a> and suggesting remedies: </p>
<ul>
<li><a href="http://incsub.org/blog/?p=475">measures by weblog providers</a>, such as Blogger (&#8220;Incorporated Subversion&#8221;)</li>
<li><a href="http://blog.taragana.com/index.php/archive/new-horizons-in-spamming-aka/">cyber vigilantism</a> (&#8220;Simple Thoughts&#8221;)</li>
<li><a href="http://ross.typepad.com/blog/2005/04/persistent_spam.html">reputation-based filtering</a> (Ross Mayfield, he links to an interview with a spammer btw!)</li>
<li><a href="http://www.micropersuasion.com/2005/03/technorati_sees.html">search engine countermeasures</a> (&#8220;Micro Persuasion&#8221;)</li>
</ul>
<p>It&#8217;s probably search engines that hold the key here.<span id="more-39"></span>  Spamblogs are annoying, not because of their mere existence, but because you notice them: they show up in search results.  Technorati is very much aware of it and tries to do its best (<a href="http://www.sifry.com/alerts/archives/000298.html">David Sifry&#8217;s March report</a>): </p>
<blockquote><p>&#8220;Part of the growth of new weblogs (30,000 &#8211; 40,000 each day)  created each day is due to an increase in spam blogs &#8211; fake blogs that are&#8230;&#8221;</p></blockquote>
<blockquote><p>&#8220;&#8230;we feel that we&#8217;ve been able to capture and identify most of the spam out there, but one should note that there is definitely blog spam that we don&#8217;t catch&#8230;&#8221;</p></blockquote>
<h3 id="toc-why-not-lend-technorati-et-al-a-hand">Why not lend Technorati (et al) a hand&#8230;</h3>
<p>&#8230; and tag the spamblogs that slipped through the net.  If you get annoyed by a Spamblog showing up in your <a href="http://www.technorati.com/members/">Technorati Watchlist</a> (or <a href="http://www.googlealert.com/faqs.php#intro">Google Alert</a>, or any other <a href="http://libraryclips.blogsome.com/2005/05/20/rss-filter-and-re-mix/">search feed</a>), report it, tag it: &#8220;<a href="http://www.technorati.com/tag/spamblog">spamblog</a>&#8221; * !  You could use <a href="http://del.icio.us/">del.icio.us</a> or <a href="http://www.furl.net/">furl</a>, the services that are being <a href="http://www.technorati.com/help/tags.html">aggregated</a> by Technorati.  </p>
<p>Yes, I know, there are obvious objections.  If your furl/del.icio.us rss feeds are being republished, you increase the Spamblog&#8217;s exposure, and without <a href="http://blog.searchenginewatch.com/blog/050118-204728">rel=&#8217;nofollow&#8217;</a>, even its pagerank, exactly what the spammer is aiming for.  You might keep the &#8220;bad&#8221; links in another account, or another bookmarking service than you normal link collection.  </p>
<p>Still, hunting spamblogs by tagging them as spam is a tool users have available at this very moment.  And it is simple and easy to use**.  For the spambusters, using the spamblog tag rss feeds in (semi) automatic blacklisting is piece of cake.  And yes again, there&#8217;s the danger of having controversial, opinionated blogs ostracised by adversaries &#8211; but Technorati et al can easily outweigh spamtags against &#8220;affirmative&#8221; tags and  incoming links.  So <a href="http://www.furl.net/furled.jsp?topic=spamblog" rel = "nofollow">let&#8217;s</a> <a href="http://del.icio.us/tag/spamblog/"  rel = "nofollow">start</a>!</p>
<p>* <em>as suggested in my <a href="http://pascal.vanhecke.info/2005/05/30/spamblogs/">introductory post</a>, I would limit the tag &#8220;spamblog&#8221; to machine-generated blogs &#8211; and distinguish them from &#8220;fake&#8221; or &#8220;character&#8221; blogs&#8230;</em><br />
** <em>unlike e.g. the <a href="http://developers.technorati.com/wiki/VoteLinks">Votelinks</a> concept, that hasn&#8217;t taken off so far</em></p>
<h3 id="toc-update-20-00h-june-1">Update 20.00h, June 1</h3>
<p><em>Continuing on &#8220;outweighing spamtags against &#8216;affirmative&#8217; tags and  incoming links&#8221;</em>:  Pubsub, a service  similar to Technorati already has a system called &#8220;<a href="http://www.pubsub.com/linkranks_about.php">Linkranks</a>&#8221; they use to filter search feeds: see the discussion at: <a href="http://hyku.com/blog/archives/000251.html">hyku.com</a>.  Technorati probably has something similar.<br />
Without any doubt, blog search services like Pubsub and Technorati definitely should include in their own interfaces as well an easy way for their users to report spamblogs (like you can report spam in Yahoo of Gmail), but centralised, independent &#8220;blogspam reservoirs&#8221; such as <a href="http://del.icio.us/tag/spamblog"  rel = "nofollow">http://del.icio.us/tag/spamblog</a> or <a href="http://www.furl.net/furled.jsp?topic=spamblog"  rel = "nofollow">http://www.furl.net/furled.jsp?topic=spamblog</a> could help all of them and would definitely be a step forward from sending an email to feedback@, which is the procedure now.</p>
<h3 id="toc-update-june-25">Update June 25</h3>
<p>Adsense has an easyway now to report spamblogs that are running Adsense Ads.  From <a href="http://www.jensense.com/archives/2005/06/matt_cutts_anno.html">Jensense</a>:</p>
<blockquote><p>&#8220;If you see a site violating the AdSense terms, you can now file an anonymous spam report that will get to the quality team for checking. To file a report, you simply go to the page that is showing AdSense ads and click on the &#8220;Ads by Google&#8221; (or &#8220;Ads by Goooooogle&#8221;) link. In the form on the next page, include the term &#8220;spamreport&#8221; and put in a short reason about why you feel the site is violating the AdSense terms or policies. You can also enter your own email address, if you wish, then click submit.&#8221;</p></blockquote>
<h3 id="toc-update-july-13" id ="updatejuly13">Update July 13</h3>
<p>Apparently, the idea of building a public blacklist of spamblogs on del.icio.us or other bookmarking services hasn&#8217;t taken off.<br />
If I had given it more thought then, I should have seen why:  there is no incentive for denouncing a blog as spamblog as long as blog search engines don’t use it (a chicken and egg problem: they won&#8217;t do this before some substantial amount of data has been collected ).<br />
And even then, there&#8217;s no <em>immediate</em> reward for doing so.  What the user wants (as I suggested at <a href="http://johnaugust.com/archives/2005/annoying-trend-watch-technorati-spam-blogs">other</a> <a href="http://blog.forret.com/blog/2005/07/amy-cross-spamming-technorati.html">complaints</a> on spam blogs and technorati spam), is a simple simple “report as spam” button or link in the  (web/email/rss) interface itself (comparable with the email spam buttons in Gmail, Yahoo Mail, Hotmail etc…) so that annoying blogs or feeds are filtered from the resultset <em>immediately</em>.  Definitely something to look forward in a <a href="http://www.sifry.com/alerts/archives/000314.html">next Technorati release</a>, see the <a href="http://blogsurvey.backbonemedia.com/archives/2005/07/will_technorati.html">David Sifry&#8217;s comments on this blog search wish list</a>.<br />
So far for the idea of having a <em>public</em> blacklist, because I&#8217;m afraid none of the services (Technorati, Feedster, Pubsub, Yahoo Myweb2.0,&#8230;) will be motivated to share its results though (the ability to effectively filter out spam being a major competitive advantage!)&#8230;</p>
<div class="tantan-getcomments"><a href="http://pascal.vanhecke.info/2005/06/01/hunting-spamblogs/#comments"><img src="http://pascal.vanhecke.info/wp-content/plugins/tantan/get-comments.php?p=39" width="100" height="15" style="border:0;" /></a></div>]]></content:encoded>
			<wfw:commentRss>http://pascal.vanhecke.info/2005/06/01/hunting-spamblogs/feed/</wfw:commentRss>
		<slash:comments>19</slash:comments>
		</item>
		<item>
		<title>Spamblogs in Action</title>
		<link>http://pascal.vanhecke.info/2005/05/31/spamblogs-in-action/</link>
		<comments>http://pascal.vanhecke.info/2005/05/31/spamblogs-in-action/#comments</comments>
		<pubDate>Mon, 30 May 2005 23:26:51 +0000</pubDate>
		<dc:creator>Pascal Van Hecke</dc:creator>
				<category><![CDATA[Spamblogs]]></category>
		<category><![CDATA[Technorati]]></category>

		<guid isPermaLink="false">http://pascal.vanhecke.info/2005/05/30/spamblogs-in-action/</guid>
		<description><![CDATA[While browsing through some recent greasemonkey blog buzz I came across this Dutch-language post on a blog (moretheterrier.co.uk / giftbasket/) that seemed to be entirely in English further on. Its content had been grabbed from (the Feedburner RSS feed containing) the original posting at Hans Mestrum&#8217;s weblog. I was really wondering how the entries for [...]<div class="tantan-getcomments"><a href="http://pascal.vanhecke.info/2005/05/31/spamblogs-in-action/#comments"><img src="http://pascal.vanhecke.info/wp-content/plugins/tantan/get-comments.php?p=41" width="100" height="15" style="border:0;" /></a></div>]]></description>
			<content:encoded><![CDATA[<p>While browsing through some  recent <a href="http://www.technorati.com/cosmos/search.html?sub=toolsearch&#038;url=greasemonkey">greasemonkey blog buzz</a> I came across this  <a href="http://moretheterrier.co.uk/giftbasket/?p=76" rel = "nofollow">Dutch-language post</a> on a blog (<a href="http://moretheterrier.co.uk/giftbasket/" rel = "nofollow">moretheterrier.co.uk / giftbasket/</a>) that seemed to be entirely in English further on.  Its content had been grabbed from (the Feedburner RSS feed containing)  the <a href="http://hmestrum.blogs.com/my_weblog/2005/05/bloggen_past_in.html">original posting</a> at <a href="http://hmestrum.blogs.com/my_weblog/">Hans Mestrum&#8217;s weblog</a>.  <span id="more-41"></span></p>
<p>I <a href="http://pascal.vanhecke.info/wp-content/upload_images/200530530_spamblog_technorati_giftbasket.png"><img src='http://pascal.vanhecke.info/wp-content/upload_images/thumb-200530530_spamblog_technorati_giftbasket.png' alt='Technorati search on giftbasket' align = 'right'/></a><a href="http://pascal.vanhecke.info/wp-content/upload_images/200530530_spamblog_gone_wild.png"><img src='http://pascal.vanhecke.info/wp-content/upload_images/thumb-200530530_spamblog_gone_wild.png' alt='thousands of cron job results on a blog' align = 'left'/></a> was really wondering how the entries for this apparently <a href="http://pascal.vanhecke.info/2005/05/30/spamblogs/">SpamBlog </a> were being collected, so I did a quick <a href="http://www.technorati.com/cosmos/search.html?rank=&#038;url=giftbasket">Technorati search on &#8220;giftbasket&#8221;</a> (right):<br />
The results were hilarious and revealed this (probably <a href="http://mu.wordpress.org/">multi-user</a>) WordPress blog at <a href="http://www.moretheterrier.co.uk/moretheterrier/" rel = "nofollow">www.moretheterrier.co.uk / moretheterrier/</a>, not only containing <a href="http://moretheterrier.co.uk/moretheterrier/?p=1262" rel = "nofollow">the odd stolen blog posting</a> but as well thousands of ouput results from cron (= scheduler on Unix) jobs (<a href="http://pascal.vanhecke.info/wp-content/upload_images/200530530_spamblog_gone_wild.png">click thumbnail</a> left).</p>
<p>In fact, it turns out this installation is driving other spamblogs such as the one at <a href="http://moretheterrier.co.uk/giftbasket/"  rel = "nofollow">moretheterrier.co.uk / giftbasket/</a>, <a href="http://moretheterrier.co.uk/clock/"  rel = "nofollow">moretheterrier.co.uk / clock/</a> and <a href="http://moretheterrier.co.uk/dolphin/"  rel = "nofollow">moretheterrier.co.uk / dolphin/</a>.  All of these are pointing to fake directory sites such as <a href=" http://www.tattooaftercareaffiliates.co.uk/" rel = "nofollow">www.tattooaftercareaffiliates.co.uk</a>, <a href="http://www.moretheterrier.co.uk/" rel = "nofollow">www.moretheterrier.co.uk</a>, <a href="http://www.dr-rock.co.uk/" rel = "nofollow">www.dr-rock.co.uk</a> and <a href="http://www.inkcarts.biz/">www.inkcarts.biz</a>.</p>
<p>The <a href="http://pascal.vanhecke.info/wp-content/upload_images/200530530_spamblog_mail.png"><img src='http://pascal.vanhecke.info/wp-content/upload_images/thumb-200530530_spamblog_mail.png' alt='mail a spam blog!' align='left' /></a>  <a href="http://pascal.vanhecke.info/wp-content/upload_images/200530530_spamblog_mail_result_cron.png"> <img src='http://pascal.vanhecke.info/wp-content/upload_images/thumb-200530530_spamblog_mail_result_cron.png' alt='trigger the script!!'  align='right' /></a> content is being fetched from several mailboxes (giftblog@moretheterrier.co.uk, dolphinblog@&#8230;, clockblog@&#8230;) by a (scheduled) <a href="http://wordpress.org/support/topic/6956">WordPress script</a> that enables users to post to their blog via email.  I don&#8217;t have a clue actually by what mechanism the mails are being generated. The nice thing however is&#8230; you can do this yourself: mail one of the adresses (screenshot left), then run the corresponding script (e.g. <a href="http://moretheterrier.co.uk/giftbasket/wp-mail.php">moretheterrier.co.uk / giftbasket / wp-mail.php</a>, screenshot right), and this is the result: </p>
<p><a href="http://pascal.vanhecke.info/wp-content/upload_images/200530530_spamblog_mail_result_online.png"><img src='http://pascal.vanhecke.info/wp-content/upload_images/thumb-200530530_spamblog_mail_result_online.png' alt='the result online!' /></a></p>
<p>Oh, by the way, there&#8217;s even more&#8230;  a Technorati search for <a href="http://www.technorati.com/cosmos/search.html?rank=&#038;url=http%3A%2F%2Fwww.moretheterrier.co.uk%2F">moretheterrier.co.uk</a> showed still some other (earlier) test setups for the same subject: </p>
<p><a href="http://moretheterrier2.blogspot.com/"  rel = "nofollow">http://moretheterrier2.blogspot.com/</a><br />
<a href="http://puppypottytraining1.blogspot.com/"  rel = "nofollow">http://puppypottytraining1.blogspot.com/</a></p>
<p>And there is also this (still earlier) setup at <a href="http://www.my-resource-site.com/"  rel = "nofollow" >http://www.my-resource-site.com/</a> using a &#8220;real <a href="http://www.my-resource-site.com/pets/seoautomator.php"  rel = "nofollow">SEO automator</a>&#8221; that, if I understand correctly, turns <a href="http://www.my-resource-site.com/pets/keywords1.txt"  rel = "nofollow">keywords</a> into <a href="http://www.my-resource-site.com/pets/puppies/"  rel = "nofollow">a list of</a> <a href="http://www.my-resource-site.com/pets/puppies/housebreaking-tips-for-puppies.php"  rel = "nofollow">keyword-optimized pages</a> linking to what maybe is the <a href="http://www.pottytrainyourpuppyin7days.com/"  rel = "nofollow">actual SEO-customer</a>&#8230; </p>
<p>Apparently WordPress turned out to be a more flexible system to build Spamblogs?</p>
<div class="tantan-getcomments"><a href="http://pascal.vanhecke.info/2005/05/31/spamblogs-in-action/#comments"><img src="http://pascal.vanhecke.info/wp-content/plugins/tantan/get-comments.php?p=41" width="100" height="15" style="border:0;" /></a></div>]]></content:encoded>
			<wfw:commentRss>http://pascal.vanhecke.info/2005/05/31/spamblogs-in-action/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
	</channel>
</rss>

