<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Notes, links and conversation &#187; Comment Spam</title>
	<atom:link href="http://pascal.vanhecke.info/category/commentspam/feed/" rel="self" type="application/rss+xml" />
	<link>http://pascal.vanhecke.info</link>
	<description></description>
	<lastBuildDate>Thu, 11 Aug 2011 14:38:56 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Akismet: the Web 2.0 cure for comment spam</title>
		<link>http://pascal.vanhecke.info/2006/02/06/akismet-the-web-20-cure-for-comment-spam/</link>
		<comments>http://pascal.vanhecke.info/2006/02/06/akismet-the-web-20-cure-for-comment-spam/#comments</comments>
		<pubDate>Mon, 06 Feb 2006 00:21:18 +0000</pubDate>
		<dc:creator>Pascal Van Hecke</dc:creator>
				<category><![CDATA[Comment Spam]]></category>
		<category><![CDATA[WebWatch]]></category>

		<guid isPermaLink="false">http://pascal.vanhecke.info/2006/02/06/akismet-the-web-20-cure-for-comment-spam/</guid>
		<description><![CDATA[If you’re fed up with comment spam, have a look at Akismet… What Akismet is a comment spam filtering service, provided by Automattic, the company founded by WordPress developer Matt Mullenweg (together with some other core WordPress developers). Recently, the leader of Yahoo developer network joined the startup, so there apparently is money and potential… [...]<div class="tantan-getcomments"><a href="http://pascal.vanhecke.info/2006/02/06/akismet-the-web-20-cure-for-comment-spam/#comments"><img src="http://pascal.vanhecke.info/wp-content/plugins/tantan/get-comments.php?p=146" width="100" height="15" style="border:0;" /></a></div>]]></description>
			<content:encoded><![CDATA[<p>If you’re fed up with comment spam, have a look at <a href="http://akismet.com/">Akismet</a>… <span id="more-146"></span> </p>
<h3 id="toc-what">What</h3>
<p>Akismet is a comment spam filtering service, provided by <a href="http://automattic.com/">Auto<em>matt</em>ic</a>,  the company founded by WordPress developer <a href="http://photomatt.net/">Matt  Mullenweg</a> (together with <a href="http://automattic.com/about/">some other  core WordPress developers</a>). Recently, the leader of Yahoo developer network  <a href="http://gigaom.com/2006/01/11/automattic-wordpress/">joined the startup</a>,  so there apparently is money and potential… </p>
<h3 id="toc-how">How</h3>
<p>You need to install a plugin in your blogging engine. There (obviously) already  is <a href="http://akismet.com/download/">such a plugin</a> for WordPress (and  it <a href="http://wordpress.org/download/">comes by default with WordPress  2.0</a>), but there will soon be plugins for other blogging tools as well. In  fact, Akismet is suitable to check any kind of user-submitted content for spam.  There are generic software libraries that use the Akismet API on top of which  anti-spam plugins for all kinds of social software tools can be built: forums,  wikis&#8230; Have a look at the Akismet <a href="http://akismet.com/development/">development</a>.  </p>
<h3 id="toc-behind-the-scenes">Behind the scenes…</h3>
<p>The Akismet plugin sends a submitted comment to the Akismet server. The server  runs a series of tests and the result classifies the comment as spam or not.  Basically, the server works as an email spam filter. I wouldn’t be surprised  if a lot of the Akismet code was based on <a href="http://wiki.apache.org/spamassassin/SpamAssassin">Spamassassin</a>  (an open source email spam filter). Like Spamassassin, Akismet learns from its  mistakes: so if you notice a comment that got through and reclassify it as spam,  that result is sent to the Akismet server and similar comments will be more  likely to be marked as spam. Vice versa for “false positives”. Note that this  way you profit from other Akismet users’ efforts on their blogs as well (and  they from yours)! The Akismet service is comparable to the email spam database  <a href="http://razor.sourceforge.net/">Vipul&#8217;s Razor</a> (used by SpamAssassin)  and its commercial version <a href="http://www.cloudmark.com/">Cloudmark</a>,  that is used by some commercial email spam filters. </p>
<h3 id="toc-results">Results</h3>
<p>I have Akismet running on this blog and blogs I installed for other people:  I could safely turn of comment moderation, which makes the interaction between  the commenters a lot faster and livelier. I&#8217;ve only had 2 false positives in  about a month (very short comments or trackbacks). In order to get Akismet running,  you need to get an API key by registering as a user at <a href="http://wordpress.com/">WordPress.com</a>  (the hosted version of WordPress provided by Automattic). The Akismet service  is free for personal use, let’s hope they don’t run into performance problems  any time soon… </p>
<h3 id="toc-web2-0">Web2.0</h3>
<p>If you were looking for a quintessential <a href="http://en.wikipedia.org/wiki/Web2.0">Web2.0</a>  application, then Akismet is one: </p>
<ul>
<li> decentralised service</li>
<li> open api</li>
<li> the more people and applications that use it, the better it gets</li>
<li> results are driven by the collective judgement of the users, each of them  driven by self-interest</li>
<li> business model:</li>
<ul>
<li> build up a user base with free software (WordPress), then provide that  audience with a paid service. </li>
<li> the lock-in is in the data, not in the software</li>
<li> the software is open source, and helps drive the adoption of the API</li>
</ul>
<li> and it has nothing to do with Ajax :-) (unless you count the WP admin panel  in)</li>
</ul>
<div class="tantan-getcomments"><a href="http://pascal.vanhecke.info/2006/02/06/akismet-the-web-20-cure-for-comment-spam/#comments"><img src="http://pascal.vanhecke.info/wp-content/plugins/tantan/get-comments.php?p=146" width="100" height="15" style="border:0;" /></a></div>]]></content:encoded>
			<wfw:commentRss>http://pascal.vanhecke.info/2006/02/06/akismet-the-web-20-cure-for-comment-spam/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Social engineering in comment spam</title>
		<link>http://pascal.vanhecke.info/2005/11/28/social-engineering-in-comment-spam/</link>
		<comments>http://pascal.vanhecke.info/2005/11/28/social-engineering-in-comment-spam/#comments</comments>
		<pubDate>Mon, 28 Nov 2005 21:37:06 +0000</pubDate>
		<dc:creator>Pascal Van Hecke</dc:creator>
				<category><![CDATA[Comment Spam]]></category>

		<guid isPermaLink="false">http://pascal.vanhecke.info/2005/11/28/social-engineering-in-comment-spam/</guid>
		<description><![CDATA[A comment made on a previous posting: &#8220;I googled for something completely different, but found your page… and have to say thanks. nice read.&#8221; Wouldn&#8217;t you immediately hit the &#8220;approve&#8221; button? In this case, the comment poster&#8217;s url pointed to http://otaro.co.uk/, a page full of &#8220;cheap car hire&#8221; ads and links. Ironically, the poster&#8217;s email [...]<div class="tantan-getcomments"><a href="http://pascal.vanhecke.info/2005/11/28/social-engineering-in-comment-spam/#comments"><img src="http://pascal.vanhecke.info/wp-content/plugins/tantan/get-comments.php?p=133" width="100" height="15" style="border:0;" /></a></div>]]></description>
			<content:encoded><![CDATA[<p>A comment made on a <a href="http://pascal.vanhecke.info/2005/08/25/image-upload-plugin-for-tinymce/">previous posting</a>:</p>
<blockquote><p>&#8220;I googled for something completely different, but found your page… and have to say thanks. nice read.&#8221;</p></blockquote>
<p>Wouldn&#8217;t you immediately hit the &#8220;approve&#8221; button?<span id="more-133"></span></p>
<p>In this case, the comment poster&#8217;s url pointed to http://otaro.co.uk/, a page full of &#8220;cheap car hire&#8221; ads and links.  Ironically, the poster&#8217;s email was the confidence-building &#8220;please.no.spam@otaro.co.uk&#8221;.  But the best part of course was the combination &#8220;I googled for something completely different&#8221; (so you don&#8217;t expect me to write anything relevant) and &#8220;have to say thanks, nice read&#8221; (so you approve in a wink).  The blogger&#8217;s ego&#8230; the weakest spot in comment spam protection.</p>
<p>Now this was a simple one, because the hurried spammer left his link load on the first script run.  A more patient strategy would have been to build confidence in the targeted blogs by not giving away his intentions immediately.  Lots of (at least WordPress blogs) hold comments from unknown posters (ip&#8217;s, mail addresses&#8230;) in moderation by default.  Once a comment is approved, the poster is whitelisted.   The technique would work on the level of individual blogs, and, when smart enough to fool lots of users, maybe even for systems using the kind of collaborative filtering such as used in <a href="http://akismet.com/">Akismet</a>.</p>
<p>So you better think twice before approving another vague &#8220;thank-you&#8221; or &#8220;great reading&#8221; post&#8230;.</p>
<p><em>BTW: <a href="http://seoblackhat.com/">Seo BlackHat</a> is an interesting (and entertaining!) resource on all kinds of web spam &#8211; he hasn&#8217;t written on this yet, so maybe it was his? :-)</em></p>
<div class="tantan-getcomments"><a href="http://pascal.vanhecke.info/2005/11/28/social-engineering-in-comment-spam/#comments"><img src="http://pascal.vanhecke.info/wp-content/plugins/tantan/get-comments.php?p=133" width="100" height="15" style="border:0;" /></a></div>]]></content:encoded>
			<wfw:commentRss>http://pascal.vanhecke.info/2005/11/28/social-engineering-in-comment-spam/feed/</wfw:commentRss>
		<slash:comments>16</slash:comments>
		</item>
	</channel>
</rss>

