<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Web 2.1 &#187; data mining</title>
	<atom:link href="http://web.2point1.com/tag/data-mining/feed/" rel="self" type="application/rss+xml" />
	<link>http://web.2point1.com</link>
	<description>Tim Whitlock&#039;s home in the Blogohedron</description>
	<lastBuildDate>Sat, 04 Sep 2010 20:37:00 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
		<item>
		<title>TwitBlock is born</title>
		<link>http://web.2point1.com/2009/07/27/twitblock-is-born/</link>
		<comments>http://web.2point1.com/2009/07/27/twitblock-is-born/#comments</comments>
		<pubDate>Mon, 27 Jul 2009 22:36:04 +0000</pubDate>
		<dc:creator>tim</dc:creator>
				<category><![CDATA[General]]></category>
		<category><![CDATA[data mining]]></category>
		<category><![CDATA[junk]]></category>
		<category><![CDATA[spam]]></category>
		<category><![CDATA[twitblock]]></category>
		<category><![CDATA[twitter]]></category>

		<guid isPermaLink="false">http://web.2point1.com/2009/07/27/twitblock-is-born/</guid>
		<description><![CDATA[A bulk blocking and spam filter tool for Twitter www.twitblock.org I&#8217;ve finally got round to building the Twitter app I&#8217;ve been thinking about for months. While everyone else is preoccupied with making fun, or cool apps, I&#8217;ve been thinking about the increasing problem of spam and junk followers on Twitter. I won&#8217;t go into why [...]]]></description>
			<content:encoded><![CDATA[<h3>A bulk blocking and spam filter tool for Twitter</h3>
<p><strong><a href="http://twitblock.org/">www.twitblock.org</a></strong></p>
<p>I&#8217;ve finally got round to building the Twitter app I&#8217;ve been thinking about for months. While everyone else is preoccupied with making fun, or cool apps, I&#8217;ve been thinking about the increasing problem of spam and junk followers on Twitter. I won&#8217;t go into why I think this is such a problem right now, plenty of time for that later.</p>
<p>This is just a quick announcement to say that I&#8217;ve released an early <em>alpha</em> version of a tool that I hope to develop into something genuinely useful. Currently it&#8217;s a <a href="http://twitblock.org/scan_followers.php">simple scanner</a> that analyses your followers for signs of &#8220;spammy&#8221; behaviour. I&#8217;ll post more details about these <em>indicators</em> soon, and I&#8217;ll also share some of the interesting discoveries I&#8217;ve been making about Twitter spam as I go on my mission.</p>
<p>UPDATE: I have posted <a href="http://web.2point1.com/2009/08/03/twitblock-spam-ratings-explained/">about these indicators</a></p>
<p><span id="more-125"></span></p>
<h3>Data mining for good, not evil</h3>
<p>One of the principal aims of <a href="http://twitblock.org/">TwitBlock</a> is to gather data in order to improve the service &#8211; i.e. to make it accurate enough that it could [in theory] be used to <em>automatically</em> filter spam out like an email junk filter endeavours.</p>
<p>By logging into TwitBlock (<a href="http://blog.twitter.com/2009/04/whats-deal-with-oauth.html" target="_blank">via Twitter OAuth of course</a>) you are sharing the list of people that you block. As long as the app is authorized I can update this list and the app can learn from it.</p>
<p>Additionally I will be writing various bots (crawlers) that analyse Twitter activity in terms of suspicious behaviour and mine more data. More about these bots later too <img src='http://web.2point1.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
]]></content:encoded>
			<wfw:commentRss>http://web.2point1.com/2009/07/27/twitblock-is-born/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
	</channel>
</rss>
