<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Aldoblog &#187; tidbits</title>
	<atom:link href="http://aldoblog.com/tag/tidbits/feed/" rel="self" type="application/rss+xml" />
	<link>http://aldoblog.com</link>
	<description>Michael Alderete’s Weblog</description>
	<lastBuildDate>Wed, 23 May 2012 11:14:11 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
		<item>
		<title>Latent semantic analysis is not Bayesian filtering</title>
		<link>http://aldoblog.com/2003/05/latent-semantic-analysis-is-emnotem-bayesian-filtering/</link>
		<comments>http://aldoblog.com/2003/05/latent-semantic-analysis-is-emnotem-bayesian-filtering/#comments</comments>
		<pubDate>Sun, 04 May 2003 06:25:10 +0000</pubDate>
		<dc:creator>Alderete</dc:creator>
				<category><![CDATA[Anti-Spam]]></category>
		<category><![CDATA[Mac OS X]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[anti-spam]]></category>
		<category><![CDATA[bayesian]]></category>
		<category><![CDATA[eudora]]></category>
		<category><![CDATA[inbox]]></category>
		<category><![CDATA[latent-semantic-analysis]]></category>
		<category><![CDATA[mail.app]]></category>
		<category><![CDATA[spam]]></category>
		<category><![CDATA[spamnet]]></category>
		<category><![CDATA[tidbits]]></category>

		<guid isPermaLink="false">http://aldoblog.com/blog/299</guid>
		<description><![CDATA[Macworld recently ran an article about anti-spam tools for Mac OS X, which incorrectly simplified the world of anti-spam tools down to Boolean, points-based, and Bayesian filters. There are at least two more categories of anti-spam tools.]]></description>
			<content:encoded><![CDATA[<p></p>	<p>Macworld <a href="http://www.macworld.com/2003/04/magazine/april2003toc/" title="Macworld April 2003: ">recently ran</a> an article about anti-spam tools for Mac OS X, which incorrectly simplified the world of anti-spam tools down to Boolean, points-based, and Bayesian filters.</p>

	<p>Two additional categories are distributed recognition, such as the <a href="http://www.rhyolite.com/anti-spam/dcc/">Distributed Checksum Clearinghouse</a> (<span class="caps">DCC</span>) and <a href="http://razor.sourceforge.net/">Vipul&#8217;s Razor</a>, and latent semantic analysis. I don&#8217;t know of any distributed recognition products for the Mac (there&#8217;s a very good one for Windows Outlook, <a href="http://www.cloudmark.com/products/spamnet/">SpamNet by Cloudmark</a>), but there certainly <em>is</em> a latent semantic analysis tool &#8212; Apple&#8217;s Mail in Jaguar!</p>

	<p>The simplification (or oversight) is relatively understandable. From an end-user perspective, there&#8217;s no meaningful difference &#8212; even though <a href="http://www.pacificavc.com/blog/2003/02/10.html" title="Bayesian Nets, Latent Semantics">the math is very different</a>. It&#8217;s not clear which will prove better at filtering out spam, even though in the article Mail&#8217;s filtering did the best. Seems like it&#8217;s good to have both in the fight!</p>

	<p>While I&#8217;m posting about it, I should note that the article was written prior to the release of <a href="/blog/298">my new favorite</a> anti-spam tool, <a href="http://www.spamnix.com/">Spamnix</a>, and so it doesn&#8217;t include it in the roundup. From my own experience with Mac OS anti-spam tools I think that, with the caveat that it only works with Eudora, it would have done well in the evaluation. Perhaps Geoff Duncan, or someone else at <a href="http://www.tidbits.com/">TidBITS</a>, will review it soon, and confirm that guess. I know they like Eudora at TidBITS &#8212; they literally wrote the book!<hr />Copyright &copy; 2012 by <strong><a href="http://aldoblog.com">Aldoblog</a></strong>. All rights reserved. This feed is provided for personal non-commercial use only. If you are not reading this material in your news aggregator, the site you are looking at is guilty of copyright infringement. Please contact legal-2011@aldoblog.com so we can take action immediately.</p>]]></content:encoded>
			<wfw:commentRss>http://aldoblog.com/2003/05/latent-semantic-analysis-is-emnotem-bayesian-filtering/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

