Bayesian filters

Spam is a problem that isn't going to go away anytime soon. Even if governments legislate against it, spammers' ability to adapt their messages and to get the computers of unsuspecting users with broadband connections to broadcast spam (zombies) makes it very difficult to prevent unwanted messages completely. Chances are you already filter your e-mail, and your ISP is also actively filtering it before it reaches you. The problem is that spammers are adapting to filters and getting around these types of systems.

Over the past year companies have begun to introduce anti-spam products based on Bayesian filters. A Bayesian filter evaluates the content of a message and assigns it a score, typically from 0 (not spam) to 100 (spam). You set a threshold score, and any message that scores higher than the threshold does not come into your Inbox. The default threshold is usually 50.
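To make the scoring idea concrete, here is a minimal sketch in Python of how a filter might turn per-word evidence into a 0 to 100 score and compare it against the threshold. It is not the algorithm used by any particular product. The word probabilities in `WORD_SPAM_PROB`, the neutral value for unknown words, and the function names `spam_score` and `is_filtered` are all assumptions made up for illustration; a real filter learns its probabilities from your mail.

```python
import math

# Hypothetical per-word spam probabilities (illustrative values only).
# A real filter would learn these from mail the user has classified.
WORD_SPAM_PROB = {
    "viagra": 0.99,
    "free": 0.90,
    "meeting": 0.10,
    "agenda": 0.05,
}
UNKNOWN_WORD_PROB = 0.5  # assumption: unseen words are neutral evidence
THRESHOLD = 50           # the usual default threshold described above


def spam_score(message: str) -> int:
    """Combine per-word probabilities into a 0-100 score (naive Bayes)."""
    log_spam = 0.0
    log_ham = 0.0
    for word in message.lower().split():
        p = WORD_SPAM_PROB.get(word, UNKNOWN_WORD_PROB)
        log_spam += math.log(p)
        log_ham += math.log(1.0 - p)
    # P(spam | words) = prod(p) / (prod(p) + prod(1 - p)), done in log space.
    # The exponent is capped so very long messages cannot overflow exp().
    diff = min(log_ham - log_spam, 700.0)
    prob = 1.0 / (1.0 + math.exp(diff))
    return round(100 * prob)


def is_filtered(message: str, threshold: int = THRESHOLD) -> bool:
    """Return True when the score is higher than the threshold."""
    return spam_score(message) > threshold
```

Under these made-up probabilities, a message such as "free viagra" scores well above 50 and would be filtered, while "meeting agenda" scores well below 50 and would reach the Inbox. Raising the threshold lets more borderline mail through; lowering it filters more aggressively.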

Recently Eudora 6.0's paid mode began offering SpamWatch, which incorporates a Bayesian filter. If, for example, you move all of your filtered spam into the new Junk folder, you already have a good start on making the filter effective. A Bayesian filter is adaptive. As you continue to designate items as junk (moving them to the Junk folder and setting their scores to 100) or to unjunk items in the Junk box (setting their scores to 0 and returning them to your Inbox), the Bayesian filter gets better. This newsletter you get might be junk to one user and a treasure to another, but now the end user can define it. While there are methods for defeating Bayesian filters, those methods aren't foolproof, and messages of those types can often be prefiltered by other systems in use.
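The adaptive behavior can be sketched in a few lines of Python. This is a toy model under stated assumptions, not how SpamWatch or any other product is implemented: the class name `AdaptiveFilter`, its methods, and the add-one smoothing are all hypothetical choices made for illustration. The point it shows is that each junk or not-junk decision updates word counts, and those counts change the score of later messages.

```python
import math
from collections import Counter


class AdaptiveFilter:
    """Toy trainable filter that learns from junk / not-junk decisions."""

    def __init__(self) -> None:
        self.junk_words = Counter()  # junk messages containing each word
        self.good_words = Counter()  # good messages containing each word
        self.junk_msgs = 0
        self.good_msgs = 0

    def train(self, message: str, is_junk: bool) -> None:
        """Record the user's decision for one message."""
        words = set(message.lower().split())
        if is_junk:
            self.junk_words.update(words)
            self.junk_msgs += 1
        else:
            self.good_words.update(words)
            self.good_msgs += 1

    def word_prob(self, word: str) -> float:
        """Estimate how strongly a word indicates junk (0.5 means neutral)."""
        # Add-one smoothing (an assumption) keeps the estimate inside (0, 1).
        junk_rate = (self.junk_words[word] + 1) / (self.junk_msgs + 2)
        good_rate = (self.good_words[word] + 1) / (self.good_msgs + 2)
        return junk_rate / (junk_rate + good_rate)

    def score(self, message: str) -> int:
        """Return a 0-100 score from the words seen so far."""
        log_junk = 0.0
        log_good = 0.0
        for word in set(message.lower().split()):
            p = self.word_prob(word)
            log_junk += math.log(p)
            log_good += math.log(1.0 - p)
        # Cap the exponent so long messages cannot overflow exp().
        diff = min(log_good - log_junk, 700.0)
        return round(100 / (1.0 + math.exp(diff)))
```

Before any training, every message scores a neutral 50. Calling `train(text, is_junk=True)` on a newsletter pushes the score of similar text above 50, and later calling `train(text, is_junk=False)` on it more often pulls the score back down, which mirrors how one user's junk can be another user's treasure.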

You'll find a description of how Bayesian filters work at http://www.paulgraham.com/spam.html and at http://email.about.com/cs/bayesianfilters/a/bayesian_filter.htm. Products with Bayesian filters are available for all e-mail clients and include SpamBayes, Spam Bully, Spam Combat, Mozilla's spam program, ResponZe, and others. A compiled list may be found at http://email.about.com/cs/bayesiansoftwin/.

Barrie Sosinsky is president of consulting company Sosinsky and Associates (Medfield, MA). He has written extensively on a variety of computer topics. His company specializes in custom software (database and Web related), training, and technical documentation.

This was first published in December 2003
