rhs blog
Richard Schwartz's Web Log
Powered by Lotus Notes & Domino 6



rhs blog is being phased out

Please visit The Power of The Schwartz


penumbra logo Penumbra Bloggers
Dan Fredricksen (rss)
Melissa Gena (rss)
Carolyn Kraut (rss)
Ben Langhinrichs (rss)
Henry Newberry (rss)
Rob Novak (rss)
Rocky Oliver (rss)
Andrew Pollack (rss)

notes logo Domino Bloggers
3C Interop (rss)
Airplay (rss)
Gaston Annebicque (rss)
Andrew Barker (rss)
Robert Basic (rss)
Chris Bekas (rss)
Alan Bell (rss)
Brian Benz (rss)
Ray Bilyk (rss)
Olaf Björklund (rss)
David Bockes (rss)
Michael Braly (rss)
Chris Brandlehner (rss)
Breaking Par (rss)
Ed Brill (rss)
Bill Buchan
Sean Burgess (rss)
Christoper Byrne (rss)
Curt Carlson (rss)
Rune Carlson
Colman Carpenter (rss)
Jerry Carter (rss)
Steve Castledine (rss)
Ferdy Christant (rss)
Chris Coates (rss)
CodeStore (rss)
Patrick Corey (rss)
Jack Dausman (rss)
Ray Davies (rss)
Laurent deWalick (rss)
Dave Dickey (rss)
Almar Diehl (rss)
Frank Docherty (rss)
Domino-IT (rss)
Dragon
Thomas Duff (rss)
Bruce Elgort (rss)
Warren Elsmore (rss)
Ed Falcon
Jens-Christian Fischer (rss)
Wolfgang Flamme (rss)
Dan Fredricksen (rss)
Nathan Freeman (rss)
From Domino (rss)
Melissa Gena (rss)
Nick Goddard (rss)
Mike Golding (rss)
Neil Gower (rss)
Mac Guidera (rss)
David Gurteen
Mike Hamner (rss)
Christopher Harvey (rss)
John Hawkins (rss)
John Head (rss)
Henning Heinz (rss)
Alex Hernandez
Anthony Hicks
Kurt Higley (rss)
Roy Holder (rss)
Jake Howlett
InsideLotus (rss)
Jochen Tinn (rss)
Jon Johnston (rss)
Johan Känngård (rss)
Tony Kelleran (rss)
Chris King (rss)
Justin Knol (rss)
Hynek Kobělka (rss)
Heinz Ulrich Krause (rss)
Carolyn Kraut (rss)
Francois Lafleur (rss)
Neill Laney (rss)
Ben Langhinrichs (rss)
Justin Lauder (rss)
Chris Linfoot (rss)
Joe Litton (rss)
LNDCentral (rss)
Martin Los (rss)
Gonzo Lotusphere (rss)
Declan Lynch (rss)
Eric Mack (rss)
Brian Mahoney (rss)
John Z. Marshall (rss)
Rob McDonagh (rss)
Kathleen McGivney (rss)
Ian McNairn (rss)
Dave Meehan (rss)
Chris Miller (rss)
Stephen Mitchell (rss)
Paul Mooney (rss)
Mike Munnis (rss)
Dave Navarre (rss)
Captain Nelson (rss)
Henry Newberry (rss)
News4Notes (rss)
Tom Nichols
Fabian Nirman
Keith Nolan (rss)
Tom O'Neil (rss)
Bob Obringer (rss)
Rocky Oliver (rss)
Tony Ollivier (rss)
Devin Olson (rss)
Nuno Pereira (rss)
Bruce Perry (rss)
Andrew Pollack (rss)
Ben Poole (rss)
Kevin Presseault (rss)
Colin Pretorius (rss)
Project DX (rss)
Jack Ratcliff (rss)
Niel Revelle (rss)
Julian Robichaux (rss)
Stan Rogers (rss)
John Roling (rss)
Ben Rose (rss)
Jeremy Schwab
Libby "NotesGirl" Schwarz (rss)
Nik Shenoy (rss)
Matt Smith (rss)
Michael Sobczak (rss)
Lance Spellman (rss)
Philip Storry (rss)
Jess Stratton (rss)
Keith Strickland (rss)
Chris Toohey
Carl Tyler (rss)
Erwin van Hunen (rss)
John Vaughan (rss)
Maurice Verhagen (rss)
Florian Vogler (rss)
Michael Wagner (rss)
Volker Weber (rss)
Websphere Portal Notebook (rss)
Paul Westlake (rss)
Matt White (rss)
Colin Williams (rss)
Stephan Wissel (rss)
Ed Wrenbeck
Rob Wunderlich (rss)

Other Bloggers
Jeremy Allaire (rss)
Dave Barry
Ned Batchelder (rss)
Matthew Berk (rss)
Hiawatha Bray
Dan Bricklin (rss)
Bob Congdon (rss)
Cool Tools (rss)
Mark Cuban (rss)
David Pogue (rss)
Howard Dean
Peter DeHaas (rss)
Gary Devendorf (rss)
Esther Dyson
Bruce Eckel (rss)
Miguel Estrada (rss)
Amitai Etzioni
Bob Frankston & David Reed (rss)
Adam Gaffin (rss)
Dan Gillmor (rss)
Steve Gillmor (rss)
Gary Hart
Charles Hudson (rss)
Chris Kagy (rss)
Mitch Kapor (rss)
Damien Katz (rss)
Lawrence Lessig (rss)
Pete Lyons (rss)
Many to Many (rss)
Joshua Marshall (rss)
Ross Mayfield (rss)
Measure Twice (rss)
Michael Moore
Peter O'Kelly (rss)
Ray Ozzie (rss)
Sebastian Paquet
John Patrick (rss)
Jonathan Peterson (rss)
Tom Peters (rss)
Mark Pilgrim (rss)
Glenn Reynolds
Steve Rubel (rss)
Sam Ruby (rss)
Pito Salas (rss)
Michael Sampson (rss)
Andrew Samwick (rss)
David Schatsky (rss)
Kevin Schofield (rss)
Jonathon Schwartz (rss)
Robert Scoble (rss)
Clay Shirky (rss)
Smart Mobs
SocialText Blog (rss)
Joel "on Software" Spolsky (rss)
Jon Udell (rss)
Venture Blog
Wil Wheaton (rss)
Andy Williams Affleck (rss)
Dave Winer
Amy Wohl
Tom Yager (rss)
Steve ?

notes logo Notes & Domino
Penumbra Group LDD OpenNTF
Lotus Software IBM FAQ

notes logo Domino Blogging
FreeDom Blog Dom Blog
BlogSphere DominoBlog

RHS On The Web
RHS Consulting Home Page
Links To This Page

On www.lotus.com...
 Lotusphere 2003 Preview

On www.advisor.com...
 Lotusphere 2002
    Reports 1 2 3
 Domino Designer 5
 The Notes C++ API
 .NET and Domino
 LotusScript Coaching
 Tips 'n Tricks

On LDD Today (www.notes.net)....
 Lotusphere 2003
    Reports 1 2 3 4
 Field Encryption
 Frames in 4.6
 Domino Architecture,
    Parts 1 and 2
 Bruce's Article on
    Objects

Report From The 2004 MIT Spam Conference

This was a very interesting conference. Mark Ramos of Granite Software also attended, and we sat near each other and had lunch together. Other than having one familar face from the Lotus community in the crowd, however, this conference was unlike any other conference I've been to in a long time. First of all, it was an academic conference, which means that presentations are much shorter than what we see in industry conferences. It all took place in one auditorium, with no breakout sessions. There were 19 individual presentations on the agenda. That's an awful lot of information in a very short time.

Now, about the venue... MIT is a world class institution... there's no question about that... but it has all the charm of an urban industrial park. I found it odd that, with as many experts in queueing theory as there are at MIT, there was only a single rest room for an auditorium that held somwhere in the neighborhood of 500 people.

The most frequently heard phrases during the conference were probably "arms race", "white list", and of course "Bayesian". Another common term was "innoculation", which refers to automatically passing on spammer information in near-real-time so that other sites can benefit from information about attacks that you have detectd.

A wide variety of viewpoints were given, some of them in direct conflict. One speaker, Terry Sullivan, presented a statistical analysis that strongly challenged the conventional wisdom that spam patterns mutate very quickly, yet many other speakers took the volatility of spam patterns as a given. One of the two speakers who addressed the problem of making statistical filtering fast enough for server implementation, completely puzzled the audience by citing some numbers describing his adaptation of the CRM114 approach that baffled everyone because he seemed to be implying that he was computing probabilities for eight distinct patterns within a two token window. Clearly something was lost in translation on that one -- but I actually have to admit that this talk got me thinking about a few things, and I'm going to start a correspondence with some of the presenters to see whether the ideas I've come up with might be useful to them. There were also two different speakers on the subject of legal responses to spam, one of whom was fairly optimistic, and the other fairly pessimistic, and two different speakers talking about sender-pays economic solutions, one of whom presented a detailed economic model that was certainly of academic interest even if the practical value is questionable.

I think the most intriguing presenation, in terms of new ideas and techniques, was Marty Lamb's talk about TarProxy. The basic idea is that TarProxy recognizes spammers when they connect, and it does everything it can to keep them connected for as long as possible, while not actually delivering their spam. This turns the tables on spammers, consuming their resources so that they can't send as many spam messages. It's like putting a telemarketer on hold. He's designed it to be pluggable, so integrating it with an existing spam solution should be quite easy, and integrating it with a white list to improve reliability would be a cool idea, too.

Eric Kidd of Dartmouth Medical School gave an interesting presentation about Bayesian Whitelisting. He spoke about using statistical analysis of headers to quickly determine whether mail could bypass more expensive analysis. What he's really doing is analyzing the implicit social network within email, and looking for messages that fit the existing communication patterns. Cool idea.

The best presentation was actually delivered in absentia by John Graham-Cumming. His video presentation about "How To Beat A Bayesian Filter" took us inside the minds of spammers, He described an attack on Bayesian filtering using -- get this! -- Bayesian analysis on the spammer's side! The spammer sends a massive number of messages, using the "word salad" technique to try to find versions that make it through the filters, and uses statistics on which ones get rejected least often to train his software to use those particular words more often. His conclusion was that all feedback to spammers is harmful. Non-deliveries, challenge/response messages, HTML rendering of images that contain coded acknowledgement data, SMTP rejection messages, etc., all potentially help spammers fine-tune their attacks. He's right, of course, but as is the case with many things that we know are correct in a purely academic sense, the question of what we should do in practical terms remains wide open.

The most controversial topic was one of the last ones:Eric Raymond's presentation on SPF, which is a DNS-based technique for adding authentication into the SMTP protocol. Opinons run very strong about whether SPF does any good at all, whether it does more harm than good, etc. A fellow in the audience, whom I'm 99.9% certain was Barry Shein, founder of The World -- the first public dialup ISP anywhere, which happens to be where I had my very first dialup account -- spoke very critically of SPF. I'm not sure where I stand on this. Mass adoption of SPF will cause problems for anyone sending mail directly from dynamic IP addresses, but the fact that many major ISPs are already rejecting mail from dynamic IPs kind of makes this a moot point IMHO. It also causes problems for automatied forwarding messages -- at least the invisible type of forwarding that users are accustomed to when they move from one address to another. I'm leaning toward the belief that SPF is another weapon, not necessarily a definitive way to track spam, and probably not a way to reject spam all by itself either, but perhaps as a way to determine which inbound messages need to be subjected to the most stringent analysis before delivery.

The very last presentation, by Richard Jowsey of Death2Spam (gotta love that name!!) appealed to the latent math geek in me. I say latent, but my friends all know about it, and I've let it show here from time to time, I guess. Anyhow, he showed lots of pretty bell curves while handling the topic of fine-tuning Bayesian filtering. It turns out that my own experiments with Bayesian filters in LotusScript explored some similar tuning concepts, but I never actually did the math to justify what I was doing.




Discussion | Post Response


Report From The 2004 MIT Spam Confe... ( 18-Jan-04)
. . RE : Report From The 2004 MIT Spam ... ( 21-Jan-04)
. . . . RE : Report From The 2004 MIT Spam ... ( 23-Jan-04)

My Company An Association of Lotus Business Partners That I Helped Found
I contributed to this book
Previous MonthJan05
1234567
891011121314
15161718192021
22232425262728
293031

By Category
Blogs, Social Software & Collaboration
Business, Processes & Methods
Computer Industry News
Diversions
Events, Presentations & Awards
IBM, Lotus, Notes & Domino
Law and Technology
News, Politics, Issues, Opinion
Personal
Pet Peeves
Programming
Projects - Blog Aggregator
Projects - RSS Feed Generator
QuickBlogs
Spam & Security
Systems And Networking
Technology & Science
Tips - Programming
Travel & Food
Tributes


Recent Blog Responses
Mary Beth Raven: Drag Drop
Ulrich Kraus: Tell amgr
Ferdy C.: Congratulations
Pito Salas: Benton, NH
Ed Brill: Sharepoint
Amy B.: IQ
Rocky Oliver: Intolerance
Matt White: London
Ed Brill: Picciano article
Adam Osborne: Renewals

Really Useful Blog Entries
Volker Weber's: devWorks Live links
Mike Golding's CSS Buttons
Kalsey: Button Maker
McKnight: Steal These Buttons
Kalsey: CSS Tabs

Required Reading
Joel on Software - Strategy Letter V
Understanding Notes and Its Developers
Rise of the Stupid Network
Dijkstra: Programming as a Human Activity
Fred Brooks: No Silver Bullet
Ken Thompson: Trusting Trust
Paul Graham: Hackers and Painters
Paul Graham: Lots of Stuff About Spam
notestips.com :: Homepage Best Practises
Shirky: A Group Is Its Own Worst Enemy
Joel On Software: Stupidity
Intranet Journal: Content Owners & Developers
Chris Linfoot: Email Disclaimers
Jack Dausman: Server ID Security
Jim McCarthy: 21 Rules of Thumb

What's All This About Social Software?
Social Software Alliance Wiki
Guardian: Social climbers
Headshift: Smarter, Simpler Social
Many-to-Many: Social Software
Shirky: Politics of Groups
Smart Mobs: Historical Roots
Coates: Working Definition
Corante: Earliest Use of the Term
Corante: Why Social Software?
Corante: Why Now?
Shirky: IM Rulz
Stowe Boyd: -Are You Ready?
Many-to-Many: Communities and Networks

Noters & Domino 6 Bible Mentions
Volker Weber: What a fat book
Ray Davies: Comment
Joe Litton: Welcome Rocky!

Contact Info
email - Email
aim - AIM
home - Personal Home Page
home - RHS Consulting Home Page

Search
Javascript required for search

Legal
Attribution required. Permission required for commercial use.

Translation