Industry guru Dave Taylor offers free tech support on a wide variety of technical and business topics, including HTML, Apple iPhone, online advertising, Cascading Style Sheets, Web design, management, Unix, Linux, search engine optimization, online dating, Mac OS X, shell script programming and Microsoft Windows.

What is a CAPTCHA anti-spam system?

My system administrator is telling me that I need to add a "capcha" [actually, it's "captcha"] system to my site so that I get less spam. What's a captcha system and why would I want it?


Dave's Answer:

Ah, spam, the bane of our collective online existence. It's a pain and it's frustrating how much time and effort we have to collectively expend trying to minimize its impact on our online experiences. Blech.

Here's a fun fact that most people don't know: CAPTCHA is actually an acronym. It stands for "Completely Automated Public Turing test to tell Computers and Humans Apart." (Turing here refers to the brilliant computer scientist Alan Turing, who spent much time thinking about how to differentiate really smart computers from humans, hence the so-called "Turing test").

I know you've seen these. Here's a typical CAPTCHA from Google's site:

[Image: Google CAPTCHA]

The basic idea is that it's some twisted or distorted or obfuscated text that you, as a human, will have no problem recognizing, but that a computer program would have a very tough time interpreting: if it can't figure out the words shown, it can't pretend to be a human and therefore can't spoof the system.
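To make the idea concrete, here is a minimal sketch in Python of the bookkeeping behind a text CAPTCHA: pick some random characters, then compare what the visitor types against them. The function names (`make_challenge`, `check_answer`) and the character set are assumptions made up for illustration, not any real product's API, and the hard part (drawing the twisted image) is left out entirely.

```python
import hmac
import secrets

# Assumption: skip look-alike characters (0/O, 1/I/L) so humans are not tripped up.
ALPHABET = "ABCDEFGHJKMNPQRSTUVWXYZ23456789"

def make_challenge(length=6):
    """Return random text that a real system would render as a distorted image."""
    return "".join(secrets.choice(ALPHABET) for _ in range(length))

def check_answer(expected, typed):
    """Compare the visitor's answer to the expected text, ignoring case and spaces."""
    cleaned = "".join(typed.split()).upper()
    return hmac.compare_digest(expected.encode(), cleaned.encode())

challenge = make_challenge()
print(check_answer(challenge, challenge.lower()))  # a correct answer in lowercase passes
print(check_answer(challenge, "not it"))           # a wrong answer fails
```

Ignoring case and stray spaces is a deliberate choice here: the point is to stop programs, not to punish people for sloppy typing.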

Here's another example, from Craigslist:

[Image: Craigslist reCAPTCHA]

This one is more interesting because, while it's the same basic idea, reCAPTCHA also doubles as a distributed way to decipher difficult-to-read scanned text from old books being digitized. Nice win-win!

Lots of sites use these, actually. Here's one from Facebook:

[Image: Facebook CAPTCHA]

Unfortunately, the simpler CAPTCHA systems have been cracked through various means, ranging from smart image scanning and analysis software to brute-force guessing (mapping each CAPTCHA image file name to its solution) to simply showing the CAPTCHA in another context on an apparently innocent Web site and storing the values [innocent] people type in.
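Two of those attacks suggest their own countermeasures. As a rough sketch, assuming a Python back end and a hypothetical in-memory `ChallengeStore` class (not any particular CAPTCHA product's API), the image can be named by a random token that reveals nothing about the solution, and each challenge can be thrown away after a single attempt or a short timeout so answers can't be looked up or guessed over and over:

```python
import secrets
import time

class ChallengeStore:
    """Hypothetical in-memory store; a real site would use a session or database."""

    def __init__(self, ttl_seconds=120):
        self.ttl = ttl_seconds
        self._pending = {}  # token -> (solution, expiry time)

    def issue(self, solution):
        # The token is random, so an image named after it gives no hint of the solution.
        token = secrets.token_urlsafe(16)
        self._pending[token] = (solution, time.monotonic() + self.ttl)
        return token

    def verify(self, token, answer):
        # pop() removes the challenge, so each token allows exactly one attempt.
        entry = self._pending.pop(token, None)
        if entry is None:
            return False
        solution, expires = entry
        if time.monotonic() > expires:
            return False
        return secrets.compare_digest(
            solution.upper().encode(), answer.strip().upper().encode()
        )

store = ChallengeStore()
token = store.issue("W7KPM3")
print(store.verify(token, "wrong"))   # False, and the token is now used up
print(store.verify(token, "W7KPM3"))  # False: no second guess on the same token
```

Neither step helps against the relay trick, where real people are fooled into solving the CAPTCHA on another site.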

MySpace is late to the party, but its system is pretty typical:

[Image: MySpace CAPTCHA]

Earthlink is a bit more colorful:

[Image: Earthlink CAPTCHA]

And, finally, here's what Yahoo uses:

[Image: Yahoo CAPTCHA]

They're all using the same basic concept and it works, though it can definitely be frustrating when they twist or distort the text too much and just cause a headache.

I would say that if you are having spam problems on your site, it's quite probably from automated tools pretending to be humans, and a CAPTCHA system might well be just what you need to make the problem disappear.

If only it were so easy to make the spammers disappear.




Comments

I have recently added reCAPTCHA to my site and my spam has completely disappeared, from hundreds a day to only real people leaving real comments. Now, I know it may not always be that clear cut, but it worked well for me.

I figure Dave's site, with all its Google PageRank goodness, is a bigger target for many spammers, so his solution may need to be more sophisticated. With my own site, though, it was pretty obvious that certain pages (or entries) had been flagged as "spammable" and passed around to other spammers. There was very little sophistication about it.

I have a very "normal" blog, that is, I do not have hundreds of thousands of visits per month. reCAPTCHA was a wonderful solution, and I really liked it because it was helping out the folks who are teaching computers to digitize books too. Very nice, indeed. If you have your own website and use WordPress.org or MovableType blogging software on your site, they both have plugins to make reCAPTCHA work with very little hassle!

Posted by: Cooper Strange at March 10, 2009 10:50 PM

I was wondering: now that MySpace has a new 2.0 profile update, I want to put a custom MySpace music player up there, but it doesn't show up. Where do I actually put the code on MySpace? I've tried the general section of my profile and the CSS section, and neither worked.

Posted by: Eric D. at March 22, 2009 2:12 PM

© 2002 - 2009 by Dave Taylor. All Rights Reserved.

"Ask Dave Taylor®" is a registered trademark of Intuitive Systems, LLC.