I am subscribed to the Cocoa tag RSS feed on del.icio.us and I just received what I consider as SPAM: someone posted a bookmark to his/her site with a huge list of seemingly popular tags in effect spamming everyone subscribed to these tags… I’m only linking to the site in question to document (I used the rel="nofollow" attribute to prevent the spammer from benefiting from the link) what I am afraid is the first (though I hope it’s the last) example of del.icio.us spam.

Spamming del.icio.us this way is similar to the old meta trick, which used to be successful in the pre-Google days to ensure high-ranking on search results. It is much, much more efficient due to del.icio.us’ popularity though and is a “clever” use of folksonomies: people subscribed to tags are very likely to check sites marked with such tags. Moreover, RSS feeds ensure wide distribution of the link. I could see spammers efficiently use that technique to make people check their sites out.

The question now is what can be done to prevent such an obvious perversion (or is it?) of social bookmarking? One potential way would be to restrict the number of tags that could be assigned to bookmarks but this is obviously not that useful: spammer would use the 15 (or whatever the tag number limit is) most popular tags to still efficiently propagate their crap. Which would in turn lead to a corruption of the folksonomy: if most popular tags are spammed with irrelevant crap, the tags become less meaningful…

More elaborate solutions could be used by performing content analysis and compare the newly tagged link to the most popular links with the same tag (assuming that these links are relevant, which they should if you are to trust folksonomies) and check that there is some overlap content before allowing the tag to be used. Actually, this wouldn’t even work as illustrated by the cocoa tag itself: cocoa is most known as the substance from which chocolate is issued but on del.icio.us (with its probably slightly geeky crowd) most links tagged with “cocoa” refer to Mac OS X’s programming framework. Though rare, I have seen bookmarks to cocoa-the-substance-related websites in the midst of the flow of cocoa-the-OS-X-API links in the cocoa RSS feed. With the content analysis described above, people wanting to assign the cocoa tag to chocolate-related sites would probably be barred (no pun intended) from using it…

I am not quite sure what can effectively be done to prevent unscrupulous users from spamming others but it sure would be interesting to think about it. In the meant time, it’s a sad day for folksonomies. I am afraid that if you are interested (like me) in subscribing to tags via RSS, you might need to resign yourself to receiving, in a near future, a bunch of irrelevant crap along with legit links… :(

According to this post on the del.icio.us discuss mailing list, Joshua is aware of the issue and has code to deal with it in the next upgrade of del.icio.us. Very cool, though it’d be really interesting to know what kind of solution has been implemented.

,

No related posts.

Related posts brought to you by Yet Another Related Posts Plugin.