2010-08-29

IRC and Blogs have something in common

And I’m not referring to the bile and acid reflux; of course they have that in common, but I don’t care much about those; what I care they to have in common is simply spam.

In the past two months, my ModSecurity-based antispam method for the blog started to fail; sure enough, filtering by user agent works well enough to filter a lot of spam, but a lot of that has still been passing through. New waves of spam declare themselves coming from Firefox 3.0, or MSIE 7 on Vista, .. all plausible User-Agent strings that I cannot simply filter.

So I went a step further and started looking at newer patters; the answer was obvious after a few calls to the MultiRBL enquirer — most, if not all, the spam is coming from open proxies. I cannot be sure for all because I only went down to search the block lists that various IRC networks use to filter their connections; a lot of IPs are in most of them, a few are just in one or two, and a few are in none. But given the way those proxy work, it’s quite natural that they might be found blog spamming but not IRC spamming.

Where does this leave us? Not really any further than we were before; IRC networks use double-protection: they use the DNSBL to shut people out when they know them, and all the others, they portscan to make sure that they are not really open proxies. Now this can backfire, since not all networks probe their clients properly – I had my own vserver blacklisted on a network before because it redirected to my main website when called with an unknown vhost… and the IRC network only tested if it accepted the request rather than testing that the request caused a further connection – and takes a bit of time to perform especially for slower clients. On a distributed, non-low-latency network like IRC, this is acceptable. For blog comments, not so much.

Right now I can only rely on the multiple DNSBL checks, and even those tend to be cumbersome and slow down the connection; having a much tougher test is not going to be a happy situation neither for my server, nor for the people who would like to comment.

On the other hand, I’m thinking whether I can prepare the tougher test directly in ModSecurity, and running the complete, hungry test on the second comment in a day. That, would be a nice situation. But I’ll need more time to do that, as it would need to shell out to some other program, or write an openproxy checker in LUA.

If somebody has other ideas and solutions, they are definitely welcome.

Flameeyes 2845 posts

Comments 9

Owen Shepherd says:

2010-08-30 at 00:43

Akismet?

Reply
Flameeyes says:

2010-08-30 at 00:46

Akismet works to hide spam from the published feed and articles’ pages… not so much to filter them beforehand.Given that I have to waste time to delete the comment spam beforehand (and the newest Typo updates actually make spam comments even more obnoxious because they are shown expanded on the feedback view as well), I’d rather filter them beforehand entirely.This way, I can also let _all_ the posts open for comments even years after writing them, which is quite rare unfortunately, in other blogs.

Reply
user99 says:

2010-08-30 at 15:52

64 bytes from vanguard.flameeyes.eu (213.239.237.6): icmp_req=4 ttl=47 time=154 ms so it’s live….thinking that you are blacklisted by WoT plugin.

Reply
Flameeyes says:

2010-08-30 at 16:03

@user99 this blog, my website, and the two links to the right are _all_ on the same vserver; _all_ with the same configuration, and filtering is done actively _only_ on commenting.

Reply
user99 says:

2010-08-30 at 19:52

@ flameeyes: I understand …still from two different IP’s the links on the right are inaccessible. Cname not resolving? I’ll ‘dig’ them later.one IP- homeother IP- work

Reply
user99 says:

2010-08-30 at 20:21

(on lunch :-0 )they don’t resolve the same.david@random ~ $ dig altercut.it; <<>> DiG 9.7.1 <<>> altercut.it;; global options: +cmd;; Got answer:;; ->>HEADER<<- opcode: QUERY, status: NOERROR, id: 7751;; flags: qr rd ra; QUERY: 1, ANSWER: 1, AUTHORITY: 0, ADDITIONAL: 0;; QUESTION SECTION:;altercut.it.INA;; ANSWER SECTION:altercut.it.86400INA213.239.237.6;; Query time: 149 msec;; SERVER: 68.87.74.166#53(68.87.74.166);; WHEN: Mon Aug 30 12:07:41 2010;; MSG SIZE rcvd: 45david@random ~ $ dig blog.flameeyes.eu; <<>> DiG 9.7.1 <<>> blog.flameeyes.eu;; global options: +cmd;; Got answer:;; ->>HEADER<<- opcode: QUERY, status: NOERROR, id: 28139;; flags: qr rd ra; QUERY: 1, ANSWER: 3, AUTHORITY: 0, ADDITIONAL: 0;; QUESTION SECTION:;blog.flameeyes.eu.INA;; ANSWER SECTION:blog.flameeyes.eu.85960INCNAMEhttp://www.flameeyes.eu.http://www.flameeyes.eu.85960INCNAMEvanguard.flameeyes.eu.vanguard.flameeyes.eu.85960INA213.239.237.6;; Query time: 38 msec;; SERVER: 68.87.74.166#53(68.87.74.166);; WHEN: Mon Aug 30 12:08:21 2010;; MSG SIZE rcvd: 92

Reply
user99 says:

2010-08-31 at 07:39

david@random ~ $ dig http://www.altercut.it; <<>> DiG 9.7.1 <<>> http://www.altercut.it;; global options: +cmd;; Got answer:;; ->>HEADER<<- opcode: QUERY, status: NOERROR, id: 55979;; flags: qr rd ra; QUERY: 1, ANSWER: 1, AUTHORITY: 1, ADDITIONAL: 0;; QUESTION SECTION:;http://www.altercut.it.INA;; ANSWER SECTION:http://www.altercut.it.86203INCNAMEhosting.flameeyes.eu.;; AUTHORITY SECTION:flameeyes.eu.10603INSOAdns14.ovh.net. tech.ovh.net. 2010082301 86400 3600 3600000 86400;; Query time: 28 msec;; SERVER: 68.87.74.166#53(68.87.74.166);; WHEN: Mon Aug 30 23:20:32 2010;; MSG SIZE rcvd: 121david@random ~ $ ping hosting.flameeyes.euping: unknown host hosting.flameeyes.eudavid@random ~ $ ping http://www.altercut.itping: unknown host http://www.altercut.itdavid@random ~ $ ping altercut.itPING altercut.it (213.239.237.6) 56(84) bytes of data.64 bytes from vanguard.flameeyes.eu (213.239.237.6): icmp_req=1 ttl=47 time=154 ms

Reply
user99 says:

2010-08-31 at 07:44

either need a cname for http://www.altercut.it or fix your html

Reply
David says:

2010-08-31 at 13:48

Perhaps you’ve already considered this, but have you thought of using a Hashcash-like system? Hashcash in its original incarnation requires a client to find a hash collision of N bits (configurable). There are some javascripts implementations, one plug-in for WordPress iirc. You could either require a small N for each comment, or a somewhat larger N for the first comment and set a cookie.

Reply

Popular tags

The Latest
View All

Identity Crisis In The Age of AI

Bloke On A Trike

This Blog, Brought To You Through AI! (Well, Kinda)

Did I Finally Solve My Audiobook Woes? Well, Maybe.

IRC and Blogs have something in common

Comments 9

Leave a ReplyCancel reply

IRC and Blogs have something in common

Share this:

Comments 9

Leave a ReplyCancel reply

Related Posts