I guess they picked the wrong guy to test the spam algorithm on.
<see title>
Linux kernel supremo Linus Torvalds has published a scathing open letter to Google's Gmail team after discovering that the service had incorrectly marked hundreds of his incoming email threads as spam – including ones containing kernel patches. "Something you did recently has been an unmitigated disaster," Torvalds wrote in …
This post has been deleted by its author
At least they were not discarding it altogether as they do with mail being sent to domains which have both v6 and v4 MXes.
If gmail starts trying to deliver via v6 it _NEVER_ falls back to a v4 MX. So if the v6 MX has an issue, well you just lost your mail. So effectively, there is no MX order fallback.
So, looking at that, do you expect antispam to work in a company which has degenerated to a point where the mail team does not understand the concept of an MX how mail delivery should work? I would not. I guess I am not the only one too as I have just noticed that comcast has removed the v6 MX off their DNS records.
Basically, Google's enforcing DKIM from certain domains, and if a message is "from" someone whose e-mail host provides proper DKIM, but it's missing it, Google (and Yahoo) servers reject it. Mailing lists aren't usually set up to properly handle DKIM (being, effectively, a relay), and therefore get rejected.
The workaround that I saw one mailing list use was to resend the e-mail from the mailing list's address, append "via (mailing list name)" to the name on the from field, and just have both the mailing list and the original author in reply-to.
It's DMARC that's to blame (being a broken solution). DKIM itself is fine.
It's also the obnoxious "conversation view" I suspect. Switch that off and only the actual spam ends up in the spam box. I probably get less mail than Linus, but I do get 100 or more per day on gmail, and the false positives are rare. But come on, if you care about this, you need to eyeball the spam folder a couple of times a week. I see less than 1% false positives and they are almost all from user@yahoo.com via mailing lists, and caused by DMARC.
DMARC is massively broken, because it mandates an SPF test on the From header, even if a Sender header is present. What it should do is to test the Sender if present, else the From, but it doesn't.
Most mailing lists work completely RFC-compliant by adding a Sender header (known as the 'secretary scenario'). However, to get past DMARC tests, they have to violate the RFC and rewrite the From header instead, concealing the originator of the mail.
This post has been deleted by its author
He probably uses it because:
1) Volume - The quantity of general mail he gets and the amount of spam he might get be vast, so he needs something that can handle the GBs of email smoothly. His inbox size could be huge and difficult to not only maintain but search. Bet he doesn't "Inbox Zero" at the end of the day.
2) Conversation View - Yes I know that Thunderbird or other tools do provide conversation view but Thunderbird is increasingly less performant especially on larger mail boxes (in my experience).
3) Spam - This is the controversial part of the mix because it is the bit that failed, but good spam filters are rare.
I spent many years running my own mail server, dealing with spam filters, I've moved to a hosted solution mainly because I can't be bothered with the hassle any more. My IMAP host isn't google which means it is relatively slow and has terrible webmail, but at least I don't have to worry anymore.
Anyone have any suggestions of good value hosted IMAP providers with decent webmail (e.g. Horde IMP not just Squirrel Mail) and configurable spam filters?
Google has legal right to keep and analyse every single mail that comes in or out. Also being an American company it can create major international problem for hackers.
NSA probably runs a cooler Linux (SELinux) too, why doesn't Linus get linus@nsa.gov next time? They probably have better anti spam filters.
Is GMail code open source?
I know it doesn't violate the GPL because it's never transferred to anyone, making it even more closed than non-open source code released as binary only.
Thereby, the fact it runs on some variant of Linux is close to irrelevant. Google does whatever it likes and you have to accept it.
If I where Linus, I would run the mailing list out of standard, well known mail applications, without Google-in-the-middle.
Like an April Fool gag. You're ready to post "Really!! He uses Gmail??" - then you remember what day it is.
(Notice, I'm hedging my bets, here!)
Nb. So, what, he uses Gmail because it's free?? My superior provider charges me ~£5 a year. Runs a tight ship, does ol' Linus?
"Nb. So, what, he uses Gmail because it's free?? My superior provider charges me ~£5 a year."
Gmail may not necessarily be free, depending on how Torvalds and/or the Foundation use it - for UK pricing, I'll they'll see your £5 per year and raise it to £3.30 or £6.60 per user per month.
I haven't checked those prices for myself - I've no intention of farming the email for any of my domains out to Google - but because of clients who seem to think Google's arse is the source of our sunshine.
There is something *soul sucking* about having people, some very clever indeed in their anti-social way, making your job harder on purpose...and it's every day. It rarely slows down and it never ever stops. After 20 years of Spam (I had been running a mailserver for some years by 1994) it was just too depressing.
Antispam tends to be a grind job and I, for one, don't want Linus Torvalds having to deal with it. He has plenty of necessary grind in his life already without adding to it.
The day I stopped running a significant mail server was a bright and wonderful new chapter in my life.
I am in no way exaggerating.
> Antispam tends to be a grind job
Yes and unfortunately our continued use of ever-fancier methods of looking for it after it has arrived only serves to mask the problem so the end users (and NB higher-ups) don't understand the scale of it except for some stats and using it to pretend they never got that email.
There are easy ways to reduce the quantity of it (and I'm as guilty as anyone of posting that 'your spam solution does not work' list) but require several things to happen and the fact that any one thing won't kill all of it at once shouldn't be the reason not to bother but there's a huge amount of money (as well as our time and efforts) tied up in this 'filtering technology' so nobody wants to try anything different.
100% agree.
It's a pity though that a man who knows how hard large scale software development is, how hard a bug free program is, and how hard accurate spam detection is... would have a rant at a bunch of software developers for making the occasional and utterly unavoidable mistake.