Rspamd 1.2 has been released

2016-03-21 00:00:00 +0000

The next major release of rspamd: 1.2.0 is now released.

Key features:

Dynamic rule updates
Regular expressions maps support
Better performance: pcre2 support, faster fuzzy hashes, faster IP lookups
Improved stability: fixed many important bugs and memory leaks

This version is a gradual improvement over the previous 1.1 branch. It is the first release with rule updates support. I believe that it would be easier to backport new rules or critical score changes from the experimental branch to the stable one. Updates are signed to protect their integrity and authenticate the update source.

Among other features introduced by this version are regular expression maps support (with hyperscan acceleration if available). These maps could be used to match many regular expressions and, at the same time, detect certain patterns in the messages being scanned.

Rspamd 1.2 has a couple of performance improvements: it now supports the PCRE 2 regular expressions library that is usually faster than pcre 1. Fuzzy hashing gets further improvements by utilizing AVX2 instructions which are available for the Intel Haswell CPU family. From version 1.2 onwards, rspamd uses a better algorithm to store IP addresses allowing lookups among millions of IPv4 and IPv6 records in almost zero time.

The new release is scanned with Coverity scan and other static analysis tools that helped to fix many potential bugs and leaks. I believe that rspamd 1.2 is stable, solid and completely production-ready so far.

The complete log of changes can be found here: https://github.com/vstakhov/rspamd/blob/1.2.0/ChangeLog

There are many important additions in the documentation shipped with rspamd. There is now a frequently asked questions article that describes many aspects of practical rspamd use. The quick start guide has also been updated to improve new users’ experience when installing and running rspamd.

Rmilter has been also upgraded to version 1.7.5 which fixes important greylisting and clamav issues. The rmilter changelog is available here: https://github.com/vstakhov/rmilter/blob/1.7.5/ChangeLog

Rspamd vs Spamassassin performance comparison

2016-03-03 00:00:00 +0000

Just before 1.2 release, I have measured performance of rspamd comparing to SA. In this experiment, I’ve taken rspamd master branch with default rules. Then I’ve added all rules from SA using spamassassin plugin. Hence, two scanners run with almost exact set of rules.

This set is quite large and it includes about 3k of custom regexp rules. Rspamd runs without hyperscan and pcre2, so it performs literally the same job as SA does. And here are results for about 100k messages being scanned:

Total False Positives: 517
Total False Negatives: 348
Total messages: 101349

Total SA time: 423942 seconds, total rspamd time: 33149 seconds
Average SA time: 4182ms/msg, average rspamd time: 327ms/msg seconds

So the difference in checks is less than 1% and in many cases rspamd does better job than SA because, for example, multiple hits of URIBL rules, phishing detection and some other differences. And it’s still 13 times faster than SA. Moreover, it eats less memory and can process more messages in parallel. In other experiments, rspamd was able to process about 450 messages per second on a single SandyBridge 4 cores scanner box.

I plan to release rspamd 1.2 very soon with a lot of cool features, including dynamic rules updates. I would appreciate any help in testing of the experimental packages. In fact, they are already used in production and are even more stable than 1.1 branch.

Rspamd switches to apache 2 license

2016-02-04 00:00:00 +0000

In the modern world, software patents are a significant threat for the Open Source software. Therefore, I have decided to switch from the original BSD license to Apache-2 license. Whilst Apache license has the same permissive clauses as BSD license has there is an explicit definition of software patents in Apache license. The Apache License contains both a patent grant and a patent retaliation clause.

Another terms of licensing have not been changed: you can still use the code in your projects and you are not obliged to open your modifications to the code like you need in GPL. Contributed code is still licensed under BSD license.

Rspamd 1.1.2 has been released

2016-01-29 00:00:00 +0000

The next feature release 1.1.2 of rspamd is out. This release contains some important improvements:

Add support for forged confirmation headers (by @AdUser)
Improve multimap plugin: add filtering support
Add rspamadm statconvert utility to convert statistical tokens and learn cache from sqlite3 to redis
Add logging for slow rules and regexps
Add mime_types plugin to check sanity of mime types in messages

Bugfixes and minor improvements in this version:

Fix stat_cache closing
Add checkpoints to sqlite3 learn cache
Do not recompile lua generated headers all the time
Increase number of messages learned
Fix issues with dual stack and hfilter
Disable MID checks for hfilter by default
Fix cache definitions in multiple classifier and no type
Don’t crash if learn cache failed to initialize
Fix googlegroups support in maillist plugin
Rework flags LUA API:
- Allow to check for a specific flag
- Add learn_spam, learn_ham and broken_headers flags
- Unify internal functions
Add BROKEN_HEADERS rule
Allow any, mime and smtp for get_from/get_recipients
Add rule to detect spammers attempts to cheat mime parsing
Rework parsing of IP addresses in configuration (better IPv6 support)
Add util.parse_mail_address function to LUA API
Add lua sqlite3 module
Implement synchronous redis call
Ratelimit: avoid possible indexing of nil value (Fixes #498) (by @fatalbanana)
Implement redis advanced lua api with pipelining
Fix memory leak on redis stat (#500)
Fix user/language learn count in sqlite statistics (#496) (by @fatalbanana)
Fix build with custom pcre
Fix fuzzy relearning (#498)
Improve planning of asynchronous tasks
Add base32 decode/encode routines to lua util
Allow converting of learn cache from sqlite to redis
Add methods to check if a messages has from/rcpts
Disable reload command in rc scripts
Improve runtime CPU dispatcher for libcryptobox
Add preliminary support of digital signatures via ed25519
Add detection for RDRAND support
Print configuration of crypto on start
A in SPF presumes AAAA lookup as well

This version has full backward compatibility with 1.1.0 preserved.

Rspamd 1.1 has been released

2016-01-18 00:00:00 +0000

The next major release of rspamd: 1.1.0 is now released. In this version, I did another bunch of architectural rework. This time, I’ve refactored fuzzy storage, regular expressions processing and statistics primarily.

A number of features that I was asked constantly about have been added to rspamd, including such features as:

Autolearning for BAYES: statistics can learn on good or bad messages automatically
Redis backend for statistics to enable distributed and fast redis storage for rspamd cluster
Scalable fuzzy storage: it is now possible to scale hashes storage across multiple processes to process thousands of requests per second

There is also major performance improvement: hyperscan engine support for optimizing regular expressiosn execution.

With this version, I have added a lot of documentation, including tutorials and improved quick start guide.

Here is the full changelog for this version available on GitHub: https://github.com/vstakhov/rspamd/blob/1.1.0/ChangeLog

The new version is almost 100% backward compatible with 1.0 branch but please check the migration document if you are using per-user statistics and rspamd-1.0.

Here are some graph of rspamd performance on scanning:

So you can see that rspamd can scan as much as 200 messages per second consuming less that 50% cpu of a typical scanner machine (Xeon E5405 single CPU).

Rmilter is also upgraded to the version 1.7.0 that brings full IPv6 support, redis cache support and major cleanup of unused and broken stuff. Rmilter changelog is available here: https://github.com/vstakhov/rmilter/blob/1.7.0/ChangeLog