Moderator: Forum Moderator


cd /usr/sbin
wget http://www200.pair.com/mecham/spam/UpdateSaneSecurity.sh.txt
mv UpdateSaneSecurity.sh.txt UpdateSaneSecurity.sh
chmod +x UpdateSaneSecurity.sh
UpdateSaneSecurity.sh
drwxr-xr-x 2 clamav clamav 4096 2007-06-16 19:48 daily.inc
-rw-r--r-- 1 clamav clamav 9351789 2007-06-10 21:16 main.cvd
-rw------- 1 clamav clamav 260 2007-06-16 19:14 mirrors.dat
-rw-r--r-- 1 clamav clamav 347982 2007-06-16 19:25 MSRBL-Images.hdb
-rw-r--r-- 1 clamav clamav 228232 2007-06-08 04:33 MSRBL-SPAM.ndb
-rw-r--r-- 1 clamav clamav 1033688 2007-06-16 19:48 phish.ndb
-rw-r--r-- 1 clamav clamav 174338 2007-06-15 02:55 phish.ndb.gz
-rw-r--r-- 1 clamav clamav 516182 2007-06-16 19:48 scam.ndb
-rw-r--r-- 1 clamav clamav 102738 2007-06-15 02:55 scam.ndb.gz

Right now I have mimedefang running 'pdfinfo $file 2>&1' for each pdf, and if it has an error, I'm quarantining the entire message.





DESCRIPTION
sa-compile uses "re2c" to compile the SpamAssassin ruleset. This is
then used by the "Mail::SpamAssassin::Plugin::Rule2XSBody" plugin to
speed up SpamAssassin's operation, where possible, and when that plugin
is loaded.
"re2c" can match strings much faster than perl code, by constructing a
DFA to match many simple strings in parallel, and compiling that to
native object code. Not all SpamAssassin rules are amenable to this
conversion, however.



score TVD_PDF_FINGER01 2.0
pinoyako amavis[12315]: (12315-04) Blocked INFECTED (Email.Stk.Gen588.Sanesecurity.07071604.pdf)

# <LICENSE>
#
# Free as in beer. Free as in speech. You get the picture...
#
# </LICENSE>
#
# File: pdfinfouserrules.cf
# Version: 0.1
# Created: 2007-07-19
# Modified: 2007-07-19
# Author: Andy Kinnard (AnonymousDog) andyk at slcpa dot biz
# Requires: PDFInfo.pm plugin
# License: None
# Description: This plugin/ruleset combination will help you alleviate the new
# PDF based stock spam which began to appear mid-June, 2007.
#
# Changes:
#
# 0.1 - initial ruleset.
#
ifplugin Mail::SpamAssassin::Plugin::PDFInfo
# pdf_match_details()
body GMD_PRODUCER_UNKNOWN eval:pdf_match_details('producer','/^unknown$/')
describe GMD_PRODUCER_UNKNOWN Missing PDF meta data for producer
score GMD_PRODUCER_UNKNOWN 0.5
body GMD_CREATED_ZERO eval:pdf_match_details('created','/^0$/')
describe GMD_CREATED_ZERO Missing PDF meta data for created date
score GMD_CREATED_ZERO 1.0
# The next four should be just meta b/c they're very common in ham and uncorrupted pdfs. The descriptions follow from the above two.
body __GMD_CREATOR_UNKNOWN eval:pdf_match_details('creator','/^unknown$/')
body __GMD_TITLE_UNTITLED eval:pdf_match_details('title','/^untitled$/')
body __GMD_MODIFIED_ZERO eval:pdf_match_details('modified','/^0$/')
body __GMD_AUTHOR_UNKNOWN eval:pdf_match_details('author','/^unknown$/')
# End of four
body GMD_PRODUCER_TEXT2PDF eval:pdf_match_details('producer','/^text2pdf/')
describe GMD_PRODUCER_TEXT2PDF PDF meta data for producer begins with text2pdf
score GMD_PRODUCER_TEXT2PDF 3.0
body GMD_PRODUCER_IMAGEMAGICK eval:pdf_match_details('producer','/^ImageMagick/')
describe GMD_PRODUCER_IMAGEMAGICK PDF meta data for producer begins with ImageMagick
score GMD_PRODUCER_IMAGEMAGICK 0.001
body GMD_PRODUCER_EASYPDF eval:pdf_match_details('producer','/easyPDF/')
describe GMD_PRODUCER_EASYPDF PDF meta data for producer contains easyPDF
score GMD_PRODUCER_EASYPDF 0.5
body GMD_TITLE_STOCK eval:pdf_match_details('title','/stock/')
describe GMD_TITLE_STOCK PDF meta data for title contains stock
score GMD_TITLE_STOCK 2.0
# metas
meta GMD_PDF_LIKELY_CORRUPT ( GMD_PRODUCER_UNKNOWN && GMD_CREATED_ZERO )
describe GMD_PDF_LIKELY_CORRUPT Missing PDF meta data for producer and created date indicates probable PDF format corruption
score GMD_PDF_LIKELY_CORRUPT 1.5
meta GMD_MISSING_LESSER_DETAILS ( __GMD_CREATOR_UNKNOWN && __GMD_TITLE_UNTITLED && __GMD_MODIFIED_ZERO && __GMD_AUTHOR_UNKNOWN )
describe GMD_MISSING_LESSER_DETAILS Missing PDF meta data for ALL lesser details: creator, title, modified date, and author
score GMD_MISSING_LESSER_DETAILS 0.5
meta __GMD_KNOWN_SPAM_PRODUCERS ( GMD_PRODUCER_TEXT2PDF || GMD_PRODUCER_IMAGEMAGICK || GMD_PRODUCER_EASYPDF )
describe __GMD_KNOWN_SPAM_PRODUCERS PDF meta data for producer matches one of those deemed "known spam producer"
# This rule needs more titles to be effective
meta __GMD_KNOWN_SPAM_TITLES ( GMD_TITLE_STOCK )
describe __GMD_KNOWN_SPAM_TITLES PDF meta data for title matches one of those deemed "known spam titles"
# This rule won't be effective until __GMD_KNOWN_SPAM_TITLES is
meta GMD_PRODUCER_AND_TITLE ( __GMD_KNOWN_SPAM_PRODUCERS && __GMD_KNOWN_SPAM_TITLES )
describe GMD_PRODUCER_AND_TITLE PDF meta data for title AND producer match those deemed "known spam *"
score GMD_PRODUCER_AND_TITLE 0.001
endif

gblades wrote:Rule hits analysis for your rules. Fields are :-
rule name, rule description, total, ham number, ham percent, spam number, spam percent
GMD_PRODUCER_UNKNOWN 35 2 5.7 33 94.3
GMD_PRODUCER_EASYPDF 12 0 0 12 100
analysing GMD_PRODUCER_UNKNOWN furthur the two HAM hits were false positives so I would reduce the score for this rule.


I am so reticent in joining another mailing list; that is what they use for a forum.





We're seeing zero...nothing since Monday. Nothing is getting though either; so, they didn't just change their pdfs -- they turned it off (some bots may wander off on their own). I'd expect this to resurface as a slightly retooled attack soon (before school gets back in).Marius wrote:Big decrease here too. We were getting sometimes thousands per day at the most, and so far today only 30 or so.
Makes you wonder doesn't it? I wonder if they have changed their messages to evade our signatures.

I welcome patches... but I will not make anything available that
requires outside perl module dependancies.
Well that's great, since the basic meta data parsing is broken anyway.gblades wrote:PDFInfo 0.8 has been released and now works on .fdf files aswell.
Which is just fabulous! So, he insists on reinventing the wheel despite there being acceptable CPAN modules for getting meta data from pdfs.gblades wrote:I mailed the author about the comments on this thread and got the following reply :-I welcome patches... but I will not make anything available that
requires outside perl module dependancies.



gblades wrote:I have been running botnet for a while...These are the botnet rules I am using. I prefer to assign smaller scores to the individual tests and then whitelist IP's from regular senders which false positive.

Return to FuzzyOCR and the Magic of Image Spam
Users browsing this forum: No registered users and 0 guests