PDFassassin is a SpamAssassin module that scans PDF attachments in email messages. Email bodies are checked for PDF attachments upon connection. Text is extracted from each PDF with pdftotext and scanned by SpamAssassin. If the PDF contains images, the gocr program is called to extract their text content. The PDF's total spam score is compared against the global required_score setting; if it is higher, the score specified in pdf.cf is added to the overall score of the email message.
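The extraction and scoring steps described above can be sketched roughly as follows. This is an illustration in Python, not the module's own code: the function names and parameters (extract_pdf_text, pdf_score_contribution, message_score_with_pdf, configured_score) are hypothetical, and the only external call assumed is the standard `pdftotext <file> -` invocation, which writes the extracted text to standard output. The OCR step with gocr is left out.

```python
import subprocess


def extract_pdf_text(pdf_path: str) -> str:
    """Return the text layer of a PDF by running pdftotext (assumed to be on PATH)."""
    # Passing "-" as the output file makes pdftotext write to standard output.
    result = subprocess.run(
        ["pdftotext", pdf_path, "-"],
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


def pdf_score_contribution(pdf_score: float, required_score: float,
                           configured_score: float) -> float:
    """Return the amount one PDF attachment adds to the message score.

    pdf_score        -- spam score computed for the extracted PDF text
    required_score   -- the global required_score threshold
    configured_score -- hypothetical name for the score set in pdf.cf
    """
    # Only a PDF that scores strictly above the threshold contributes anything.
    if pdf_score > required_score:
        return configured_score
    return 0.0


def message_score_with_pdf(message_score: float, pdf_score: float,
                           required_score: float, configured_score: float) -> float:
    """Return the overall message score after accounting for one PDF attachment."""
    return message_score + pdf_score_contribution(
        pdf_score, required_score, configured_score
    )
```

Note that under this reading the PDF's own score is never added to the message directly; it only decides whether the fixed score from pdf.cf is applied.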
Tags: Communications, Email, Filters
Operating Systems: POSIX, IRIX
Release Notes: This release fixes clean_pdf_temp() declaration comments.
Release Notes: This initial release implements the SpamAssassin module, which scans the content of PDF attachments in email messages and adds the resulting spam score to that of the originating email message.