[dspam-users] 3.4.6: tum builds spam corpus very slowly and spam subject always on

From: Andreas Klemm <andreas@klemm.apsfilter.org>
Date: Mon Aug 08 2005 - 02:27:35 EDT

Hi,

its about this dspam version (dspam-3.4.6.20050523.0845).

This time I tried to setup dspam, that it gets trained.
Since by training corpus I somtimes get strange results,
too weighted to the one or the other side (spam/innoc.).

But the corpus only is increasing very slowly.

Seems for me as if training lasts a year or even more.
Though I get aprox. 2000 mails a day. Mainly mailinglist
traffic and ... spam.

Look here: Only 157 and 11 for spam and innocent corpus.

This few in about 10 weeks:
Look, from when my db newinstall is:
root@titan[ttyp2]{221} /var/db/pkg/postgresql-server-8.0.3 ll
total 80
-rw-r--r-- 1 root wheel 58 May 21 19:07 +COMMENT

root@titan[ttyp2]{211} ~ dspam_stats andreas
andreas TS: 6204 TI: 23156 SM: 1045 IM: 77 SC: 157 IC: 11

Part of my config.

TrainingMode tum
#Feature sbph
Feature chained
Feature tb=4
#Feature whitelist
Feature noise
Algorithm graham burton
PValue graham
Preference "signatureLocation=headers" # 'message' or 'headers'
Preference "spamAction=tag"

#Preference "spamSubject=SPAM"
btw, although I commented out the spamsubject I still get it
prepended to the subject as can be seen here.

1424 N Jan 30 €É¯Å¯S°Ï¡ãº^°®§ ( 49) [SPAM] »ŽÃPŽ£€É§Aªº¹qž£®Ä¯à¡A¥Î³Ì«K©yªº»ù
1425 N Feb 01 ~Áú­·ºô­¶€j®v~ ( 194) [SPAM] ³ÌHOTÁú¬yºô­¶§A€]¥i¥HŸÖг¡F³Ì¶W­ÈÁ

        Andreas ///

-- 
http://www.64bits.de
http://www.apsfilter.org
http://people.FreeBSD.org/~andreas
Received on Mon Aug 8 02:31:00 2005

This archive was generated by hypermail 2.1.8 : Thu Sep 29 2005 - 13:51:28 EDT