Journal of Computers, Vol 4, No 3 (2009), 245-250, Sep 2009
doi:10.4304/jcp.4.3.245-250

Application of Refined LSA and MD5 Algorithms in Spam Filtering

Jingtao Sun, Qiuyu Zhang, Zhanting Yuan

Abstract


The paper proposes a spam filtering method that uses integrated and refined Latent Semantic Analysis (LSA) and Message-Digest Algorithm 5 (MD5) algorithms to address a series of universal problems in spam filtering, including remarkably lowered filtering precision and notably unbalanced filtering efficiency as a result of lack of latent semantic analysis of mail contents. In introducing LSA, its weighting function is improved by integrating fuzzy membership to improve effectiveness of LSA in processing mail contents. On top of this, MD5 algorithm is used to generate “E-mail fingerprint”, thus enabling quick matching and realizing highly efficient and accurate processing of mass- mailing spam. The result of the simulation experiment testifies effectiveness of the method.



Keywords


Latent Semantic Analysis; Message-Digest Algorithm 5; Fuzzy Membership; E-mail Fingerprint; Spam Filtering

References



Full Text: PDF


Journal of Computers (JCP, ISSN 1796-203X)

Copyright @ 2006-2012 by ACADEMY PUBLISHER – All rights reserved.