
Research and Implementation of Spam Filtering System (垃圾邮件过滤系统的研究与实现)

Cited by: 9
Abstract: This paper briefly introduces the main spam filtering techniques and analyzes the shortcomings of several classification algorithms that have already been applied to content-based spam filtering. It applies a new classification algorithm (SECTILE) to the classification and filtering of spam, and designs a multi-layer spam filtering system. The system integrates three filtering techniques: blacklist/whitelist filtering, rule-based filtering, and content-based filtering. Experimental and analytical results indicate that the system improves both the efficiency and the accuracy of spam filtering.
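Source: Computer Engineering (《计算机工程》), indexed in EI, CAS, CSCD, and the Peking University Core Journals list; 2006, No. 18, pp. 106-108, 124 (4 pages)
Funding: Hubei Province Key Science and Technology Project Fund (2003AA101C07)
Keywords: spam filtering; SECTILE; text categorization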
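
The multi-layer design described in the abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the record does not give the order of the layers, the rule set, or the details of SECTILE, so the names `Email`, `classify`, `subject_all_caps`, and `keyword_classifier` are hypothetical, and the content layer is a simple keyword count standing in for the real classifier, not an implementation of SECTILE.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, Set, Tuple


@dataclass(frozen=True)
class Email:
    sender: str
    subject: str
    body: str


def keyword_classifier(email: Email) -> bool:
    """Placeholder content layer: a keyword count, NOT the SECTILE algorithm."""
    spam_terms = ("free money", "winner", "click here")  # illustrative terms only
    text = f"{email.subject} {email.body}".lower()
    hits = sum(term in text for term in spam_terms)
    return hits >= 2


def subject_all_caps(email: Email) -> bool:
    """Hypothetical rule: a long subject written entirely in capitals."""
    return len(email.subject) > 10 and email.subject.isupper()


def classify(
    email: Email,
    whitelist: Set[str],
    blacklist: Set[str],
    rules: Iterable[Callable[[Email], bool]],
    content_classifier: Callable[[Email], bool] = keyword_classifier,
) -> Tuple[str, str]:
    """Return (verdict, deciding_layer); cheap layers run before costly ones.

    The layer order (whitelist, blacklist, rules, content) is an assumption.
    """
    sender = email.sender.lower()
    if sender in whitelist:
        return "ham", "whitelist"
    if sender in blacklist:
        return "spam", "blacklist"
    for rule in rules:
        if rule(email):
            return "spam", "rule"
    if content_classifier(email):
        return "spam", "content"
    return "ham", "content"
```

Passing the content classifier as a parameter means a trained text classifier could replace the keyword placeholder without touching the list and rule layers, and most messages from known senders never reach the more expensive content step.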

References (4)

  • 1 Pan Wenfeng. [D]. Beijing: Institute of Computing Technology, Chinese Academy of Sciences, 2004.7. Cited by: 22
  • 2 Sahami M, Dumais S, Heckerman D, et al. A Bayesian Approach to Filtering Junk E-mail [C]. Learning for Text Categorization: Papers from AAAI Workshop, Madison, Wisconsin, 1998: 55-62. Cited by: 1
  • 3 Androutsopoulos I, Koutsias J, Chandrinos K V, et al. An Experimental Comparison of Naive Bayesian and Keyword-based Anti-spam Filtering with Encrypted Personal E-mail Messages [C]. Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Athens, Greece, 2000: 160-167. Cited by: 1
  • 4 Wang Jianhui, Wang Hongwei, Shen Zhan, Hu Yunfa. A Practical and Efficient Text Classification Algorithm [J]. Journal of Computer Research and Development (计算机研究与发展), 2005, 42(1): 85-93. Cited by: 20

