期刊文献+

面向中文特定信息变异的过滤技术研究 被引量:7

Information filtering for modified specific Chinese information
下载PDF
导出
摘要 研究了如何快速识别并过滤经过变异处理的中文信息的技术,并将变异规则限定在当前中文网络最常见的5种变异方法上.提出了一个快速而准确的中文信息多模式模糊匹配算法,该算法在WM算法的基础上融合了压缩编码的思想,适于实时地对网络信息进行处理.实验表明,基于该算法的信息过滤系统能够支持大量的输入模式,系统对模式的识别准确率超过了99%,并且达到了很高的执行效率.该算法在中文信息过滤领域有着广阔的应用前景. How to recognize and filter modified specific Chinese information is researched. This paper focuses on five most commonly-used modifying rules in Chinese network, and presents an efficient multi-pattern approximate matching-algorithm. This algorithm is based on WM algorithm and suitable for processing real-time Chinese information. The idea of compressing and encoding is also introduced. Experiments show that the information filtering system based on this algorithm can support a lot of patterns with an accuracy of over 99 % and high speed. This algorithm can be widely applied to Chinese information filtering.
出处 《高技术通讯》 CAS CSCD 北大核心 2005年第9期7-12,共6页 Chinese High Technology Letters
基金 国家高技术研究发展计划(863计划),国家自然科学基金
  • 相关文献

参考文献8

  • 1Knuth D E, J. H. Morris Jr and V. R. Pratt. Fast Pattern Matching in Strings. SIAM J Comput, 1977, 6( 1 ) : 323. 被引量:1
  • 2Boyer R S, Moore J S. A Fast String Searching Algorithm.Comm ACM, 1977, 20(10): 762. 被引量:1
  • 3Aho A V, Corasick M. Efficient String Matching: An Aid to Bibliographic Search. Cotton ACM, 1975, 18(6): 333. 被引量:1
  • 4Wu S, Manber U. A Fast Algorithm for Multi-pattern Searching. Technical Report, Tile Computer Seience Department,Tile University of Arizona. 1994. 被引量:1
  • 5Wu S, Manber U. Agrep- A Fast Approximate Pattern-matching Tool. In: Usenix Winter Technical Conference, San Francisco. 1992. 153. 被引量:1
  • 6Baeza-Yates R, Navarro G. New and Faster Filters for Multiple Approximate String Matching. Random Structures and Algorithms ( RSA ), 2002, 20( 1 ) :23. 被引量:1
  • 7Hyyroe H, Navarro G. Faster Bit-Parallel Approximate String Matching. In: Proceedings of the 13th Annual Symposium on Combinatorial Pattern Matching (CPM 2002). 2002. 203. 被引量:1
  • 8Kim S, Kim Y. A Fast Multiple String-Pattern Matching Mgorithm. In: Proceedings of the 17th AoM/IAoM International Confference on Computer Science. May 1999. 44. 被引量:1

同被引文献53

引证文献7

二级引证文献26

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部