期刊文献+

Web文本挖掘系统及其关键技术研究 被引量:11

The Key Technical Research of Text Mining System Based on Web
下载PDF
导出
摘要 随着网络信息的迅猛发展,信息量日益增加,怎样从海量的Internet上获取有用信息,WEB文本挖掘系统是挖掘技术的重要应用方向,它是指在给定的分类体系下,根据网页的内容自动判别内容类别的过程,论文对文本中所涉及的关键技术,包括K-最近邻参照法模型、基于隐马尔科夫模型(HMM)的信息抽取、机器学习方法,进行了研究和探讨,并且给出了基于信息抽取的文本挖掘系统的设计实现和下一步的研究重点。 With the development of network technology,the spread of internet become more and more quick.There are many types of complicated data in the information ocean.How to acquire useful knowledge quickly from the information ocean is the very difficult.The Text Mining based on Web is a new research field which can solve the problem effectively .This paper gives a research to several key techniques about Text Mining,including K-Nearest Neighbor Model, Information Extraction (IE) based on Hide in Markov Model (HMM), Machine Learning.It also describes a text mining model based on IE,and gives the results.
出处 《计算机工程与应用》 CSCD 北大核心 2003年第34期167-169,196,共4页 Computer Engineering and Applications
关键词 WEB文本挖掘 K-最近邻参照法 信息抽取 隐马尔科夫模型(HMM) Text mining based on Web,K-Nearest Neighbor,Information Extraction,Hide in Markov Model (HMM)
  • 相关文献

参考文献5

二级参考文献10

共引文献228

同被引文献62

引证文献11

二级引证文献18

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部