摘要
该文提出了结构关键词的概念,给出了结构概念格和内容概念格的形式化描述.结构概念格是对文档语义段的逻辑存储,内容概念格是对文档内容信息的逻辑存储.开发了一个基于文档的结构和内容构造两级概念格的信息抽取的实验系统.实验表明,该方法对减少信息抽取的时间和提高信息抽取的精度有显著的效果.
The concept of structure keywords is put forward, and the formal descriptions of structure and content concept lattice are introduced in this paper. The structure concept lattice is the logical storage of semantic structure of d octunents, and the content concept lattice is used to store content information of documents. Finally, an experiment system of IE based on two - level concept lattice is developed and the result indicates that the effectiveness of the proposed method is notable for reducing the time of IE and increasing the precision of IE.
出处
《江西师范大学学报(自然科学版)》
CAS
北大核心
2008年第2期179-183,共5页
Journal of Jiangxi Normal University(Natural Science Edition)
基金
国家自然科学基金资助项目(60575035)
关键词
结构关键词
结构概念格
内容概念格
structure keywords
structure concept lattice
content concept lattice