Abstract
Relation words are of great significance for analyzing the semantic relations between the clauses of a compound sentence. However, research on rule-based automatic identification of relation words shows that not every relation marker appearing in a compound sentence is a relation word, and picking out the markers that genuinely serve as relation words is both the focus and the main difficulty of the task. This paper proposes an algorithm for automatically labeling one typical kind of relation marker: markers that occupy adjacent positions. The algorithm combines a relation word lexicon with relation word extraction techniques and analyzes the adjoining features of these markers. Experiments show that the algorithm labels adjoining relation markers with an accuracy of 72.9%.
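As a rough illustration of the first step the abstract describes, the sketch below looks up candidate relation markers in a lexicon and reports the pairs that sit directly next to each other. It is only a minimal sketch under stated assumptions: the mini-lexicon, the function names `find_markers` and `adjacent_pairs`, and the longest-match scanning strategy are all hypothetical and are not taken from the paper. It also does not decide whether a marker is a genuine relation word, which is the part the paper's rules address.

```python
from typing import List, Set, Tuple

# Hypothetical mini-lexicon for illustration only; the paper's actual
# relation word lexicon is not reproduced here.
LEXICON: Set[str] = {"但是", "如果", "因为", "所以", "就", "那么", "即使", "也"}

# A span is (start index, end index exclusive, marker text).
Span = Tuple[int, int, str]


def find_markers(sentence: str, lexicon: Set[str] = LEXICON) -> List[Span]:
    """Scan left to right, taking the longest lexicon entry at each position."""
    max_len = max(len(word) for word in lexicon)
    spans: List[Span] = []
    i = 0
    while i < len(sentence):
        # Try the longest candidate first so "但是" wins over a shorter entry.
        for n in range(min(max_len, len(sentence) - i), 0, -1):
            candidate = sentence[i:i + n]
            if candidate in lexicon:
                spans.append((i, i + n, candidate))
                i += n
                break
        else:
            # No lexicon entry starts here; move on by one character.
            i += 1
    return spans


def adjacent_pairs(spans: List[Span]) -> List[Tuple[str, str]]:
    """Return consecutive markers where one ends exactly where the next begins."""
    return [
        (left[2], right[2])
        for left, right in zip(spans, spans[1:])
        if left[1] == right[0]
    ]


if __name__ == "__main__":
    sentence = "但是如果他不来,我们就不去了。"
    markers = find_markers(sentence)
    print(markers)
    print(adjacent_pairs(markers))
```

In this sketch, adjacency is defined purely by character offsets, so two markers separated by punctuation or any other character are not treated as adjoining. A real system would then apply rules to each adjoining pair to decide which members act as relation words.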
Source
《计算机科学》
CSCD
Peking University Core Journals (北大核心)
2012, No. 7, pp. 190-194 (5 pages)
Computer Science
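Funding
Key Project of the Key Research Bases of the Ministry of Education of China (10JJD740012)
Supported by the 2011 National Social Science Fund of China project (11BYY052)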
Keywords
Marked compound sentence
Adjoining feature
Auto-markup
Rule