
Text Subtopic-field Segmentation Based on Optimal Control Model
Abstract A text subtopic-field segmentation method based on an optimal control model is proposed. The objective function of the model is constructed from the within-subtopic-field distance, the between-subtopic-field distance, the within-subtopic-field angle, and the between-subtopic-field angle; solving the model yields the optimal subtopic-field segmentation of the text. The method is a global optimization method that is independent of any specific application domain and is therefore broadly applicable. Experimental results show that it outperforms other related algorithms in applicability, F1 measure, and WindowDiff measure.
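For readers unfamiliar with the two evaluation measures named in the abstract, the following Python sketch shows how WindowDiff (in its commonly used form, due to Pevzner and Hearst) and a boundary-level F1 can be computed. The 0/1 gap encoding, the function names, the window size, and the exact-match definition of F1 are assumptions made for illustration; the abstract does not state how the paper computed either measure.

```python
def window_diff(reference, hypothesis, k):
    """WindowDiff between two segmentations (lower is better).

    Each segmentation is a list of 0/1 flags, one per gap between adjacent
    sentences; 1 means "a boundary falls in this gap" (assumed encoding).
    A window of k gaps slides over both lists, and a window counts as an
    error when the two lists disagree on the number of boundaries in it.
    """
    if len(reference) != len(hypothesis):
        raise ValueError("segmentations must cover the same number of gaps")
    n_gaps = len(reference)
    if not 1 <= k <= n_gaps:
        raise ValueError("window size k must be between 1 and the number of gaps")

    windows = n_gaps - k + 1
    errors = 0
    for i in range(windows):
        if sum(reference[i:i + k]) != sum(hypothesis[i:i + k]):
            errors += 1
    return errors / windows


def boundary_f1(reference, hypothesis):
    """F1 over exactly matching boundary positions (illustrative definition)."""
    if len(reference) != len(hypothesis):
        raise ValueError("segmentations must cover the same number of gaps")
    true_pos = sum(1 for r, h in zip(reference, hypothesis) if r == 1 and h == 1)
    n_ref = sum(reference)
    n_hyp = sum(hypothesis)
    if true_pos == 0 or n_ref == 0 or n_hyp == 0:
        return 0.0
    precision = true_pos / n_hyp
    recall = true_pos / n_ref
    return 2 * precision * recall / (precision + recall)


# Hypothetical 9-sentence text (8 gaps): the hypothesis places its first
# boundary one gap too early and gets the second one right.
reference = [0, 0, 1, 0, 0, 1, 0, 0]
hypothesis = [0, 1, 0, 0, 0, 1, 0, 0]

wd = window_diff(reference, hypothesis, k=3)
f1 = boundary_f1(reference, hypothesis)
print(f"WindowDiff = {wd:.3f}, boundary F1 = {f1:.3f}")
```

In this toy case the near miss is penalized in only one of the six windows by WindowDiff, whereas exact-match F1 treats it as a full miss; this difference is one reason segmentation work often reports both measures.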
Source Journal of Jilin University: Science Edition (《吉林大学学报(理学版)》), 2009, No. 4, pp. 769-776 (8 pages). Indexed in CAS, CSCD, and the Peking University Core Journals list (北大核心).
Funding National Key Basic Research Development Program of China (973 Program), Grant No. 2004CB318000
Keywords subtopic-field segmentation; within-subtopic-field distance; between-subtopic-field distance; within-subtopic-field angle; between-subtopic-field angle
