期刊文献+

自动组卷中试题去重技术研究 被引量:9

Question similarity identification in automatic generation of test papers
下载PDF
导出
摘要 针对大规模题库中存在相似试题的问题,提出一种自动识别相似试题的方法.在知网词汇语义相似度模型的基础上,引入领域词汇对其进行改进,并且提出一种试题去重模型,来实现试题相似度的计算,解决了题库中相似和重复试题的自动识别问题,提高了相似试题识别的准确率.综合随机抽取法和试探回溯法两种组卷算法的优点,提出一种基于相似试题识别的组卷算法,提高了组卷的质量.实验表明试题相似度识别准确率达96%,非常接近人工判断结果,该方法不仅可以从同一试题类型内部,还可在不同类型之间消除相似试题.该方法已在C语言上机考试中进行了应用. To solve the problem of identifying similar questions in examination database, an algorithm for question similarity identification is proposed in this paper. By introducing domain words to the improvement of the word similarity model in HowNet, a model for question similarity identification is proposed to make the same or similar questions be identified and cut off automatically. This method improves the accuracy of identi- fication compared with other methods. By combining merits of the random selection with those of the back- tracking method, a new algorithm of generating papers automatically based on question similarity identification is proposed. It can guarantee the quality of papers. Test results show that the accuracy of question similarity i- dentification of this method is 96% , which approaches to that of manual identification. This method can cut off similar questions not only of the same type, but also of different types. Finally, this method has been applied to the on-line examination of C programming language.
出处 《哈尔滨工业大学学报》 EI CAS CSCD 北大核心 2009年第1期85-88,共4页 Journal of Harbin Institute of Technology
基金 国家自然科学基金资助项目(60673035)
关键词 相似题识别 智能组卷 难度等级 题库系统 similarity identification automatic paper generation difficult level system of examination
  • 相关文献

参考文献11

  • 1任爱华,武新利.题库建设的目标及数学模型[J].山东师范大学学报(自然科学版),1998,13(4):441-445. 被引量:35
  • 2林雪明,张钧良,蒋伟钢.基于知识点的试题库组卷算法的建立[J].微机发展,2001,11(2):77-79. 被引量:32
  • 3GUAN Y, WANG X L. Quantifying semantic similarity of Chinese words from HowNet[ C]//International Conference on Machine Learning and Cybernetics. Beijing: [s. n. ] , 2002:234 -239. 被引量:1
  • 4YU Z T, HU L. Similarity computation of Chinese question based on chunk [ C ]//International Conference on Machine Learning and Cybernetics. Dalian: [ s. n. ] , 2006 : 17 - 22. 被引量:1
  • 5李彬,刘挺,秦兵,李生.基于语义依存的汉语句子相似度计算[J].计算机应用研究,2003,20(12):15-17. 被引量:127
  • 6齐浩亮,杨沐昀,孟遥,韩习武,赵铁军.面向特定领域的汉语句法主干分析[J].中文信息学报,2004,18(1):1-5. 被引量:8
  • 7MANDREOLI F, MARTOGLIA R, TIBERIO P. A syntactic approach for searching similarities within sentences [ C ]//Proceedings of the eleventh international conference on Information and knowledge management. Virginia, USA : [ s. n. ] , 2002:635 - 637. 被引量:1
  • 8GAN K W, WONG P W, CHARNIAK E. Annotation information structures in Chinese texts using How net [ C]//Second Chinese Language Processing Workshop. Hong Kong: [ s. n. ] , 2000:85 -92. 被引量:1
  • 9NIRENBURG S, DONMASHNEW C, DEAN J. Two approaches to Matching in Example-based Machine Translation [ C ]//Proceddings of the fifth International Conference on Theoretical and Methodological in Machine Translation of Natural Languages. Kyoto, Japan : [ s. n. ] , 1993:45 - 57. 被引量:1
  • 10刘群 李素建.基于《知网》的词汇语义相似度计算.中文计算语言学,2002,7(2):59-76. 被引量:147

二级参考文献13

共引文献340

同被引文献44

引证文献9

二级引证文献19

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部