期刊文献+

基于多媒体信息检索的有监督词袋模型 被引量:7

Supervised bag of word model for multimedia information retrieval
下载PDF
导出
摘要 词袋模型的复杂度高,且区分能力较弱,为解决这一问题,在经典词袋模型的基础上,提出一种有监督的词袋模型。在训练过程中对训练样本类别进行标记,在此基础上构建直方图总体能量目标函数,依据能量最小准则学习码本。通过文本检索和图像检索两组多媒体信息检索实验进行对比,对比结果表明,有监督词袋模型比经典词袋模型的检索精确度高、检索耗时少。 The training and coding process of the classic bag of word model is unsupervised and does not require tag data.Although the adaptability of this approach is strong,the bag of word model is highly complex and has a weak ability to distinguish.To solve this problem,a supervised bag of word model based on the classical one was put forward.The proposed model needed to mark the category of samples in the training process.On this basis,the objective function of the overall energy of histogram was constructed,and the codebook was learned according to the minimum energy criterion.Through experimental comparison on two groups of multimedia information retrieval experiments including text retrieval and image retrieval,the results show that the supervised bag of word model is more accurate and less time-consuming than the classical one.
作者 袁桂霞 周先春 YUAN Gui-xia;ZHOU Xian-chun(School of Information and Mechanical and Electrical Engineering,Jiangsu Open University,Nanjing 210017,China;School of Electronic and Information Engineering,Nanjing University of Information Science and Technology,Nanjing 210017,China)
出处 《计算机工程与设计》 北大核心 2018年第9期2873-2878,共6页 Computer Engineering and Design
基金 国家创新基金项目(435012C26244104350)
关键词 词袋模型 多媒体信息检索 文本检索 图像检索 能量最小准则 bag of word model multimedia information retrieval text retrieval image retrieval minimum energy criterion
  • 相关文献

参考文献4

二级参考文献64

  • 1庄越挺,吴聪苗,吴飞,刘翔.多媒体交叉参照检索系统研究[J].计算机辅助设计与图形学学报,2005,17(4):834-839. 被引量:8
  • 2Wu, Lei,Hoi, Steven C. H.,Yu, Nenghai.Semantics-preserving bag-of-words models and applications. IEEE Transactions on Image Processing . 2010 被引量:2
  • 3Norvig P,Reiman D A,Goldstein DB,et al.2020 Visions[J].Nature,2010,463(7):26-32. 被引量:1
  • 4Bengio Y,Ducharme R,Vincent P,ed al.A neural probabilistic language model[J].Journal of Machine Learning Research,2003,3:1137-1155. 被引量:1
  • 5Mikolov T,Chen K,Corrado G,et al.Efficient estimation of word representations in vector space[C]//Proceedings of Interna- tional Conference on Learning Representations.Scottsdale,Ari- zona:ICLR,2013:1-12. 被引量:1
  • 6Mikolov T,Sutskever I,Chen K,et al.Distributed representa- tions of words and phrases and their compositionality[J].Ad- vances in neural Information Processing Systems,2013:1-9. 被引量:1
  • 7Xu W,Rudnicky A.Can artificial neural networks learn lan- guage models?[J].Computer Science Department Technical Report,2000,1(1):14. 被引量:1
  • 8Mikolov T,Deoras A,Povey D,et al.Strategies for training large scale neural network language models[C]//IEEE Work- shop on Automatic Speech Recognition and Understanding.Washington DC:IEEE,2011:196-201. 被引量:1
  • 9Socher R,Chen D,Manning C,et al.Reasoning with neural tensor networks for knowledge base completion[C]//Proceedings of NIPS.South Lake Tahoe,Nevada US:NIPS,2013:926-934. 被引量:1
  • 10Krizhevsky A,Sutskever I,Hinton G.ImageNet classification with deep convolutional neural networks[C]//Proceedings of Snips.South Lake Tahoe,Nevada US:NIPS,2012:l-9. 被引量:1

共引文献34

同被引文献47

引证文献7

二级引证文献26

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部