期刊文献+

基于改进潜在语义分析的跨语言检索 被引量:14

Cross-Language Information Retrieval Based on Improved Latent Semantic Indexing
下载PDF
导出
摘要 该文采用基于SVD和NMF矩阵分解相结合的改进潜在语义分析的方法为生物医学文献双语摘要进行建模,该模型将英汉双语摘要映射到同一语义空间,不需要外部词典和知识库,建立不同语言之间的对应关系,便于在双语空间中进行检索。该文充分利用医学文献双语摘要语料中的锚信息,通过不同的k值构建多个检索模型,计算每个模型的信任度,使得多个模型都对查询和文本的相似度做出贡献。在语义空间上进行项与项、文本与文本、项与文本之间的相似度计算,实现了双语摘要的跨语言检索。 Focused on the cross language information retrieval, this paper applies the improved Latent Semantic Indexing (LSI)by combining SVD and NMF to construct the semantic space for the abstracts of biomedical literatures. It maps the Chinese document and English document into the same semantic space without external dictionary and knowledge base and for the bilingual information retrieval. The proposed method also utilizes the anchor information included the abstracts of biomedical literatures and builds a series models corresponding to different K dimensions, all contributing to the similarity between query and documents with different credibility. As a result, the similarities of term to term, document to document and term to document are calculated forthe bilingual information retrieval of biomedical abstract. The experiment gets a better result.
作者 宁健 林鸿飞
出处 《中文信息学报》 CSCD 北大核心 2010年第3期105-111,共7页 Journal of Chinese Information Processing
基金 国家自然科学基金资助项目(60673039 60973068) 国家863高科技计划资助项目(2006AA01Z151) 教育部留学人员归国科研启动基金 教育部博士点基金资助(20090041110002)
关键词 计算机应用 中文信息处理 改进潜在语义分析 语义空间 跨语言检索 SVD NMF computer application Chinese information processing improved latent semantic indexing semantic spacel cross language IR SVD NMF
  • 相关文献

参考文献16

  • 1Kazuaki Kishida.Technical Issues of Cross-Language Information Retrieval:a Review[J],Information Processing and Management,2005,41(3):433-455. 被引量:1
  • 2Gina-Anne Levowa,Douglas W.Oardb,Philip Resnikc.Dictionary-based techniques for cross-language information retrieval[J].Information Processing and Management,2005,41(3):523-547. 被引量:1
  • 3Dong Zhou,Mark Truran.A Graph-Based Technique for Resolving Ambiguity in Query Translation Candidates.Symposium on Applied Computing[C]// Proceedings of the 2008 ACM symposium on Applied computing,Fortaleza,Ceara,Brazil:ACM New York,USA,2008:1566-1573. 被引量:1
  • 4Dong Zhou,Mark Truran.A Hybrid Technique for English-Chinese Cross Language Information Retrieval[J].ACM Transactions on Asian Language Information Processing (TALIP),2008,7(2):l-35. 被引量:1
  • 5Guihong Cao,Jianfeng Gao.Extending query translation to cross-language query expansion with markov chain models EC]]// Proceedings of the sixteenth ACM conference on Conference on information and knowledge management,2007:351-360. 被引量:1
  • 6J.Y.Nie,M.Simard,P.Cross-Language Information Retrieval based on Parallel Texts and Automatic Mining of Parallel Texts in the Web[C]// Proceedings of SIGIR'99,Berkeley,1999:74-81. 被引量:1
  • 7GAO JF,Nie JY.Trec-9 CLIR Experiments at MSRCN[C]// Proceeding of the Ninth Text Retrieval Conference.USA,2000:343-353. 被引量:1
  • 8Susan T.Dumais,Furnas G W.Indexing by Latent Semantic Analysis[J].Journal of the American Society for Information Science,1990,41(6):391-407. 被引量:1
  • 9Michael L.Littman,Susan T.Dumais,Thomas K.Landauer.Automatic cross-language retrieval using latent semantic indexing[C]// Proc.of SIGIR'96,1996:16-23. 被引量:1
  • 10Berry,M.W.,Young,P.G.Using Latent Semantic Indexing for Multilingual Information Retrieval[J],Computers and Humanities,1995,29(6):413-429. 被引量:1

二级参考文献22

共引文献3

同被引文献103

引证文献14

二级引证文献63

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部