期刊文献+

二元互关联后继树精简索引模型研究 被引量:2

Research on Streamline Inter-relevant Successive Trees
下载PDF
导出
摘要 全文检索领域的关键问题是索引模型以及索引的创建与检索算法.基于二元互关联后继树模型,提出一个实用性能好的后继节点有序的后继树精简索引模型(SIRST),并给出此模型下索引的创建与检索算法.通过将该模型与使用广泛的倒排文件模型(IF)进行比较,表明SIRST的检索效率远远高于IF,同时,随着文本集规模越来越大,SIRST的创建效率优势愈发明显. The key question of full-text retrieval domain is the index model as well as the index building and retrieval algorithms. In this paper, a novel index model named Streamline Inter-Relevant Successive Trees ( SIRST ) is proposed, which has sorted successive node and streamline node information based on the index model of Inter-Relevant Successive Trees ( IRST), and its building and re- trieval algorithms is presented. The performance study, comparing the cost of the index building and retrieval with the traditional Inverted Files (IF) model and SIRST under various text sets and query strings, shows that SIRST is outperforms them.
出处 《小型微型计算机系统》 CSCD 北大核心 2011年第2期286-290,共5页 Journal of Chinese Computer Systems
基金 国家"八六三"高技术研究发展计划项目(2007AA01Z403)资助
  • 相关文献

参考文献11

  • 1Justin Zobel, Alistair Moffat. Inverted files for text search engines [J]. ACM Computing Surveys (CSUR), 2006, 38(2). 被引量:1
  • 2Vo Ngoc Anh, Alistair Moffat. Inverted index compression using word-aligned binary codes[J]. Information Retrieval,2005, 8( 1 ) : 151-166. 被引量:1
  • 3Boitsov L M. Using signature hashing for approximate string matching[ J]. Computational Mathematics and Modeling,2002, 13 (3) : 314-326. 被引量:1
  • 4申展,江宝林,陈祎,唐磊,胡运发.全文检索模型综述[J].计算机科学,2004,31(5):61-64. 被引量:12
  • 5Manber U, Myers E. Suffix arrays: a new method for on-line suing scarches[C]. In: Proc. Of the FISTREE Ann. ACM-SIAM Syrup. on Discrete Algorithms, 1990, 319-327. 被引量:1
  • 6Rajasekaran S, Luo J, Nick H, et al. Efficient algorithms for similarity search[J]. Journal of Combinatorial Optimization,2001, 5 (1): 125-132. 被引量:1
  • 7Manber U, Myers G. Suffix arrays: a new method for on-line suing searches[J]. SIAM Journal on Computing. 1993, 22(5): 935-948. 被引量:1
  • 8Tao Xiao-peng, Hu Yun-fa, Zhou Shui-geng. Subsequent array: a new full text index [ C ]. Proceeding World Multiconference on Systemics, Cybernetics and Informatics, Florida, USA, 2001:551- 556. 被引量:1
  • 9周水庚,胡运发,关佶红.基于邻接矩阵的全文索引模型(英文)[J].软件学报,2002,13(10):1933-1942. 被引量:10
  • 10刘学文,陶晓鹏,于玉,胡运发.一种全新的全文索引模型——后继数组模型[J].软件学报,2002,13(1):150-158. 被引量:11

二级参考文献42

  • 1王静,孟小峰,王珊.基于区域划分的XML结构连接[J].软件学报,2004,15(5):720-729. 被引量:35
  • 2[1]Zeng Haiquan, Shen Zhan, Hu Yunfa. Mining Sequence Pattern from Time Series Based on Inter-Relevant Successive Trees Model. In:Proc. of 9th. Intl. Conf. on Rough Sets, Fuzzy Sets,Data Mining and Granular Computing (RSFDGrC'2003), LNCS/LNAI, Spring-Verlag, Chongqing, China, 2003 被引量:1
  • 3[3]Knuth D E. The Art of Computer Programming, Sorting and Searching. 1st edition. Addision-Wesley Pub. Co. , 1973 被引量:1
  • 4[4]Weiner P. Linear pattern matching algorithm. In: Proc. 14th IEEE Symposium on Switching and Automata Theory, 1973.1~11 被引量:1
  • 5[5]Manber U,Myers E. Suffix arrays: A new method for on-line string searches. In: Proc. of the FISTREE Ann. ACM-SIAM Symp. on Discrete Algorithms, 1990. 319~327 被引量:1
  • 6[6]Hu Yunfa, Zhou Shuigeng. A New Model of Chinese Full-text databases. In: Proc. World Multiconference on Systemics,Cybernetics and Informatics, Florida, USA, 2001. 528~533 被引量:1
  • 7[7]Tao Xiaopeng, Hu Yunfa, Zhou Shuigeng. Subsequent Array: A New Full Text Index. In: Proc. World Multiconference on Systemics, Cybernetics and Informatics, Florida, USA, 2001. 551~556 被引量:1
  • 8[11]Zobel J, Moffat A, Ramamohanarao K. Inverted files versus signature files for text indexing. Transactions on Database Systems,1998,23(4): 453~490 被引量:1
  • 9[12]Grossi R, Vitter J S. Compressed suffix arrays and suffix trees with applications to text indexing and string matching (extendedabstract). STOC 2000. 397~406, 1999 被引量:1
  • 10[14]Moffat A, Zobel J. Self-Indexing Inverted Files for Fast Text Retrieval. ACM Transactions on Information System, 1996, 14(4) :349~379 被引量:1

共引文献29

同被引文献7

引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部