摘要
索引系统是搜索引擎的数据大本营,在搜索引擎发展早期,能够索引的网页数量代表了整个行业的技术发展水平。Lucene全文检索技术是信息检索领域广泛使用的基本技术,它是一个优秀的开源全文本搜索技术框架,本文详细分析了索引系统相关技术和Lucene的索引系统结构。
Index system is the data center of the search engine, at the beginning of the search engine, the number of the pages that can be indexed to represent the technology level of the whole industry. Lucene fulltext retrieval, as a basic skill, is widely used in the field of information retrieval; it is an excellent open- source full- text search technology framework. The paper analyzed Lucene's indexing system structure in detail and gave some introduction about the related technology of index system.
出处
《现代情报》
2009年第7期169-171,共3页
Journal of Modern Information
基金
北京市优秀人才专项项目(20071A0501600220)