4 articles found
Optimizing partitioning strategies for faster inverted index compression (cited by: 2)
1
Authors: Xingshen SONG, Yuexiang YANG, Yu JIANG, Kun JIANG. Frontiers of Computer Science (SCIE, EI, CSCD), 2019, Issue 2, pp. 343-356 (14 pages)
The inverted index is a key component for search engines to manage billions of documents and quickly respond to users' queries. Whereas substantial effort has been devoted to reducing space occupancy and improving decoding speed, the encoding speed when constructing the index has been overlooked. Partitioning the index in alignment with its clustered distribution can effectively minimize the compressed size while accelerating its construction procedure. In this study, we introduce compression speed as one criterion to evaluate compression techniques, and thoroughly analyze the performance of different partitioning strategies. Optimizations are also proposed to enhance state-of-the-art methods with faster compression speed and more flexibility to partition an index. Experiments show that our methods offer a much better compression speed, while retaining an excellent space occupancy and decompression speed.
Keywords: inverted index; index compression; optimal partition; approximation algorithm
A Flexible Space-Time Tradeoff on Hybrid Index with Bicriteria Optimization (cited by: 1)
2
Authors: Xingshen SONG, Yuexiang YANG, Yu JIANG. Tsinghua Science and Technology (SCIE, EI, CAS, CSCD), 2019, Issue 1, pp. 106-122 (17 pages)
Inverted indexes are widely adopted in the vast majority of information systems. Growing requirements for efficient query processing have motivated the development of various compression techniques with different space-time characteristics. Although a single encoder yields a relatively stable point on the space-time tradeoff curve, flexibly moving its characteristic along the curve to fit different information retrieval tasks can be a better way to prepare the index. Recent research has proposed integrating different encoders within the same index, namely, exploiting access skewness by compressing frequently accessed regions with faster encoders and rarely accessed regions with succinct encoders, thereby improving efficiency while minimizing the compressed size. However, these methods are either inefficient or result in coarse granularity. To address these issues, we introduce the concept of bicriteria compression, which aims to formalize the problem of optimally trading the compressed size and query processing time for an inverted index. We also adopt a Lagrangian relaxation algorithm to solve this problem by reducing it to a knapsack-type problem, which works in O(n log n) time and O(n) space, with a negligible additive approximation. Furthermore, this algorithm can be extended via dynamic programming in pursuit of improved query efficiency. We perform an extensive experiment to show that, given a bounded time/space budget, our method can optimally trade one for another with more efficient indexing and query performance.
Keywords: inverted index; bicriteria compression; Lagrangian relaxation
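The Lagrangian relaxation idea mentioned in the abstract can be illustrated with a small sketch. This is not the paper's O(n log n) algorithm; it is a simplified bisection on the multiplier, assuming each block of the index offers a few hypothetical (compressed size, decode time) encoder options and that the goal is to minimize total size under a total time budget. The names `pick` and `lagrangian_select` and the sample numbers are invented for illustration.

```python
from typing import List, Tuple

# One encoder option for one block: (compressed_size, decode_time).
Option = Tuple[int, int]


def pick(blocks: List[List[Option]], lam: float) -> Tuple[List[int], int, int]:
    """For a fixed multiplier, choose per block the option minimizing size + lam * time."""
    choice = [
        min(range(len(opts)), key=lambda i: opts[i][0] + lam * opts[i][1])
        for opts in blocks
    ]
    size = sum(blocks[b][i][0] for b, i in enumerate(choice))
    time = sum(blocks[b][i][1] for b, i in enumerate(choice))
    return choice, size, time


def lagrangian_select(
    blocks: List[List[Option]], time_budget: int, iters: int = 60
) -> Tuple[List[int], int, int]:
    """Approximately minimize total size subject to total time <= time_budget."""
    # With lam = 0 every block takes its smallest encoding; stop if that already fits.
    choice, size, time = pick(blocks, 0.0)
    if time <= time_budget:
        return choice, size, time

    # Grow the multiplier until the relaxed solution respects the budget.
    lo, hi = 0.0, 1.0
    choice, size, time = pick(blocks, hi)
    while time > time_budget and hi < 1e12:
        hi *= 2
        choice, size, time = pick(blocks, hi)
    if time > time_budget:
        raise ValueError("time budget is infeasible for these options")

    # Bisect on lam, keeping the last feasible selection found.
    for _ in range(iters):
        mid = (lo + hi) / 2
        c, s, t = pick(blocks, mid)
        if t <= time_budget:
            hi, choice, size, time = mid, c, s, t
        else:
            lo = mid
    return choice, size, time


# Hypothetical data: two blocks, each with a compact-but-slow and a large-but-fast option.
blocks = [[(10, 5), (20, 1)], [(8, 4), (12, 2)]]
result = lagrangian_select(blocks, time_budget=7)
```

Because the relaxation only reaches selections on the convex hull of the size/time tradeoff, it can miss the exact optimum for some budgets; with the sample data and a budget of 6 it returns total size 32 although a selection of size 28 exists. That gap is the kind of additive approximation the abstract refers to.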
K-ary search tree revisited: improving construction and intersection efficiency
3
Authors: Xingshen SONG, Jinsheng DENG, Fengcai QIAO, Kun JIANG. Frontiers of Computer Science (SCIE, EI, CSCD), 2022, Issue 3, pp. 215-217 (3 pages)
1 Introduction. Recently, the k-ary search tree has been gaining popularity as an infrastructure component in search engines. Due to its intrinsic cache- and SIMD-friendly capabilities, the k-ary search tree is efficient in compression and query processing when combined with an inverted index [1-3]. In a k-ary tree, each node is composed of k-1 entries, which evenly partition its range into k subranges (subnodes). By aligning the node size with the buffer size of the faster cache, the data is expected to be better utilized before being evicted, and fewer cache misses are triggered as well.
Keywords: cache; tree; intersection
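The node structure described above (k-1 entries splitting a range into k subranges) can be sketched as a search over a plain sorted array. This is a minimal illustration under stated assumptions, not the paper's implementation: it computes the separators on the fly instead of storing a linearized tree, compares them one at a time rather than with SIMD, and the name `kary_search` is hypothetical.

```python
from typing import Sequence


def kary_search(keys: Sequence[int], target: int, k: int = 4) -> int:
    """Return an index of target in the sorted sequence keys, or -1 if absent."""
    if k < 2:
        raise ValueError("k must be at least 2")
    lo, hi = 0, len(keys)  # current search window is keys[lo:hi]
    while lo < hi:
        width = hi - lo
        new_lo, new_hi = lo, hi
        # The k-1 separators split the window into k roughly equal subranges.
        for j in range(1, k):
            pos = lo + (width * j) // k
            if keys[pos] == target:
                return pos
            if keys[pos] < target:
                new_lo = pos + 1  # target lies to the right of this separator
            else:
                new_hi = pos      # target lies to the left; later separators are larger
                break
        lo, hi = new_lo, new_hi
    return -1


# Usage: look up values in a sorted list with different fan-outs.
keys = list(range(0, 100, 3))
found = kary_search(keys, 42, k=4)
missing = kary_search(keys, 43, k=4)
```

With k = 2 this reduces to ordinary binary search; larger k trades more comparisons per step for fewer steps, which is where a cache-line-sized node and SIMD comparison would pay off in a real implementation.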
Preparation and self-assembly properties of the surfactant sodium maleopimaric acid octadecylamide dicarboxylate
4
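Authors: ZHANG Wenjing (张文静), SUN Xingshen (孙兴燊), ZHAI Zhaolan (翟兆兰), WANG Dan (王丹), SHANG Shibin (商士斌), SONG Zhanqian (宋湛谦). Chemistry and Industry of Forest Products (《林产化学与工业》; CAS, CSCD, PKU Core), 2024, Issue 4, pp. 51-60 (10 pages)
Using rosin as the raw material, sodium maleopimaric acid octadecylamide dicarboxylate (C18MPA2Na) was synthesized via Diels-Alder (D-A) addition, amidation, and acid-base neutralization, and its structure was confirmed by FT-IR, 1H NMR, and mass spectrometry. The surface activity of C18MPA2Na was studied by the surface tension method and the Nile red (NR) fluorescence probe method. Its critical micelle concentration (cmc) was 0.29 mmol/L and the surface tension at the cmc (γ_cmc) was 41.11 mN/m, indicating good surface activity. C18MPA2Na was combined with cetyltrimethylammonium bromide (CTAB) to prepare a class of viscoelastic solutions, and the microstructure and rheological behavior of the mixed system were studied by cryogenic transmission electron microscopy (Cryo-TEM) and rheological measurements; the results showed that the aggregates in the viscoelastic solutions were wormlike micelles. With the CTAB concentration in the mixed system fixed at 80 mmol/L, the zero-shear viscosity (η_0) first increased and then decreased as the C18MPA2Na concentration increased, reaching a maximum of 5.67×10^4 mPa·s at 15 mmol/L C18MPA2Na. With the ratio of C18MPA2Na to CTAB fixed at 3:16, η_0 first rose rapidly and then leveled off as the total concentration of C18MPA2Na and CTAB increased, and the power-law exponent of η_0 with respect to concentration was 11.7. With C18MPA2Na at 15 mmol/L and CTAB at 80 mmol/L, η_0 first increased and then decreased as the pH decreased, reaching a maximum of 7.67×10^4 mPa·s at pH 7.13. The study shows that the rigid skeleton of rosin can promote surfactant self-assembly.
Keywords: rosin; surfactant; viscoelastic solution; wormlike micelles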