
Well-balanced successive simple-9 for inverted lists compression

  • Xi'an Jiaotong University
  • National University of Defense Technology

Research output: Contribution to journal, Article, peer-reviewed

Abstract

The growing volume of information on the Internet, combined with thousands of user queries per second, poses major challenges for index updating and query processing in search engines. Index compression is partly responsible for the performance that existing search engines achieve. Choosing an index compression algorithm means weighing three factors: compression ratio, compression speed, and decompression speed. In this paper, we study the well-known Simple-9 compression algorithm, which performs many branch, table-lookup, and data-transfer operations for each 32-bit machine word it processes. To improve the compression and decompression performance of Simple-9, we propose a successive storage structure and processing metric that compresses two successive Simple-9 encoded sequences of integers in a single data processing procedure, hence the name Successive Simple-9 (SSimple-9). In essence, the algorithm shortens the sequence of branch, table-lookup, and data-transfer operations performed when compressing an integer sequence. More precisely, we first present the data storage format and mask table of the SSimple-9 algorithm. Then, for each mode in the mask table, we design and hard-code the main steps of the compression and decompression processes. Finally, analysis and comparison of experimental results on simulated and TREC datasets show that the proposed SSimple-9 algorithm speeds up both compression and decompression.
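For readers unfamiliar with the baseline, the following is a minimal sketch of classic Simple-9, not of the SSimple-9 scheme proposed in the paper. It packs integers into 32-bit words, each with a 4-bit selector and 28 data bits, and shows the per-word mode search and table lookup that the abstract refers to. The names `MODES`, `simple9_encode`, and `simple9_decode` are hypothetical, and placing the selector in the high 4 bits with the first value in the lowest data bits is an assumed layout; implementations differ on this.

```python
# Sketch of baseline Simple-9 (assumed bit layout; not the paper's SSimple-9).
# Each entry is (values per word, bits per value); the list index is the selector.
MODES = [(28, 1), (14, 2), (9, 3), (7, 4), (5, 5), (4, 7), (3, 9), (2, 14), (1, 28)]


def simple9_encode(values):
    """Pack non-negative integers below 2**28 into a list of 32-bit words."""
    words = []
    i = 0
    while i < len(values):
        # Try the densest mode first; this search is the branching cost per word.
        for selector, (count, bits) in enumerate(MODES):
            chunk = values[i:i + count]
            if len(chunk) == count and all(0 <= v < (1 << bits) for v in chunk):
                word = selector << 28  # selector in the high 4 bits (assumption)
                for k, v in enumerate(chunk):
                    word |= v << (k * bits)  # first value in the lowest bits
                words.append(word)
                i += count
                break
        else:
            raise ValueError("value out of range for Simple-9: %r" % values[i])
    return words


def simple9_decode(words):
    """Unpack words produced by simple9_encode back into integers."""
    out = []
    for word in words:
        count, bits = MODES[word >> 28]  # table lookup on the selector
        mask = (1 << bits) - 1
        for k in range(count):
            out.append((word >> (k * bits)) & mask)
    return out
```

A round trip such as `simple9_decode(simple9_encode(gaps))` should return the original list of gaps. Because every word needs its own selector lookup and mode-dependent loop, handling two words per step, as the abstract describes for SSimple-9, is one plausible way to reduce that overhead.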

Original language: English
Pages (from-to): 1416-1424
Number of pages: 9
Journal: IEICE Transactions on Information and Systems
Volume: E100D
Issue: 7
DOI
Publication status: Published - Jul 2017
