A speed-up query processing method based on IPC encoding

A query processing method and coding technology, applied in the information field, that addresses the increased user waiting time caused by slow online query processing and speeds up that processing.

Active Publication Date: 2020-02-11
INST OF INFORMATION ENG CHINESE ACAD OF SCI

AI Technical Summary

Problems solved by technology

Although IPC encoding achieves a good compression rate, its online query processing time increases user waiting time, which limits its further application.

Examples

Embodiment Construction

[0026] The present invention will be further described below through specific embodiments and accompanying drawings.

[0027] The speed-up algorithm provided by the present invention consists of three parts. The first part is the restore-and-skip algorithm used when decompressing an IPC-encoded index file. The second part is a fast intersection algorithm between a decompressed sequence list and an IPC-encoded compressed index list, used when processing Boolean intersection queries. The third part is an algorithm that, when processing ranking (sorting) queries, quickly obtains the corresponding frequency values according to the list of specific value positions (COlist) that need to be decompressed.

[0028] 1. Restore and skip

[0029] During encoding, IPC encoding expands the list into a binary search tree (binary sort tree), and during storage the tree is stored densely in pre-order. During restoration, all values must likewise be restored according to the structure of the binary search tree. Considering the orde...
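The pre-order layout described in [0029] can be illustrated with a minimal sketch. It is not the patented method and does not implement the bit-level IPC codes: it assumes plain integers and a balanced tree built by always taking the middle element as the root, so that subtree sizes follow from the index range alone. The names `to_preorder`, `restore_all`, and `value_at` are hypothetical.

```python
def to_preorder(sorted_ids):
    """Lay out a sorted list in pre-order of a balanced binary search tree.

    Assumption: the root of every range [lo, hi) is its middle element.
    """
    out = []

    def build(lo, hi):
        if lo >= hi:
            return
        mid = (lo + hi) // 2
        out.append(sorted_ids[mid])   # root first
        build(lo, mid)                # then the left subtree
        build(mid + 1, hi)            # then the right subtree

    build(0, len(sorted_ids))
    return out


def restore_all(pre):
    """Restore the full sorted list by walking the tree structure."""
    out = [None] * len(pre)

    def walk(pos, lo, hi):
        if lo >= hi:
            return
        mid = (lo + hi) // 2
        out[mid] = pre[pos]
        walk(pos + 1, lo, mid)
        # The left subtree holds (mid - lo) nodes, so the right subtree
        # starts right after it in the pre-order array.
        walk(pos + 1 + (mid - lo), mid + 1, hi)

    walk(0, 0, len(pre))
    return out


def value_at(pre, rank):
    """Restore only the value at sorted position `rank`.

    Subtrees that cannot contain `rank` are skipped by jumping over them.
    """
    pos, lo, hi = 0, 0, len(pre)
    while lo < hi:
        mid = (lo + hi) // 2
        if rank == mid:
            return pre[pos]
        if rank < mid:
            pos, hi = pos + 1, mid                      # descend left
        else:
            pos, lo = pos + 1 + (mid - lo), mid + 1     # skip left subtree
    raise IndexError(rank)
```

With real variable-length codes the size of a subtree in bits is not known from the index range alone, so a practical implementation would need some additional way to locate the next subtree; the sketch sidesteps this by using fixed-size entries.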

Abstract

The invention relates to a speed-up query processing method based on IPC coding. The method treats an index file under IPC coding as a tree-shaped skip-list file and implements an algorithm that reads quickly by skipping sub-trees. When a Boolean intersection query is processed, the monotonicity of the list is used to decide whether a sub-tree can be skipped; the skip operations save a large amount of time and thereby increase the speed of online Boolean queries. When a ranking (sorting) query is processed, the common TAAT (term-at-a-time) processing mode and the common continue mechanism are used: the corresponding values in the frequency index file are obtained quickly from the positions of the ID-list intersection results, and the processing speed of online ranking queries is increased by skipping all sub-trees that do not need to be accessed. Query speed (for both Boolean and ranking queries) is optimized according to the characteristics of IPC coding, improving the user experience of a retrieval system.
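The skip decision based on monotonicity can be sketched as follows. This is an illustration under stated assumptions, not the patented algorithm: the compressed list is modelled as plain integers stored in pre-order of a balanced binary search tree (middle element as root), and the function name `intersect` and its parameters are hypothetical.

```python
from bisect import bisect_left


def intersect(probe, pre):
    """Intersect a sorted, decompressed list `probe` with a list `pre`
    stored in pre-order of a balanced binary search tree.

    Because values are monotone, everything in a node's left subtree is
    smaller than the node and everything in its right subtree is larger.
    A subtree is skipped entirely when no probe value can fall inside it.
    """
    result = []

    def walk(pos, lo, hi, a, b):
        # pre[pos : pos + (hi - lo)] holds the subtree for ranks [lo, hi);
        # probe[a:b] are the candidates that could still match inside it.
        if lo >= hi or a >= b:
            return                        # empty subtree or nothing to look for
        mid = (lo + hi) // 2
        v = pre[pos]
        k = bisect_left(probe, v, a, b)   # split candidates around v
        walk(pos + 1, lo, mid, a, k)      # candidates smaller than v
        if k < b and probe[k] == v:
            result.append(v)
            k += 1
        walk(pos + 1 + (mid - lo), mid + 1, hi, k, b)  # candidates larger than v

    walk(0, 0, len(pre), 0, len(probe))
    return result


# The sorted list [2, 5, 9, 14, 20] laid out in pre-order is [9, 5, 2, 20, 14].
print(intersect([5, 6, 14, 21], [9, 5, 2, 20, 14]))
```

Results come out in increasing order because the walk visits the left subtree, then the node, then the right subtree.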

Description

Technical field

[0001] The invention belongs to the field of information technology, and in particular relates to a speed-up query processing method based on IPC coding.

Background

[0002] Most current retrieval systems use an inverted index as the data structure for processing user queries. The inverted index file (IF) is usually relatively large and generally cannot be held entirely in memory, so in practice the index file must be compressed with some encoding. Generally speaking, encodings with higher compression ratios are slower in online query processing, so every encoding seeks a balance between space and processing time.

[0003] There are two basic strategies for encoding ID index files: one encodes the original increasing values directly, and the other encodes the difference between two consecutive increasing values (delta encoding). Generally speaking, the difference value is much smaller ...
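As a small illustration of the delta encoding strategy mentioned in [0003], the sketch below converts an increasing ID list into gaps and back. The function names are hypothetical, and the choice of measuring the first gap from 0 is an assumption made for this example.

```python
def delta_encode(ids):
    """Replace each ID in an increasing list by its gap to the previous ID."""
    gaps = []
    prev = 0                  # assumption: the first gap is measured from 0
    for x in ids:
        gaps.append(x - prev)
        prev = x
    return gaps


def delta_decode(gaps):
    """Rebuild the original IDs by accumulating the gaps."""
    ids = []
    total = 0
    for g in gaps:
        total += g
        ids.append(total)
    return ids


print(delta_encode([3, 7, 8, 15]))   # [3, 4, 1, 7]
```

The gaps are smaller numbers than the IDs themselves, which is what allows a subsequent integer code to spend fewer bits on them.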

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F16/24, G06F16/22
CPC: G06F16/2246, G06F16/245
Inventor: 付玺, 王斌, 李鹏, 王卿, 李雄, 徐杰, 马宏远
Owner: INST OF INFORMATION ENG CHINESE ACAD OF SCI