Method and apparatus for predicate evaluation on compressed variable-length character strings

A predicate evaluation and string technology, which is applied in electrical digital data processing, instruments, calculations, etc., can solve the problems of not making good use of performance advantages, not supporting predicate evaluation, etc., and achieve the effect of enhancing usability and performance

Active Publication Date: 2018-11-27
IBM CORP
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Clearly, this solution does not support predicate evaluation on compressed-encoded data with trailing whitespace-insensitive semantics, and thus does not take advantage of the performance benefits of direct predicate evaluation on compressed data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and apparatus for predicate evaluation on compressed variable-length character strings
  • Method and apparatus for predicate evaluation on compressed variable-length character strings
  • Method and apparatus for predicate evaluation on compressed variable-length character strings

Examples

Experimental program
Comparison scheme
Effect test

no. 1 example

[0025] The following will refer to figure 2 A method of predicate evaluation on compressed variable-length character strings according to a first embodiment of the present invention is described. In this embodiment, the compressed variable-length character string is compressed using a compression encoding method that makes the sequence of the compressed encoded data consistent with the order under the tail space sensitive semantics, and is performed under the tail space insensitive semantics. The predicate evaluation described is illustrated as an example.

[0026] First, the basic idea of ​​this embodiment and the technical terms involved are briefly described.

[0027] Although as mentioned above, variable-length strings have different orderings under trailing whitespace-insensitive semantics and trailing whitespace-sensitive semantics, so it is difficult to find an appropriate compression encoding that preserves order for both semantics way, but actually most variable-le...

no. 2 example

[0076] In the method for predicate evaluation on compressed variable-length strings according to the first embodiment, for a variable-length string predicate, its false positive range and false negative range are respectively determined, and then according to the compressed variable Whether the length string falls within the compressed encoding's false-positive and false-negative ranges determines the result of the predicate evaluation. However, in fact, in many cases, it is not necessary to determine the false negative range and then determine whether the compressed variable-length character string falls within the false negative range of the compression encoding (ie steps S21, S24 and processing in S26). Hereinafter, an example will be described.

[0077] For example, according to the above description, it can be seen that the range of false positives is greater than or equal to the range of false negatives, so if the variable-length string of compressed encoding does not f...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

Methods, devices and computer program products for performing predicate evaluation on a compressed variable length character string are disclosed. Aspects include determining upper and lower boundaries of a false positive range for a predicate, the false positive range for the predicate being a range in which a result of the predicate evaluation may lead to a false positive error. Aspects also include compression-encoding the upper and lower boundaries of the false positive range and performing a first comparison for comparing the compressed variable length character string with the compression-encoded upper and lower boundaries of the false positive range. Aspects further include determining a result of the predicate evaluation based on at least a result of the first comparison processing.

Description

technical field [0001] The present invention relates to the field of relational databases, and more particularly, to a method and device for performing predicate evaluation on compressed variable-length character strings. Background technique [0002] Recent research has shown that when performing query processing in relational databases, performing predicate evaluation directly on the compressed data stored in the database, rather than decompressing the compressed data and then performing predicate evaluation on the decompressed data can greatly improve query performance. Processing performance. In this field, a predicate that sequentially includes an argument, an operation relational operator or a logical operator, and a connection constant and whose argument does not include a function, expression, or clause is usually called a simple predicate, or a predicate for short. In database language, the operational relational or logical operators may include "=", ">", ">=...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30G06F17/27
CPCG06F16/90344
Inventor 陈星刘少蓉
Owner IBM CORP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products