A compression indexing method and device for a character string sequence
A string and string group technology, applied in the field of data management, can solve the problems such as the decrease of the capacity of the branch node of the coding index, the increase of the number of branch nodes and the search complexity, the excessively long difference prefix length of the underlying leaf nodes, etc., so as to reduce the index The number of nodes, reducing the complexity of index search, and improving the effect of capacity
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0102] image 3 A flow chart of a compression indexing method for a character string sequence provided by an embodiment of the present invention, consisting of figure 2 The compressed indexing device 10 shown performs, as image 3 As shown, the compression indexing method of the string sequence may include the following steps:
[0103] S101: Obtain a character string sequence, where the character string sequence includes more than one character string arranged in an orderly manner.
[0104] Optionally, the string sequence can be read directly from the columnar database.
[0105] It should be noted that more than one character strings arranged in an orderly manner can be arranged in ascending order of the dictionary, or in descending order of the dictionary. This embodiment of the present invention does not limit this, and the present invention only takes the sequence of character strings arranged in ascending order of the dictionary as an example. The compressed index meth...
Embodiment 2
[0193] Figure 8 A structural diagram of a compression indexing device 20 provided for an embodiment of the present invention, used to implement the method described in Embodiment 1, as Figure 8 As shown, the device may include:
[0194] The acquiring unit 201 is configured to acquire a character string sequence, where the character string sequence includes more than one character string arranged in an orderly manner.
[0195] The grouping unit 202 is configured to perform grouping processing on the character string sequence according to the difference prefix length of each character string in the character string sequence acquired by the acquisition unit 201, and obtain M character string groups, so that each character The difference prefix length of the first character string in the string group is the shortest within the preset string range, wherein, the M is an integer greater than or equal to 1, each character string group contains at least one character string, and eac...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com