Method and system for greatly compressing volume of database, terminal equipment and readable storage medium

A database and volume technology, applied in other database retrieval, other database indexing, special data processing applications, etc., can solve the problems of inability to balance between database volume and read and write performance, discount of overall database efficiency, and high database space occupation. Ease of development, realization and large-scale production, low R&D and production costs, and simple methods

Inactive Publication Date: 2020-08-07
四川创智联恒科技有限公司
View PDF11 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] When storing data in a traditional database, if the compression technology is not used to process the original data in order to ensure efficiency, then when the amount of data is large, the space occupied by the database will be relatively high; if in order to save space, some data compression technology will be used The original data is compressed and then stored. Even though the space occupied by the data can be reduced, the overall efficiency of the database is greatly reduced due to the time-consuming and labor-intensive processing of the compression technology itself.
[0004] Problems in the current method: when importing tree-structured data into the database, there is no balance between database volume and read-write performance

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for greatly compressing volume of database, terminal equipment and readable storage medium
  • Method and system for greatly compressing volume of database, terminal equipment and readable storage medium
  • Method and system for greatly compressing volume of database, terminal equipment and readable storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0040] Example 1 is a very deep tree structure, and the record is similar to the following structure:

[0041] Aaa / Bbb / Ccc=5

[0042] Aaa / Bbb / Ccc / Ddd / Eee=1

[0043] Aaa / Bbb / Ccc / Ddd / Eee / Fff=string

[0044] …

[0045] If the fields such as Aaa and Bbb are relatively long English words, try to process the fields such as Aaa and Bbb as follows:

[0046] 1. Scan the original data and extract all the fields of node names such as Aaa and Bbb;

[0047] 2. Sort according to the word frequency of each node name from high to low;

[0048] 3. After sorting, encode. For example: Aaa is the highest frequency, followed by Bbb, and so on, as shown in Table 1:

[0049]

[0050]

[0051] Table 1 word frequency sorting code table 1

[0052] 4. Construct the optimal tree through the word frequency sorting code table, and create the optimal tree structure table, as shown in Table 2:

[0053] node field value 0 / 1 / 2 5 0 / 1 / 2 / 3 / 4 1 0 / 1 / 2 / 3 / 4 / 5 string … … ...

Embodiment 2

[0056] Embodiment 2 is data in a kind of xml format, as follows:

[0057]

[0058] 1. Scan the original data and extract all the fields of node names such as FAP and PerfMgmt;

[0059] 2. Sort according to the word frequency of each node name from high to low;

[0060] 3. After sorting, perform coding. When the frequency of node names is the same, just set a different code for each node name. The order of sorting does not affect the construction of the optimal tree. For example: FAP is the highest frequency, followed by PerfMgmt, Config, ConfigNumberOfEntity..., and so on, as shown in Table 3:

[0061] node name coding FAP 0 PerMgmt 1 Config 2 ConfigNumberOfEntity 3 … …

[0062] Table 3 Word frequency ranking code table 2

[0063] 4. Construct the optimal tree through the word frequency sorting code table, and create the optimal tree structure table, as shown in Table 4:

[0064] node field value 0 / 1 / 3 1 ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method and a system for greatly compressing the volume of a database, terminal equipment and a readable storage medium, and the method comprises the following steps: S1, traversing all trees, and extracting the names of all nodes and the values of leaf nodes; S2, counting the frequency of node names to obtain a word frequency table; S3, sorting the word frequency tables according to the word frequencies from high to low; S4, creating an optimal tree according to the word frequency tables arranged in the descending order; and S5, storing the word frequency table, the optimal tree and optimal tree paths of all nodes. According to the method, node name fields repeatedly appearing in original data are sorted, the optimal tree is established according to the sorted word frequency table, the original data is stored into the database by adopting the optimal tree structure, and the word frequency table is stored into the database, so that the size of the database is greatly reduced. Compared with the prior art, the method has the advantages that optimal coding is adopted, the node name fields are arranged in a descending order according to the frequency of occurrence, and the size of the whole database can be reduced to the maximum extent.

Description

technical field [0001] The invention relates to the field, and mainly relates to a method, a system, a terminal device and a readable storage medium for greatly compressing the volume of a database. Background technique [0002] In embedded software products, due to cost constraints, limited storage space, and high requirements for software performance, it is necessary to use a database to ensure the read and write performance of data, and it is also necessary for the database to occupy as little space as possible. Therefore, there is a need for a method for compressing the volume of a database that takes into account both software execution efficiency and storage space saving. [0003] When storing data in a traditional database, if the compression technology is not used to process the original data in order to ensure efficiency, then when the amount of data is large, the space occupied by the database will be relatively high; if in order to save space, some data compressio...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/901
CPCG06F16/901G06F16/9027
Inventor 胡建伟颜锐
Owner 四川创智联恒科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products