Method and system of optimizing tree-structured data with simple path characteristics

A simple path and tree structure technology, applied in the field of data processing, can solve the problems of insufficient coding and query efficiency, low system function and use efficiency, etc., and achieve the effect of simple structure, easy expression, and high storage efficiency

Active Publication Date: 2017-08-04
INST OF COMPUTING TECH CHINESE ACAD OF SCI
View PDF2 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0015] 5. In the key-value pair mentioned in 2 above, the value of the key can only be of type (string)
[0045] 2) NoSQL data processing system is not efficient enough to encode and query data
These will lead to some additional restrictions on the function and use of the system and will cause its execution to be inefficient

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system of optimizing tree-structured data with simple path characteristics
  • Method and system of optimizing tree-structured data with simple path characteristics
  • Method and system of optimizing tree-structured data with simple path characteristics

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0105] In view of the above deficiencies in the prior art, the present invention redesigns and implements a semi-structured data processing system STEED. The following introduces the overall architecture of the STEED system and briefly introduces the functional requirements of each module, then analyzes the interface definitions between these modules, and briefly explains how STEED internally processes and stores data.

[0106] Such as image 3 As shown, STEED mainly consists of three modules:

[0107] (1) Data analysis module:

[0108] Read text data and parse it into row or column binary format data, which is stored in the data storage module. In the process of data parsing, a syntax tree is dynamically generated to store the definition of semi-structured data. When parsing the data in JSON format, because it does not define corresponding data format (syntax tree, schema tree), so the present invention can only dynamically generate the definition of data format in the pro...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a processing method and system of optimizing tree-structured data with simple path characteristics. The method includes the steps that 1, a simple path is set, wherein the path which has one many-valued domain at most from a root node to a leaf node in a syntax tree defined by data is called the simple path; 2, information of leaf node on the simple path is stored in flat parallel data to obtain a complete nested structure on the path; 3, in the process of querying line data, the line data needs to be assembled into row structure data, and hierarchical relation in the data can be simplified by simple path optimization, wherein the path from the root node to the leaf node can be represented through the leaf node, and all non-leaf nodes in the path can be ignored. By analyzing common semi-structured data and defining the concept of the simple path, STEED data storage, line data assembly and the querying process are optimized through the simple path, and related operation and function efficiency are improved.

Description

technical field [0001] The invention relates to the technical field of data processing, in particular to a method and system for optimizing tree structure data by using simple path features. Background technique [0002] With the development of computer network and big data processing technology, traditional relational data can no longer meet the requirements of data definition and use in the network and big data environment, and semi-structured data represented by JSON and Protocol Buffers because It can not only fully express the object (Object) data in the programming language, but also modify and expand the original data format according to the format change of the data, so it is widely used in the actual environment. [0003] Definition of tree-structured data: [0004] T value =T primitive | T object | T array [0005] T primitive =string|number|boolean|null [0006] [0007] [0008] Record=T object [0009] As shown above, the tree structure data is d...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06F17/22
CPCG06F16/81G06F16/8365G06F16/8373G06F40/14
Inventor 陈世敏王智义
Owner INST OF COMPUTING TECH CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products