Processing method and system for tree structured data

A tree-structured data processing technology, applied in the field of data processing, that addresses problems such as inefficient encoding and querying and limited system functionality and usability, and achieves a simple structure, easy expression, and high storage efficiency.

Active Publication Date: 2017-08-25
INST OF COMPUTING TECH CHINESE ACAD OF SCI

AI Technical Summary

Problems solved by technology

[0015] 5. In the key-value pairs mentioned in item 2 above, the value of the key can only be of type string.
[0045] 2) NoSQL data processing system is not efficient en

Examples

Embodiment Construction

[0116] In view of the above deficiencies in the prior art, the present invention redesigns and implements a semi-structured data processing system, STEED. The following first introduces the overall architecture of STEED and the functional requirements of each module, then analyzes the interfaces defined between these modules, and finally explains briefly how STEED processes and stores data internally.

[0117] As shown in Figure 3, STEED mainly consists of three modules:

[0118] (1) Data parsing module:

[0119] This module reads text data and parses it into row-format or column-format binary data, which is stored in the data storage module. During parsing, a syntax tree is dynamically generated to store the definition of the semi-structured data. JSON data does not come with a definition of its data format (syntax tree, schema tree), so when parsing JSON the present invention can only generate the definition of the data format dynamically in the pro...
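The idea of building a schema tree while parsing can be illustrated with a minimal Python sketch. It is not the STEED implementation: the `SchemaNode` class, the `merge` and `infer_schema` functions, and the choice to record arrays as a `repeated` flag are all assumptions made for illustration, and the input is assumed to be one JSON record per line.

```python
import json
from dataclasses import dataclass, field


@dataclass
class SchemaNode:
    """One node of a hypothetical schema tree (not the patent's actual layout)."""
    name: str
    types: set = field(default_factory=set)       # JSON type names seen so far
    children: dict = field(default_factory=dict)  # child field name -> SchemaNode
    repeated: bool = False                        # True once seen inside an array


def type_name(value):
    """Map a parsed JSON value to a JSON type name."""
    if value is None:
        return "null"
    if isinstance(value, bool):  # test bool first: bool is a subclass of int
        return "boolean"
    if isinstance(value, (int, float)):
        return "number"
    if isinstance(value, str):
        return "string"
    return "object"


def merge(node, value):
    """Extend the schema tree in place so that it also describes `value`."""
    if isinstance(value, list):
        # Assumption: an array marks the field as repeated; elements share one node.
        node.repeated = True
        for item in value:
            merge(node, item)
        return
    node.types.add(type_name(value))
    if isinstance(value, dict):
        for key, child_value in value.items():
            child = node.children.setdefault(key, SchemaNode(key))
            merge(child, child_value)


def infer_schema(lines):
    """Build the schema tree incrementally, one JSON record per input line."""
    root = SchemaNode("<root>")
    for line in lines:
        merge(root, json.loads(line))
    return root
```

Because the tree is only ever extended, a field that first appears in a later record is added at that point without revisiting earlier records, which is the sense in which the definition is generated dynamically.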

Abstract

The invention provides a processing method and system for tree-structured data (STEED) and relates to the technical field of data processing. The system reads text data and parses it into row-format or column-format binary data; during parsing, a syntax tree is dynamically generated to store the definition of the semi-structured data. The system stores the row-format or column-format binary data, converts between the two formats, and can output the binary data directly as JSON text. Query operations on the semi-structured data are performed on the binary data.
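The conversion between row and column layouts can be sketched in Python as follows. This is a simplified illustration under stated assumptions, not the encoding described in the patent: it uses in-memory Python structures rather than a binary format, identifies each column by a dotted field path, treats arrays as opaque leaf values, and assumes field names contain no dots and no object is empty. The function names are hypothetical.

```python
import json


def flatten(record, prefix=""):
    """Yield (path, leaf) pairs for one nested record; lists are kept as leaves."""
    for key, child in record.items():
        path = f"{prefix}.{key}" if prefix else key
        if isinstance(child, dict):
            yield from flatten(child, path)
        else:
            yield path, child


def rows_to_columns(records):
    """Row layout -> column layout: one list of (record_index, value) per path."""
    columns = {}
    for index, record in enumerate(records):
        for path, leaf in flatten(record):
            columns.setdefault(path, []).append((index, leaf))
    return columns


def columns_to_rows(columns, count):
    """Column layout -> row layout: rebuild each nested record from its paths."""
    records = [{} for _ in range(count)]
    for path, entries in columns.items():
        *parents, last = path.split(".")
        for index, leaf in entries:
            target = records[index]
            for name in parents:
                target = target.setdefault(name, {})
            target[last] = leaf
    return records


def to_json_lines(records):
    """Output records as JSON text, one record per line."""
    return [json.dumps(record, sort_keys=True) for record in records]
```

Storing the record index with each value lets a column omit records in which the field is absent, so optional fields cost nothing in the records that lack them.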

Description

Technical field

[0001] The present invention relates to the technical field of data processing, and in particular to a tree-structured data processing method and system (System for TrEE structured Data, STEED).

Background technique

[0002] With the development of computer networks and big data processing technology, traditional relational data can no longer meet the requirements for defining and using data in network and big data environments. Semi-structured data, represented by JSON and Protocol Buffers, is widely used in practice because it can fully express object data in programming languages and because the original data format can be modified and extended as the format of the data changes.

[0003] Definition of tree-structured data:

[0004] T_value = T_primitive | T_object | T_array

[0005] T_primitive = string | number | boolean | null

[0006] [0007]

[0008] Record = T_object

[0009] As shown above, the tree str...
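The type grammar above can be checked mechanically, as in the following Python sketch. The definitions of the object and array types are not reproduced in this text, so the sketch assumes the usual JSON meaning: an object maps string keys to values and an array is a list of values. The function names are hypothetical.

```python
def is_primitive(value):
    """T_primitive = string | number | boolean | null."""
    return value is None or isinstance(value, (str, int, float, bool))


def is_value(value):
    """T_value = T_primitive | T_object | T_array."""
    if is_primitive(value):
        return True
    if isinstance(value, dict):
        # Assumed T_object: string keys mapped to values.
        return all(isinstance(k, str) and is_value(v) for k, v in value.items())
    if isinstance(value, list):
        # Assumed T_array: a list of values.
        return all(is_value(item) for item in value)
    return False


def is_record(value):
    """Record = T_object: a record must be an object at the top level."""
    return isinstance(value, dict) and is_value(value)
```

Under these assumptions a top-level array or primitive is a valid value but not a valid record, which matches the rule that a record is an object.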

Application Information

IPC(8): G06F17/30
CPC: G06F16/258, G06F16/81, G06F16/84
Inventor: 陈世敏, 王智义
Owner: INST OF COMPUTING TECH CHINESE ACAD OF SCI