
A syntax tree extraction method for JSON data

A syntax tree extraction method, applied in the field of data processing, that addresses the low storage and query efficiency of JSON data records, parsing errors, and slow query responses for sparse fields, thereby improving storage efficiency and query execution efficiency.

Active Publication Date: 2021-07-20
INST OF COMPUTING TECH CHINESE ACAD OF SCI
Cites: 7; Cited by: 0

AI Technical Summary

Problems solved by technology

[0029] The application areas mentioned above are closely tied to people's production and daily life and demand high data-processing efficiency. In these areas, however, JSON data records are stored and queried inefficiently, queries on sparse fields respond slowly, and parsing errors occur during parsing and computation when fields share a name but hold values of different types.
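The two problems named in this paragraph can be illustrated with a small Python sketch. It is not taken from the patent: the sample records, the field names (`id`, `price`, `coupon`), and the helper `field_types` are hypothetical, chosen only to show a field whose value type differs between records and a field that appears in few records.

```python
import json
from collections import defaultdict

# Hypothetical sample records (not from the patent): "price" is a number in
# two records and a string in one; "coupon" appears in only one record.
records = [
    '{"id": 1, "price": 9.5}',
    '{"id": 2, "price": "9.50", "coupon": "SPRING"}',
    '{"id": 3, "price": 12.0}',
]

def field_types(lines):
    """Map each top-level field name to the set of Python type names seen for it."""
    seen = defaultdict(set)
    for line in lines:
        for name, value in json.loads(line).items():
            seen[name].add(type(value).__name__)
    return dict(seen)

types = field_types(records)

# Fields that share a name but carry more than one value type.
conflicts = {name: kinds for name, kinds in types.items() if len(kinds) > 1}
```

A parser that assumes one type per field name would fail or silently coerce values for `price` here, which is the kind of parsing error the paragraph describes.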


Examples


Detailed Description of the Embodiments

[0064] To make the objectives, technical solutions, and advantages of the present invention clearer, the invention is described in further detail below through specific embodiments in conjunction with the accompanying drawings. The specific embodiments described here serve only to explain the present invention, not to limit it.

[0065] The block diagrams shown in the drawings are merely examples of functional entities and do not necessarily correspond to physically separate entities. That is, these functional entities can be implemented in software, in one or more hardware modules or integrated circuits, or in different networks and/or processor devices and/or microcontroller devices.

[0066] For ease of understanding, JSON is defined here first. The definitions related to JSON data take the following forms:

[0067] T_value = T_primi...


Abstract

The present invention provides a syntax tree extraction method for JSON data, comprising: sampling the JSON data in a given data set and recording a structure tree for each sampled JSON record; traversing all the JSON record structure trees and accumulating them into a statistical structure tree; traversing the counter of each node in the statistical structure tree to divide the fields in the JSON records into sparse fields and non-sparse fields; grouping sparse fields with the same characteristics together to form classifications of fields with the same characteristics; generating the initial state of the syntax tree from the non-sparse fields and the classifications of fields with the same characteristics; and, starting from that initial state, parsing all JSON records in the given data set one by one and dynamically constructing the syntax tree corresponding to the data set. By classifying the sparse fields in JSON data records according to their characteristics, the invention improves the organization efficiency of the syntax tree. By grouping fields with the same characteristics, it reduces the number of columnar files holding the parsed data, thereby improving the efficiency of subsequent queries.
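The first steps of the abstract (accumulating a statistical structure tree with counters, then splitting fields into sparse and non-sparse and grouping the sparse ones) can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the node layout, the function names, the sparsity `threshold`, and the grouping rule (same parent path and same set of value types) are all hypothetical, since the abstract does not say which characteristics are compared. Arrays are not descended into, and the final step (dynamically extending the syntax tree while parsing the full data set) is omitted.

```python
import json

def new_node():
    # Hypothetical node layout: an occurrence counter, the value types seen,
    # and child nodes keyed by field name.
    return {"count": 0, "types": set(), "children": {}}

def accumulate(node, value):
    """Merge one JSON value's structure into the statistical structure tree."""
    node["count"] += 1
    node["types"].add(type(value).__name__)
    if isinstance(value, dict):  # simplification: arrays are not descended into
        for name, child in value.items():
            accumulate(node["children"].setdefault(name, new_node()), child)

def build_stat_tree(lines):
    """Accumulate every sampled record into one statistical structure tree."""
    root = new_node()
    for line in lines:
        accumulate(root, json.loads(line))
    return root

def classify(node, total, threshold, path=()):
    """Split fields into non-sparse paths and groups of similar sparse paths.

    A field is treated as sparse when it occurs in fewer than `threshold`
    of the `total` sampled records. Sparse fields are grouped by
    (parent path, set of value types), which is an assumed criterion.
    """
    dense, groups = [], {}
    for name, child in node["children"].items():
        child_path = path + (name,)
        if child["count"] / total < threshold:
            key = (path, frozenset(child["types"]))
            groups.setdefault(key, []).append(child_path)
        else:
            dense.append(child_path)
            sub_dense, sub_groups = classify(child, total, threshold, child_path)
            dense.extend(sub_dense)
            for k, v in sub_groups.items():
                groups.setdefault(k, []).extend(v)
    return dense, groups

# Hypothetical sampled records.
sample = [
    '{"id": 1, "user": {"name": "a"}, "tag_x": "p"}',
    '{"id": 2, "user": {"name": "b"}}',
    '{"id": 3, "user": {"name": "c"}, "tag_y": "q"}',
    '{"id": 4, "user": {"name": "d", "age": 30}}',
]
tree = build_stat_tree(sample)
dense, sparse_groups = classify(tree, total=len(sample), threshold=0.5)
```

In this sample, `tag_x` and `tag_y` each occur in one of four records and land in the same group, so they could share one columnar file instead of two. That sharing is the reduction in file count the abstract attributes to grouping sparse fields.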

Description

Technical Field

[0001] The present invention relates to data processing, specifically to the storage and querying of JSON data, and more specifically to a method for extracting the syntax tree (the metadata of tree-structured data) of JSON data, as well as a data processing method, a query method, and a system based on that method.

Background

[0002] With the development of computer networks and big data processing technology, traditional relational data can no longer meet the requirements for defining and using data in networked, big data environments. Semi-structured data, represented by JSON, can fully express objects (classes) in programming languages and also allows the original data format to be modified and extended as the format of the data changes, so it is widely used in practice.

[0003] In traditional relational databases, before importing records into the database, users need to pre-define the metadata of records in the form of ...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F40/14, G06F16/81
CPC: G06F16/81
Inventors: 陈世敏, 王智义, 祁琦
Owner: INST OF COMPUTING TECH CHINESE ACAD OF SCI