Unlock instant, AI-driven research and patent intelligence for your innovation.

Data processing method and system

A data and data block technology, applied in the direction of electrical digital data processing, special data processing applications, instruments, etc., can solve the problems of affecting query speed, difficulty, and large network overhead, so as to achieve fast query and analysis, ensure query efficiency, and improve The effect of query efficiency

Inactive Publication Date: 2015-06-10
BEIJING DIDI INFINITY TECH & DEV
View PDF4 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Specifically, for example, in a database, multiple fields such as "ID", "name", "count", and "year" are included, when it is necessary to use Structured Query Language (SQL) to query the data in the database When, for example, "SELECT name FROM order WHERE year=2014", since the row-based storage database can only be read row by row, when performing a query, each row of data in the database needs to be read out, and then the corresponding query The data of the statement condition is extracted, resulting in slow query speed
In addition, because when the query targets certain columns, unnecessary column reads cannot be skipped; due to the mixture of columns with different data values, row storage is not easy to obtain a very high compression ratio, that is, the space utilization rate is low
[0005] For columnar storage, although the query efficiency of row-based storage is low, different data columns of columnar-based storage data are stored in different storage nodes. Therefore, to obtain a complete data, the tuple data must be re- The cost of the composition is large, resulting in excessive network overhead, which cannot meet the needs of the distributed file system for instant query of big data
For example, if the data of different columns of the same data unit is stored on different storage nodes, in order to obtain the same data unit, it is necessary to repeatedly read multiple nodes to reconstruct related data, resulting in increased network overhead
When reading massive amounts of data, this network overhead can severely impact query speed

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data processing method and system
  • Data processing method and system
  • Data processing method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] Embodiments of the present disclosure will now be described in detail with reference to the accompanying drawings. It should be noted that the same numerals may be used for similar units or functional components in the drawings. The accompanying drawings are only intended to illustrate embodiments of the present disclosure. Those skilled in the art can obtain alternative technical solutions from the following descriptions without departing from the spirit and protection scope of the present disclosure.

[0034] Embodiments of the present disclosure will be described in detail below in conjunction with the accompanying drawings.

[0035] like figure 2 As shown, according to an embodiment of the present disclosure, a data writing method is provided. The method includes: at step S101, reading data of a predetermined row to generate a corresponding file block. In step S102, create a header file for each file block, wherein the header file includes the index of each file...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to data processing methods and device systems, in particular to a data write-in method and system and a data reading method and system. The data write-in method includes that data of predetermined rows are read so as to generate corresponding file blocks; a header file is established for each file block, the header files comprise index of each file block in a database and index of the files that are stored in each file block, and each file block and the corresponding header file form a data block; the data blocks are written in storage nodes. By means of the methods and the systems, efficient compression of data can be performed, storage costs are lowered, the storage space is saved, and data query and analysis speed is increased.

Description

technical field [0001] Embodiments of the present disclosure generally relate to the field of data storage, for processing data, especially data writing and reading methods and corresponding systems. Background technique [0002] For enterprises, organizations, agencies and other institutions, a large amount of data will be generated in daily life, and these data will increase with the passage of time, and the amount of data will become extremely large. These big data provide valuable raw data for business development, statistical analysis, policy formulation, etc. However, as the amount of collected or collected data continues to increase, the system load capacity for storing these data continues to increase, and higher requirements are placed on the data storage architecture. How to improve the storage capacity and query efficiency of massive data has become a solution One of the hard problems of big data problems. [0003] Traditional data storage is row storage and col...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
CPCG06F16/1737G06F16/182
Inventor 董旭冯海涛
Owner BEIJING DIDI INFINITY TECH & DEV