Data processing method and device

A data processing and data technology, applied in the field of data processing, can solve the problems of large storage space and waste of storage space for time series data, and achieve the effect of reducing size, reducing storage space occupation, and increasing data volume

Pending Publication Date: 2021-08-24
ALIBABA GRP HLDG LTD
View PDF0 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] Time series data (English full name: Time Series Data) is defined as a series of data indexed by time dimension, and can also be understood as detection data based on a series of indicators continuously generated at a stable frequency, for example, when detecting the air quality of a city , a series of data generated by collecting a value of sulfur dioxide concentration per second; in various applications, it is necessary to store such time-series data persistently in the database for querying. The way of storing time-series data in the prior art usually uses The collected original time-series data is stored in the database. This storage method will make the time-series data occupy a large storage space. When storing massive time-series data for a long time, directly storing the original time-series data will also cause waste of storage space.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data processing method and device
  • Data processing method and device
  • Data processing method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0089] In the following description, numerous specific details are set forth in order to provide a thorough understanding of the specification. However, this specification can be implemented in many other ways different from those described here, and those skilled in the art can make similar extensions without violating the connotation of this specification, so this specification is not limited by the specific implementations disclosed below.

[0090] Terms used in one or more embodiments of this specification are for the purpose of describing specific embodiments only, and are not intended to limit one or more embodiments of this specification. As used in one or more embodiments of this specification and the appended claims, the singular forms "a", "the", and "the" are also intended to include the plural forms unless the context clearly dictates otherwise. It should also be understood that the term "and / or" used in one or more embodiments of the present specification refers t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention provides a data processing method and device. The data processing method comprises the steps: obtaining to-be-processed time series data, and dividing the time series data into first time series data and second time series data; coding the first time sequence data based on a dictionary coding algorithm to generate a first coded value; coding the second time sequence data based on a column compression algorithm to generate a second coded value; and generating processed time sequence data according to the first coded value and the second coded value. According to the data processing method, the time sequence data is divided into two parts according to the data type, and the compression coding method corresponding to each part of time sequence data is adopted to perform compression coding on the time sequence data, so that the size of the time sequence data is greatly reduced, the occupation of the time sequence data on the storage space of the database is reduced, and the data volume of the time sequence data written into the database is increased.

Description

technical field [0001] The embodiments of this specification relate to the technical field of data processing, and in particular, to a data processing method. One or more embodiments of this specification also relate to a data processing apparatus, a computing device, and a computer-readable storage medium. Background technique [0002] Time series data (English full name: Time Series Data) is defined as a series of data indexed by time dimension, and can also be understood as detection data based on a series of indicators continuously generated at a stable frequency, for example, when detecting the air quality of a city , a series of data generated by collecting a value of sulfur dioxide concentration per second; in various applications, it is necessary to store such time-series data persistently in the database for querying. The way of storing time-series data in the prior art usually uses The collected original time-series data is stored in the database. This storage met...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/22G06F16/21G06F8/41
CPCG06F8/44G06F16/217G06F16/2282
Inventor 胡建洪
Owner ALIBABA GRP HLDG LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products