Data statistics method, system, device and storage medium based on discrete grouping

A data statistics and data processing technology, applied in the field of data statistics, that addresses problems such as poorly implemented load balancing

Pending Publication Date: 2020-12-22
CTRIP COMP TECH SHANGHAI

AI Technical Summary

Problems solved by technology

Generally speaking, data skew is caused by poor load balancing implementation.

Embodiment Construction

[0037] Example embodiments will now be described more fully with reference to the accompanying drawings. Example embodiments may, however, be embodied in many forms and should not be construed as limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the concept of the example embodiments to those skilled in the art. The same reference numerals denote the same or similar structures in the drawings, and thus their repeated descriptions will be omitted.

[0038] Figure 1 is a flow chart of the data statistics method based on discrete grouping of the present invention. As shown in Figure 1, an embodiment of the present invention provides a data statistics method based on discrete grouping, comprising the following steps:

[0039] S110. Obtain real-time data with multiple attribute values, and perform data discretization based on a combination of at least ...

Abstract

The invention provides a data statistics method, system, device and storage medium based on discrete grouping. The method comprises the steps of: obtaining real-time data with a plurality of attribute values, and discretizing the data by using a combination of at least two attribute values in the real-time data as a grouping label, thereby obtaining a plurality of data groups; cyclically assigning the data groups to the corresponding data processing devices according to the number of data processing devices; obtaining a data statistics condition sent by a statistics requester, and sending the data statistics condition to each data processing device; and combining the data information fed back by the data processing devices and outputting the combined data information as the data statistics result. The method can improve the robustness of the program, avoid data skew at traffic peaks, achieve complete state management, and ensure the accuracy and consistency of the data.
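The four steps in the abstract can be sketched in a few lines of Python. This is only an illustrative reading, not the patented implementation: the function names, the sample records, and the attribute names (`city`, `channel`, `amount`) are hypothetical, "cyclic" assignment is assumed to mean round-robin (group i goes to device i mod n), and the statistic is assumed to be a simple count of matching records.

```python
from collections import defaultdict


def discretize(records, label_attrs):
    """Group records by a label built from two or more attribute values."""
    if len(label_attrs) < 2:
        raise ValueError("the grouping label combines at least two attributes")
    groups = defaultdict(list)
    for rec in records:
        label = tuple(rec[a] for a in label_attrs)
        groups[label].append(rec)
    return groups


def assign_round_robin(groups, num_devices):
    """Deal the data groups out cyclically: group i goes to device i mod n."""
    devices = [dict() for _ in range(num_devices)]
    for i, label in enumerate(sorted(groups)):
        devices[i % num_devices][label] = groups[label]
    return devices


def device_count(device, condition):
    """Partial result of one device: records that satisfy the condition."""
    return sum(1 for recs in device.values() for rec in recs if condition(rec))


def run_statistics(devices, condition):
    """Send the condition to every device and merge the partial counts."""
    return sum(device_count(d, condition) for d in devices)


# Hypothetical sample data; all attribute names are assumptions.
records = [
    {"city": "SHA", "channel": "app", "amount": 120},
    {"city": "SHA", "channel": "web", "amount": 80},
    {"city": "SHA", "channel": "app", "amount": 200},
    {"city": "BJS", "channel": "app", "amount": 50},
    {"city": "BJS", "channel": "web", "amount": 300},
    {"city": "SHA", "channel": "web", "amount": 40},
]

groups = discretize(records, ["city", "channel"])
devices = assign_round_robin(groups, num_devices=2)
total = run_statistics(devices, lambda r: r["amount"] >= 100)
records_per_device = [sum(len(v) for v in d.values()) for d in devices]
print(total, records_per_device)
```

In this sketch each device only ever sees its own groups, and the requester's condition is evaluated locally before the partial counts are merged, which mirrors the "send the condition, combine the feedback" structure described above.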

Description

Technical field

[0001] The present invention relates to the field of data statistics, and in particular to a data statistics method, system, device and storage medium based on discrete grouping.

Background technique

[0002] Real-time data statistics in big data scenarios are an important part of building a real-time data warehouse system. Whether for display in an application's business system or for real-time label analysis in an application's analysis system, real-time summary statistics are an important technical scenario, so the robustness and flexibility of the big data real-time summary statistics program must be ensured.

[0003] In a cluster system, the cache is generally distributed; that is, different nodes are each responsible for a certain range of cached data. Usually the cached data is not dispersed enough, so a large amount of cached data is concentrated on one or several service nodes, which is called dat...
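To make the concentration problem in paragraph [0003] concrete, the following sketch compares the per-node load when records are routed by a single coarse attribute with the load when they are routed by a finer, combined label. Everything here is an assumption for illustration: the traffic mix, the attribute names, the routing rule (distinct keys dealt out to nodes in sorted order), and the skew metric (maximum node load divided by mean node load) are not taken from the patent text.

```python
from collections import Counter


def skew_ratio(loads):
    """Max node load divided by mean node load; 1.0 is perfectly balanced."""
    mean = sum(loads) / len(loads)
    return max(loads) / mean


def loads_by_key(records, key_fn, num_nodes):
    """Route each distinct key to one node (keys dealt out in sorted order)
    and return the number of records that land on each node."""
    counts = Counter(key_fn(r) for r in records)
    loads = [0] * num_nodes
    for i, key in enumerate(sorted(counts)):
        loads[i % num_nodes] += counts[key]
    return loads


# Hypothetical traffic in which one "hot" city dominates.
records = (
    [{"city": "SHA", "channel": c} for c in ("app", "web", "api", "mini")] * 20
    + [{"city": "BJS", "channel": "app"}] * 10
    + [{"city": "CAN", "channel": "web"}] * 10
)

coarse = loads_by_key(records, lambda r: r["city"], num_nodes=2)
fine = loads_by_key(records, lambda r: (r["city"], r["channel"]), num_nodes=2)

print(coarse, skew_ratio(coarse))
print(fine, skew_ratio(fine))
```

With the single-attribute key, every record of the hot city lands on the same node; the combined key splits that city into several smaller groups that can be spread across nodes.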

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/18
CPC: G06F17/18
Inventor: 王旭郑浩华张延成吉聪睿
Owner: CTRIP COMP TECH SHANGHAI