PostgreSQL high concurrency streaming big data multidimensional quasi real-time statistic method

A multi-dimensional, big data technology, applied in the field of data statistics, can solve problems such as the inability to meet the needs of high-efficiency statistics and huge computing overhead, and achieve the effects of improving report timeliness, saving hardware investment, and improving statistical performance.

Inactive Publication Date: 2014-11-26
杭州斯凯网络科技有限公司
View PDF3 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Or because each piece of data is triggered, it obviously brings a relatively large computing overhead, which is extremely easy to generate a write bottleneck, and cannot meet the high time-efficiency statistical requi

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • PostgreSQL high concurrency streaming big data multidimensional quasi real-time statistic method
  • PostgreSQL high concurrency streaming big data multidimensional quasi real-time statistic method
  • PostgreSQL high concurrency streaming big data multidimensional quasi real-time statistic method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0033] A PostgreSQL high-concurrency streaming big data multi-dimensional quasi-real-time statistics method, suitable for OLTP systems,

[0034] Step 1: Create the incremental state table of the flow meter, the statistical function, the incremental state table of the function, and the analysis function, and initialize the incremental state table of the flow meter and the incremental state table of the function;

[0035] Step 2: Calculate the number of access times of the analysis function, and record it as an integer, as the statistical number data,

[0036] Step 3: Analyze function work, enter sub-steps,

[0037] Sub-step 1: The computer judges whether the parameter value of the data is correct, if it is correct, enter the data into the flow information table, if not, end this method and report an error,

[0038] Sub-step 2: Take out the statistical function from the function incremental state table, store it in the computer memory, traverse the function name and variables o...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a PostgreSQL high concurrency streaming big data multidimensional quasi real-time statistic method. According to the technical scheme, the method is characterized in that a recording table increment status table, a statistical function, a function increment status table and an analysis function are established, and the function increment status of the recording table increment status table is initialized; the number obtaining frequency of the analysis function is calculated and recorded as an integer and serves as statistic frequency data, the analysis function works, and substeps are started; a computer outputs a statistic dimension table containing multidimensional quasi real-time statistic data according to the recording statistic data obtained by the statistic function. Due to the method, the hardware input is reduced by at least 100 times, and real-time effectiveness is controlled within one minute.

Description

technical field [0001] The invention belongs to a data statistics method, in particular to a method for multi-dimensional quasi-real-time statistics of PostgreSQL high-concurrency streaming big data. Background technique [0002] With the development of the Internet, there are more and more applications and users on the Internet, and the data generated by users is also growing explosively. The data generated by users can be analyzed according to various dimensions of the data to obtain the data that enterprises are concerned about, such as user data. Liquidity, difference, dissemination, relationship between products and user groups, time-sharing of application popularity, ranking by region, etc., mining potential needs of users according to user characteristics, etc.; the simplest and most effective way for the program to obtain data generated by users is For pipeline data, a single data packet contains all the information of each dimension at the occurrence time point. Thi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30G06Q10/06
CPCG06F16/283
Inventor 周正中
Owner 杭州斯凯网络科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products