Unlock instant, AI-driven research and patent intelligence for your innovation.

Couf, a data analysis method

a data analysis and data technology, applied in the field of data analysis and data mining, can solve the problems of slow whole process, waste of time, and difficult, if not impossible, and achieve the effect of improving the accuracy of data analysis, reducing the difficulty of data mining, and improving the accuracy of data mining

Active Publication Date: 2020-10-22
WANG ZHIPING
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

This patent aims to provide an improved method and apparatus to efficiently calculate statistics, which reduces processing time and resources. It also introduces a new and simple big-data analysis method and apparatus, as well as a new and easy data analysis method for data mining and a new and simple process for risk analysis. The invention is a computer system with a database that stores simple statistics on each unit of data, which can be easily used to calculate subsequent query statistics on the records in any arbitrary combination of units. Overall, this patent presents a solution for effective and efficient data analysis, especially for big-data.

Problems solved by technology

However, the huge amount of data makes the analyses difficult, if not impossible.
However, there are two obvious disadvantages associated with Hadoop.
This is redundant and a waste of time and it makes the whole process slow.
But there is a setback.
In order for Spark to work well, entire data set needs to be fitted into the memory, Spark needs a lot of memory and memory is more expensive.
When the data set cannot be fitted into the memory, Spark does not work well.
Furthermore, when the same query is executed continually, it runs very fast.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Couf, a data analysis method
  • Couf, a data analysis method
  • Couf, a data analysis method

Examples

Experimental program
Comparison scheme
Effect test

examples

[0083]Example 1. Suppose Z is a multinational company. Z has a computer server hosting a database with company's payroll information. Z has several divisions such as marketing, operations, IT, etc. Z uses COUF for its payroll analyses. Suppose Z defines its divisions as the units defined in COUF. Also suppose xi is the salary value for employee i. Z pre-calculates salary simple statistics for each division (unit) and stores them into company's database as follows:

Number of records: Nm, No, Nt (for Marketing m, Operations o & IT t respectively)

Minimum value of salaries in each division (unit): min m, , mino, mint (for Marketing m, Operations o & IT t respectively)

Maximum value of salaries in each division (unit): maxm, maxo, maxt (for Marketing m, Operations o & IT t respectively)

Sum of salaries (said numerical field):

∑mxi,∑oxi,∑txi

(for Marketing m, Operations o & IT t respectively)

Sum of the squares of the salaries

∑mxi2,∑oxi2,∑txi2

(for Marketing m, Operations o & IT t respectively)

[...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A calculate once use forever (COUF) method to efficiently determine statistics on data records of a data set. Data records are read from an original data set and grouped together into units and simple statistic on each unit are calculated and stored in a database. When a query asking for an additional statistic comes in, the additional statistic can be determined quickly and efficiently from the already calculated unit simple statistics so that read redundancies and computational redundancies can be eliminated.

Description

CROSS REFERENCE TO RELATED APPLICATIONS[0001]This application is a continuation in part of U.S. application Ser. No. 15 / 404,141, filed on Jan. 11, 2017 which is a continuation of U.S. application Ser. No. 15 / 361,441, filed on Nov. 27, 2016. The benefit under 35 USC § 120 for both earlier non-provisional applications is hereby claimed, and the aforementioned applications are hereby both herein incorporated by reference in their entireties.FIELD OF THE INVENTION[0002]This invention relates generally to the field of data analysis, data mining and more specifically to big-data analysis.BACKGROUND OF THE INVENTION[0003]Nowadays more and more businesses, governmental agencies and other organizations need to deal with huge amount of data every day. With the spread of Internet and widely adopted e-commerce, the data sizes grow exponentially and they easily exceed terabytes, petabytes or even beyond. In order to find consumer trends, uncover fraud, or detect terrorists, or for other purposes...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/2453G06F16/2455G06F16/22G06F16/248G06F16/2457
CPCG06F16/248G06F16/24552G06F16/2282G06F16/24573G06F16/24539
Inventor WANG, ZHIPING
Owner WANG ZHIPING