Unlock instant, AI-driven research and patent intelligence for your innovation.

Method and device for determining quantiles of data

A technique for determining methods and quantiles, which is applied in the field of data processing, and can solve the problems of heavy tail distribution, unevenness, and inability to accurately reflect the statistical characteristics of data streams, etc.

Inactive Publication Date: 2017-07-28
BEIJING UNIV OF POSTS & TELECOMM
View PDF0 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, in practical applications, the data stream often presents a heavy-tailed distribution, that is, an uneven distribution.
In this case, due to the use of the default linear function of the uniform distribution of the data stream, the determined quantile error will be large, and the statistical characteristics of the data stream cannot be accurately reflected.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for determining quantiles of data
  • Method and device for determining quantiles of data
  • Method and device for determining quantiles of data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0101] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0102] A method for determining quantiles of data provided by an embodiment of the present invention will firstly be described in detail below.

[0103] see figure 1 , figure 1 A schematic flowchart of a method for determining quantiles of data provided in an embodiment of the present invention may include: a data training process and a quantile estimation process;

[0104] The data training process may include the following steps:

[0105] S101, fitting the tra...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention discloses a method and device for determining quantiles of data. The method includes the following steps that fitting is conducted on training data selected from target data, and a probability density function p(x) corresponding to the training data is obtained; by means of the probability density function p(x), a fitting distribution function F(x) corresponding to the training data and an inverse function F-1(x) of the fitting distribution function F(x) are calculated, wherein the fitting distribution function F(x) is a nonlinear function; for fractiles contained in a preset fractile sequence P, fitting quantiles corresponding to the fractiles are calculated by means of the inverse function F-1(x), and the fitting quantiles are stored in a fitting quantile sequence B; a target data sequence D with to-be-calculated quantiles is obtained; according the fitting distribution function F(x), the inverse function F-1(x) and the fitting quantile sequence B, for the target data sequence D, the current quantiles corresponding to the fractiles are determined respectively. By means of the method and the device, errors of the determined quantiles are reduced.

Description

technical field [0001] The invention relates to the technical field of data processing, in particular to a method and device for determining quantiles of data. Background technique [0002] In the current era of big data, more and more data streams need to be further processed to obtain more useful information. Quantile determination of data streams is one of the important processing methods, which has a wide range of applications in the fields of computer and finance. Quantiles refer to the numerical points that divide the probability distribution range of a random variable into several equal parts. Commonly used are medians, quartiles, and percentiles. By determining its series of quantiles, the cumulative distribution function of the data stream can be observed intuitively, and then its statistical properties can be analyzed. [0003] The existing quantile determination method does not need to analyze the probability distribution model of the data stream, but extracts a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/18G06N7/00
CPCG06F17/18G06N7/01
Inventor 乔媛媛林政刘军何大中
Owner BEIJING UNIV OF POSTS & TELECOMM