Regression prediction method and device

A technology for regression prediction and data prediction, which is applied in prediction, data processing applications, calculations, etc. It can solve the problems of missing data, not satisfying the law and range of numerical changes, and not solving the problems of missing and heterogeneous features, so as to improve the prediction effect of effect

Inactive Publication Date: 2012-03-21
INST OF COMPUTING TECH CHINESE ACAD OF SCI
View PDF0 Cites 35 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The existing regression prediction methods have the following two problems: First, due to the lack of data or feature selection, sometimes the original data points themselves may not contain enough information to perform regression prediction on the output (this problem can be referred to as feature missing); Secondly, because the data on each dimension of the data point X may not be numerical, it may not satisfy the changing law and range of numerical values, such as periodic angles, Boolean gender, etc., and enumerated colors Etc., which affects the effect of regression and the accuracy of prediction to a certain extent (this problem can be referred to as feature heterogeneity for short)
In addition, this method does not solve the problem of missing features and heterogeneous features

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Regression prediction method and device
  • Regression prediction method and device
  • Regression prediction method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below through specific embodiments in conjunction with the accompanying drawings. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0042] In order to better understand the present invention, some background technical knowledge is introduced first.

[0043]MapReduce (Jeffrey Dean Sanjay Ghemawat. MapReduce: a flexible data processing tool [J]. Communications of the ACM, January 2010, v.53 n.1.) is a large-scale data parallel framework proposed by Google in recent years (cloud computing Framework) is also a programming model and specification for large-scale data processing, which provides a good underlying package and facilitates writing parallel programs. MapReduce adopts the idea of ​​​​divide and conquer. The b...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a regression prediction method, wherein not only similarity between independent variables X is taken into consideration, but also similarity between dependent variables Y of raw data is taken into consideration, and the model of output value y development based on the historical angle of close neighbors. Compared with the conventional model without taking data development mode into consideration, only one preprocessing section is added to a data set, and the information of the data point can be diversified without the need of extra resource; and the information of the raw data point X is diversified, and finally the prediction effect is improved. Furthermore, the regression prediction method can be realized on a MapReduce frame, and the execution speed can be improved by utilizing the parallelism of the device.

Description

technical field [0001] The invention belongs to statistical regression analysis and prediction, in particular to a regression prediction method and device used in statistical machine learning. Background technique [0002] Regression Analysis (Regression Analysis) is a method of statistically analyzing data, mainly to explore whether there is a specific relationship between data. Regression analysis is to establish a model of the relationship between the dependent variable Y (response variables) or dependent variables (dependent variables) and the independent variable X (predictors) or independent variables (independent variables). In statistical machine learning, the regression prediction method is mainly used to predict and analyze data. Among them, X is generally multidimensional data and Y is generally numerical data, which is called multiple regression. According to the regression equation, it can be divided into linear regression and nonlinear regression. The most b...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06Q10/04
Inventor 李锐张帅王斌李鹏张冠元鲁凯
Owner INST OF COMPUTING TECH CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products