Regression model hyperparameter optimization method

A regression model and optimization method technology, applied in the fields of electrical digital data processing, special data processing applications, instruments, etc., can solve the problems of error estimation uncertainty, increasing computational complexity, powerlessness, etc.

Inactive Publication Date: 2018-04-06
SHANXI UNIV
View PDF0 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the training and verification process requires random partitioning of the data, which brings uncertainty in error estimation and non-uniqueness in parameter selection; in addition, the data partitioning, training and verification steps increase the computational complexity
When the computing power is limited and the actual demand is relatively urgent (such as real-time prediction of short-term traffic flow), this parameter optimization method seems to be unable to do what it wants.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Regression model hyperparameter optimization method
  • Regression model hyperparameter optimization method
  • Regression model hyperparameter optimization method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] The present invention will be further described in detail with reference to the accompanying drawings and embodiments.

[0028] Hyperparameter optimization problem: Known regression dataset {(x i ,y i ), i=1,2,...,n} and the regression model m(·|p), for the hyperparameter p of the model, L candidate parameters {p l , l=1,2,...,L} select a parameter p* that is most suitable for this data set, that is, to make the regression model m(·|p*) achieve the best prediction accuracy on this data set .

[0029] Symbolic representation: {(x i ,y i ), i=1,2,…,n} represent the existing regression data set, where x i ,y i Represent the input vector and output value of the i-th sample, n is the sample size in the data set; m(·|p) represents the regression model, where p is the hyperparameter of the model, and its value range is a set containing L elements The set of arithmetic difference or geometric sequence P={p l ,l=1,2,...,L}; p* is the optimal hyperparameter, and p*∈P; the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a regression model hyperparameter optimization method comprising the first step of sequentially training regression models with a hyperparameter being p1 on an entire data set,and obtaining L well-trained candidate hyperparameter models; the second step of obtaining errors of each candidate hyperparameter model on each sample (xi, yi); the third step of calculating a direction similarity matrix; the fourth step of calculating the symmetry similarity of each parameter; and the fifth step of searching a hyperparameter with the smallest symmetry similarity, and taking thehyperparameter as an optimal parameter for returning. The parameter optimization process provided by the method does not need the artificial setting of other parameters, and the optimization processhas no subjective interference; the optimization process does not require data division and has high efficiency and certainty; and main calculation parts of the method are relatively independent and can be processed in parallel on large data. The efficiency of the method is 6-8 times that of a cross validation method.

Description

technical field [0001] The invention belongs to the field of machine learning optimization modeling, and in particular relates to a regression model-oriented hyperparameter optimization method. Background technique [0002] In the context of the rapid growth of data and information, machine learning, as the core driving force of data mining, has become an indispensable key link in the process of knowledge extraction. Inferring the correspondence between input data and output data based on empirical data is an important class of problems to be solved by machine learning. This type of problem is a regression prediction problem when both the input data and the output data are numerical. For example, predict precipitation based on conditions such as temperature and humidity, predict electricity load based on the population, GDP and consumer index of a certain place, and use the closing price, increase and trading volume of the stock index futures index on the previous day to pr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F19/00
CPCG16Z99/00
Inventor 姜高霞王文剑杜航原
Owner SHANXI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products