Machine learning hyper-parameter importance assessment method and system, and storage medium

A machine learning and hyperparameter technology, applied in the fields of instruments, computer parts, character and pattern recognition, etc., can solve the problems of wasting machine learning time, wasting user time and energy, wasting computer resources, etc.

Active Publication Date: 2018-08-24
CHINA UNIV OF PETROLEUM (EAST CHINA)
View PDF5 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, the configuration process of the hyperparameter configuration module of the current automated machine learning system is entirely based on experience, or the configuration of several hyperparameters is adjusted one by o

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Machine learning hyper-parameter importance assessment method and system, and storage medium
  • Machine learning hyper-parameter importance assessment method and system, and storage medium
  • Machine learning hyper-parameter importance assessment method and system, and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0060] It should be pointed out that the following detailed descriptions are all illustrative and are intended to provide further explanations for the application. Unless otherwise indicated, all technical and scientific terms used herein have the same meaning as commonly understood by those of ordinary skill in the technical field to which this application belongs.

[0061] The present invention makes full use of multiple data sets in the open machine learning environment OpenML and performance data of each data set under multiple algorithms, combines the meta-learning method to calculate the distance between the target data set and the historical data set, and uses the Relief algorithm And the clustering algorithm gets the importance ranking of each type of hyperparameter of the classification algorithm to be evaluated, and the ranking result is used to guide the automatic parameter adjustment process of the target data set in the classification algorithm to be evaluated. The p...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention discloses a machine learning hyper-parameter importance assessment method and system, and a storage medium. The method comprises the steps of: obtaining different data sets in anOpenML, extracting meta features to show each data set, and collecting data of performances of a classification algorithm to be assessed in different hyper-parameter configurations; extracting meta features to show a used target data set, calculating distances between the meta features to obtain an increasing sequence of the distance between the target data set and a historical data set; using the data of the performances of the classification algorithm to be assessed in different hyper-parameter configurations to assess the hyper-parameter importance, executing a provided Relied and a clustering algorithm for former m historical data sets in order being close to the target data set according to an ordered sequence of distance increasing between the historical data set and the target dataset, and finally obtaining the hyper-parameter importance sequence of the classification algorithm to be assessed and a guided automatic parameter regulation process. The machine learning hyper-parameter importance assessment method and system give a certain guide for the hyper-parameter regulation of a classification algorithm black box so as to save the time and improve the purpose of efficiency.

Description

technical field [0001] The invention is a machine learning hyperparameter importance evaluation method, system and storage medium. Background technique [0002] Machine learning provides important technical support for data processing and data classification. However, model selection and parameter tuning are still two major problems that plague users, so automated machine learning systems emerged as the times require. The automated machine learning system uses automated machine learning algorithms to achieve automated data preprocessing, automated algorithm selection, and automated parameter adjustment, improving the accuracy of data classification and prediction, and freeing users from the heavy tasks of selecting algorithms and repeatedly adjusting parameters come out. [0003] Since the core of automated machine learning is automated algorithm selection and automated hyperparameter configuration, the system reduces the machine learning process to a Combined Algorithm Sel...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/62
CPCG06F18/217
Inventor 孙运雷魏倩孔言
Owner CHINA UNIV OF PETROLEUM (EAST CHINA)
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products