Unlock instant, AI-driven research and patent intelligence for your innovation.

A training data selection method for machine learning

A data selection and training data technology, applied in the field of machine learning, can solve problems such as difficult generalization, achieve the effect of improving performance and reducing computing overhead

Active Publication Date: 2022-03-01
UNIV OF SCI & TECH OF CHINA
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, the above-mentioned existing data selection strategies are artificially defined heuristic strategies, which are highly specific. Since different machine learning tasks usually have different data distribution and model characteristics, these rules are often used in different machine learning tasks. difficult to generalize

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A training data selection method for machine learning
  • A training data selection method for machine learning
  • A training data selection method for machine learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0021] Next, the technical solutions in the embodiments of the present invention will be apparent from the specific part of the present invention, which is clearly, and it is understood that the described embodiments are merely the embodiments of the present invention, not all of the embodiments. Based on the embodiments of the invention, all other embodiments obtained without making creative labor premises, in the preceding embodiments of the present invention. Contents not described in detail in the embodiments of the present invention belong to prior art known to those skilled in the art.

[0022] Such as figure 1 As shown, the embodiment of the present invention provides a method of data selection of machine learning. It is a method of dynamically selecting training data in accordance with the current training status at different stages of machine learning, and can improve the performance of the machine learning model, including the following step:

[0023] Step 1, select the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a data selection method for machine learning, comprising the following steps: step 1, selecting a machine learning model of the data to be selected, and obtaining a training data set corresponding to the machine learning model; step 2, randomly selecting from the training data set Select a data subset as the policy training data set, apply the policy training data set to the machine learning model for several rounds of training through deep reinforcement learning, and determine the data selection strategy that matches the machine learning model according to the training results; Step 3, by determining The data selection strategy selects the input data of the machine learning model in batches, and uses the selected data for the training of the machine learning model. The method can use the obtained optimal training data selection strategy for the current machine learning model to select the training data of the machine learning model and improve the performance of the machine learning model.

Description

Technical field [0001] The present invention relates to the field of machine learning, and more particularly to a training data selection method for machine learning. Background technique [0002] In recent years, machine learning, especially based on deep learning technology based on large-scale deep neural networks, has developed rapidly, and has been applied in various aspects of life. With the increasing popularity of deep learning, data selection in machine learning has become an increasingly concern. How to automatically select data, improve the performance of deep learning models, becoming a current urgent need. [0003] At present, there have been many methods in the field of machine learning data selection, such as in accordance with the "difficulty", "curriculum), which is sequentially trained from low to high, and is conducive to the training process of the model. In addition, the loss function size of self-study data is the measurement of "difficulty". In the self-lea...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06K9/62G06N20/00
CPCG06F18/214
Inventor 李向阳范阳张兰
Owner UNIV OF SCI & TECH OF CHINA