Modeling data selection method and apparatus, storage medium and electronic device

The technology concerns the selection of modeling data from initial data and is applied in the field of data processing. It addresses problems such as low classification-model accuracy caused by inaccurate selection of modeling data.

Status: Inactive; Publication Date: 2018-06-15
NEUSOFT CORP
0 Cites, 22 Cited by

AI Technical Summary

Problems solved by technology

[0004] The main purpose of the present disclosure is to provide a method, apparatus, storage medium and electronic device for selecting modeling data, so as to solve the problem in the prior art of low classification-model accuracy caused by inaccurate selection of modeling data.

Embodiment Construction

[0065] Specific embodiments of the present disclosure will be described in detail below in conjunction with the accompanying drawings. It should be understood that the specific embodiments described here are only used to illustrate and explain the present disclosure, and are not intended to limit the present disclosure.

[0066] In order to make it easier for those skilled in the art to understand the technical solutions provided by the embodiments of the present disclosure, the application scenarios of the embodiments of the present disclosure are described first.

[0067] Massive data-labeling work is prone to inconsistent classification standards, differences in personal understanding, and carelessness on the part of labeling staff, so a large amount of data usually ends up with inaccurate category labels. When a classification model is built, this data becomes noise samples and degrades the accuracy of the model. The disclosure can be applied...

Abstract

The invention relates to a modeling data selection method and apparatus, a storage medium and an electronic device, and aims to solve the problem in the prior art of low classification-model accuracy caused by inaccurate selection of modeling data. The method comprises: building a mathematical model multiple times, using initial data that carries data labels as the training set; each time the mathematical model is built, using the initial data as a test set and performing classification calculation with the built model to obtain a model output result, where the model output result comprises classification labels for the initial data; judging, for each build, whether the classification labels obtained are consistent with the data labels, thereby obtaining a judgment result; and, according to the judgment result, screening out from the initial data the target initial data that is finally used for building the model.
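The steps in the abstract can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the patented implementation: the abstract does not say which classifier is used, how the repeated model builds differ from one another, or what agreement level qualifies a sample. Here a toy nearest-centroid classifier stands in for the mathematical model, each build leaves out a different slice of the initial data, and a sample is kept only if its predicted label matches its data label in at least `min_agreement` of the builds. The names `fit_centroids`, `predict`, `select_modeling_data`, `rounds` and `min_agreement` are all hypothetical.

```python
from collections import defaultdict


def fit_centroids(points, labels):
    """Build a toy nearest-centroid model: one mean vector per label."""
    sums = {}
    counts = defaultdict(int)
    for p, y in zip(points, labels):
        if y not in sums:
            sums[y] = [0.0] * len(p)
        sums[y] = [s + v for s, v in zip(sums[y], p)]
        counts[y] += 1
    return {y: [s / counts[y] for s in sums[y]] for y in sums}


def predict(model, point):
    """Return the label whose centroid is closest to the point."""
    def sqdist(centroid):
        return sum((a - b) ** 2 for a, b in zip(point, centroid))
    return min(model, key=lambda y: sqdist(model[y]))


def select_modeling_data(points, labels, rounds=3, min_agreement=1.0):
    """Return indices of samples whose labels the repeated models confirm.

    Assumption: build r trains on every sample except those with
    index % rounds == r, so that the builds differ from one another.
    """
    n = len(points)
    agree = [0] * n
    for r in range(rounds):
        train = [i for i in range(n) if i % rounds != r]
        model = fit_centroids([points[i] for i in train],
                              [labels[i] for i in train])
        # Classify ALL initial data with this build and compare to the labels.
        for i in range(n):
            if predict(model, points[i]) == labels[i]:
                agree[i] += 1
    return [i for i in range(n) if agree[i] / rounds >= min_agreement]


# Two well-separated clusters plus one sample (index 8) that sits in
# cluster "A" but carries the label "B", i.e. a noise sample.
points = [(0, 0), (0, 1), (1, 0), (1, 1),
          (10, 10), (10, 11), (11, 10), (11, 11),
          (0.5, 0.5)]
labels = ["A", "A", "A", "A", "B", "B", "B", "B", "B"]

selected = select_modeling_data(points, labels)
print(selected)  # [0, 1, 2, 3, 4, 5, 6, 7]
```

In this toy run the mislabeled sample is classified as "A" by every build, never matches its "B" label, and is therefore screened out, while the eight consistently classified samples remain as the target data for the final model. Lowering `min_agreement` would make the filter more tolerant of occasional disagreement.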

Description

Technical field

[0001] The present disclosure relates to the technical field of data processing, and in particular to a method, apparatus, storage medium and electronic device for selecting modeling data.

Background art

[0002] Machine learning (ML) is the fundamental way to make computers intelligent. It is a method by which a computer uses existing data to train a model and then uses that model to make predictions. As one of the core research fields of artificial intelligence, machine learning has been applied in many areas of artificial intelligence, especially data mining.

[0003] In related technologies, machine learning is mainly divided into supervised learning and unsupervised learning. In supervised learning, a classification algorithm works by attaching specific category labels to data, so that the computer can complete the classification calculation for new data by learning the "features" o...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06N99/00, G06K9/62
CPC: G06N20/00, G06F18/2411, G06F18/24323, G06F18/214
Inventors: 赵耕弘, 崔朝辉, 赵立军, 张霞
Owner: NEUSOFT CORP