Data classification and identification method, apparatus, computer apparatus, and readable storage medium

A technology of classification identification and classification data, applied in the computer field, can solve the problems of low robustness of classification identification accuracy, inability to meet machine classification identification, inability to train classification models, etc., to achieve high accuracy and robustness, improve efficiency, Robust effect

Inactive Publication Date: 2019-04-16
北京中关村科金技术有限公司
View PDF0 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] In the existing field of text classification, the open source corpus content published by scientific research websites is mainly used for scientific research purposes, but cannot train the corresponding classification models required by the industrial environment, but can be used to train the corresponding classification models required by the industrial environment for commercial use. Most machine learning algorithms rely on marked databases, but marked databases are built on the basis of a large amount of manpower collected and marked manually. The larger the marked database, the higher the labor cost. Smaller labeled databases cannot guarantee sufficient accuracy and robustness of the trained classification model
[0004] It can be seen that in the existing technology, the classification model corresponding to the industrial environment has technical defects such as high cost, low accuracy of classification and identification, and low robustness, which cannot meet the needs of current machine classification and identification.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data classification and identification method, apparatus, computer apparatus, and readable storage medium
  • Data classification and identification method, apparatus, computer apparatus, and readable storage medium
  • Data classification and identification method, apparatus, computer apparatus, and readable storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0031] In the embodiment of the present invention, it should be noted that the method for classifying and identifying data sets is not only suitable for classifying and identifying pure data sets, but for all information that can be converted into multidimensional data through existing computer technology, including and not Limited to text, audio, and image information.

[0032] In the embodiment of the present invention, the data set classification identification method can be applied to the terminal; computer equipment, the computer equipment can be an independent physical server or terminal, or ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the computer field and provides a data set classification identification method. The method comprises the following steps: acquiring a data set to be classified; Dividing thedata set to be classified into a plurality of data clusters to be classified; Judging whether the data cluster to be classified satisfies the standard defined by the sample data set; Discarding invalid data clusters to be categorized that do not meet the criteria defined by the sample data set; Determining a classification mark which is most relevant to the data cluster to be classified accordingto the characteristic information of the data cluster to be classified satisfying the standard; classifying and identifying The data clusters to be classified. A method for classifying and identifyinga data set provide by an embodiment of that present invention By dividing the data set to be classified into a plurality of data clusters to be classified, the processing efficiency is improved, andthe characteristic information of the data cluster to be classified and the characteristic information of the sample data set are utilized, so that the classification identification method of the dataset still has high robustness and accuracy depending on a small amount of sample data information.

Description

technical field [0001] The present invention relates to the field of computer technology, in particular to a data set classification and identification method, device, computer equipment and readable storage medium. Background technique [0002] The current machine learning technology is in the ascendant, especially the deep learning technology is expanding in the field of industrial application, and the ability to independently classify and label data reasonably through learning is one of the foundations of machine deep learning technology. [0003] In the existing field of text classification, the open source corpus content published by scientific research websites is mainly used for scientific research purposes, but cannot train the corresponding classification models required by the industrial environment, but can be used to train the corresponding classification models required by the industrial environment for commercial use. Most machine learning algorithms rely on ma...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/35
Inventor 钟尉
Owner 北京中关村科金技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products