Data recognition method and device

A data identification and data technology, applied in the field of data processing, can solve the problems of inconspicuous differences, failure to use all recognizers, and failure to achieve ideal results, etc., to achieve the effect of ensuring differences and improving accuracy

Active Publication Date: 2014-03-26
NEC (CHINA) CO LTD
View PDF3 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] Integrated learning methods have strong scalability for machine learning on large-scale data, but if they are directly applied to large-scale data learning problems, the final data mining accuracy will not be very high due to the inconspicuous differences , can not achieve the desired effect
Although large-scale data learning is achieved, it cannot fully reflect the advantages of large-scale data learning
[0010] At present, the ensemble learning method can be applied to large-scale learning through resampling technology and subset division. However, different recognizers ca

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data recognition method and device
  • Data recognition method and device
  • Data recognition method and device

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0024] The embodiment of the present invention provides a data recognition method and device, which divides a labeled data set containing labeled data samples into multiple data subsets according to the difference of labeled data samples, so that each recognizer is trained according to each data subset. , To ensure the difference of each recognizer, therefore, when performing data recognition on the data to be recognized, the recognition result given by the trained recognizer is obtained, and then the final recognition result of the data to be recognized is determined according to each recognition result, which improves Accuracy of big data recognition.

[0025] In the process of dividing the entire annotation set into multiple subsets, the difference between the data subsets is taken into account as an optimized index, so as to ensure that the final obtained multiple subsets have the greatest difference.

[0026] Furthermore, when performing data recognition, multiple recognizers ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a data recognition method and device, and relates to the technology of data processing. A label data set including label data samples is divided into a plurality of data subsets according to the difference of the label data samples, so that all recognizers conduct training according to the data subsets respectively, the difference of all the recognizers is guaranteed, hence, when data recognition is carried out on data to be recognized, given recognition results of the recognizers after training are obtained, then, a final recognition result of the data to be recognized is determined according to all the recognition results, and thus the accuracy of big data recognition is improved.

Description

Technical field [0001] The present invention relates to data processing technology, in particular to a data identification method and device. Background technique [0002] At present, the data generation speed in the real and virtual world is increasing. Automatic identification of data will facilitate users to find and use the data. Therefore, when many applications or systems obtain new data, they need to pass on existing data. Recognition method, data recognition of the obtained data. [0003] The current data recognition method is mainly: first select the corresponding training data from the labeled data to be learned by the recognizer, and when new data is obtained, the learned recognizer can be used to recognize the data. [0004] When learning the recognizer, the technologies most relevant to this patent include large-scale machine learning and integrated learning. The two learning methods are specifically explained below: [0005] Large-scale machine learning refers to the th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
CPCG06N5/025
Inventor 李建强刘春辰
Owner NEC (CHINA) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products