Data prediction classification method and device

A technology of data prediction and classification methods, which is applied in the field of data processing, and can solve the problems of classification results and classification count values ​​leaking user privacy information, etc.

Inactive Publication Date: 2016-03-30
INST OF SOFTWARE - CHINESE ACAD OF SCI
View PDF0 Cites 38 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The present invention provides a data prediction and classification method and device, which can solve the

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data prediction classification method and device
  • Data prediction classification method and device
  • Data prediction classification method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0016] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0017] Some technologies used in the embodiments of the present invention are firstly introduced below.

[0018] Differential privacy is a privacy protection technology based on data distortion. By adding noise to the query or analysis results to distort the data, to ensure that the operation of inserting or deleting a certain record in the data set will not affect the output of any query, so as to achieve the purpose of privacy protection. The formal definit...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a data prediction classification method and device relating to the data process technique field, solving the problem in the prior art that classification result itself and classification count value are possible to leak the private information of a user. The method comprises: building a random forest namely multiple decision trees through a training dataset; carrying out prediction classification to a test dataset by the decision trees in the random forest, and obtaining the classification result satisfying differential privacy. The invention can realize high accuracy prediction classification of the high-dimension large-scale data.

Description

technical field [0001] The invention relates to the technical field of data processing, in particular to a data prediction and classification method and device. Background technique [0002] Classification is an important class of data mining methods. Its purpose is to find out a model that describes and distinguishes data classes or concepts, so that the model can be used to predict the class label of the object. A typical representative of a classification model is a decision tree, which is a tree-shaped classification model. The nodes in the tree represent tests on a certain attribute, and the leaf nodes represent a class. However, both the classification result itself and the classification count value may leak user privacy information. Decision tree classification under traditional privacy protection is mostly realized by data perturbation such as adding random noise or K-anonymity method, or by encrypting original data and intermediate calculation results. However, wh...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/62
CPCG06F18/24765G06F18/214G06F18/241
Inventor 丁丽萍穆海蓉宋宇宁
Owner INST OF SOFTWARE - CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products