Safe feature engineering method and device

A feature engineering and feature data technology, applied in computer security devices, instruments, computing, etc., can solve hidden dangers in data transmission security, cannot solve data trust issues, cannot completely eliminate privacy data leakage, and achieve the effect of improving security

Active Publication Date: 2019-07-19
ADVANCED NEW TECH CO LTD
View PDF3 Cites 20 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The disadvantage of this solution is that the data plaintext still needs to be output externally, which cannot completely eliminate the leakage of private data, nor can it solve the problem of data trust itself
In addition, it is hard to find a completely trusted third party, and there are hidden dangers in data transmission security during the transmission of data / models to trusted third parties
Not only that, there is an operation authority problem in the trusted third party, it is difficult to control the granularity of security checks on data access, and the results of feature engineering processing may also be leaked

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Safe feature engineering method and device
  • Safe feature engineering method and device
  • Safe feature engineering method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0040] The present disclosure will be further described below in conjunction with specific embodiments and drawings, but the protection scope of the present disclosure should not be limited thereby.

[0041] The present disclosure provides secure feature engineering methods and apparatus. In a joint modeling scenario, there is generally a situation where one party has feature data and the other party has label information but no or a small amount of incomplete feature data, which requires the sample data of the modeling partners to work together to complete feature engineering processing . According to an embodiment of the present disclosure, the first device may transmit a first data set to the second device, and the first data set includes ciphertexts of label information of multiple data objects. The second device may perform feature engineering on a second data set including feature data of the plurality of data objects to generate a subset of the second data set. The se...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a safe feature engineering method and device. The first device may transmit a first data set to the second device, the first data set including a ciphertext of tag information of the plurality of data objects. The second device may perform feature engineering processing on a second data set including feature data of the plurality of data objects to generate a subset of the second data set. The second device may generate a ciphertext subset of the tag information of the corresponding data object in the first data set according to the subset of the second data set, and send the ciphertext of the ciphertext subset to the first device. The first device can decrypt the ciphertext subset and generate statistical information, and send the statistical information to the second device. The second device may use the statistical information to calculate a characteristic engineering index. The invention further provides a corresponding feature engineering device.

Description

technical field [0001] The present disclosure relates to feature engineering, and in particular to a safe feature engineering method and device. Background technique [0002] The modeling process requires basic feature engineering processing on sample data, and then model training with the data set processed by feature engineering. Feature engineering refers to the process of converting raw data into feature vectors. Feature engineering is the most important initial step in machine learning, which will directly affect the effect of machine learning, and usually requires a lot of time. Typical feature engineering includes processes such as data cleaning, feature discretization, feature extraction, and feature selection. In big data scenarios, data is usually stored in a distributed manner, and feature engineering processing requires the fusion of modeling sample data from all parties. For example, when binning data, it needs the cooperation of feature data and label inform...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F21/60
CPCG06F21/602
Inventor 林文珍
Owner ADVANCED NEW TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products