Data privacy protection method and system in machine learning

A machine learning and data privacy technology, applied in the field of data security, can solve the problems of decreased training efficiency, increased model accuracy loss, poor practicability of fully homomorphic encryption, etc., achieving good scalability, small degree of ciphertext expansion, and practicality Effect

Active Publication Date: 2018-10-30
RENMIN UNIVERSITY OF CHINA
View PDF6 Cites 30 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] 1) The current security protection of statistical data mainly focuses on protecting individual data from being extracted from group data, but lacks the protection of the data content itself;
[0005] 2) Existing schemes propose to use fully homomorphic encryption technology to process data and apply it to machine learning, but the length of ciphertext generated by fully homomorp

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data privacy protection method and system in machine learning
  • Data privacy protection method and system in machine learning
  • Data privacy protection method and system in machine learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] The present invention will be described in detail below in conjunction with the accompanying drawings and embodiments.

[0024] The present invention provides a data privacy protection system in machine learning, which includes a user end and a machine learning server end. The user end is provided with a data encryption system, which is used to protect the privacy of the data owner by using an encryption algorithm with order-preserving / distribution-preserving properties. The original data is encrypted, and the ciphertext data is generated and sent to the machine learning server; the machine learning server is equipped with a machine learning service system, which is used to train the machine learning model to be used according to the ciphertext data, and obtain the optimal machine learning model , and use the optimal machine learning model to predict or classify the ciphertext data to be predicted or classified, and return the prediction or classification results to the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a data privacy protection method and system in machine learning. The method is characterized by comprising the following steps of: 1) selecting a to-be-used encryption algorithm and system parameters to generate a secret key; 2) encrypting original data to generate corresponding cyphertext data; 3) carrying out training and parameter adjustment on a to-be-used machine learning model by using the cyphertext data so as to obtain an optimal machine learning model; and 4) encrypting to-be-predicted or classified original data by using the secret key in the step 1) by adoption of the method in the step 2) , and inputting the to-be-predicted or classified original data into the optimal machine learning model to obtain a prediction or classification result. According to the method and system, an order preserving/distribution property preserving encryption algorithm and the machine learning model are combined, so that the original data and the machine learning model can be protected. The swelling degree of cyphertext output by the order preserving/distribution property preserving encryption algorithm is far lower than that of a full-homomorphic encryption algorithm, and certain distribution features in plaintext features can be kept, so that the machine learning is relatively high in efficiency and has relatively good expansibility.

Description

technical field [0001] The invention relates to the field of data security, in particular to a data privacy protection method and system in machine learning. Background technique [0002] In recent years, with the continuous development of information technology, machine learning technology has become the cornerstone of technology in the era of big data. Machine learning technology explores the existing data, discovers potential connections in the data, and classifies or predicts according to the obtained model. Numerous service providers use machine learning models as a resource to provide services to the public, bringing many conveniences to people. However, current machine learning-based services ignore public privacy concerns. In order to be able to train models, service providers collect a large amount of user information, which even includes user privacy data. Users lose control over the data after uploading the data, and cannot guarantee whether the data will be mi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F21/60H04L29/06
CPCG06F21/602H04L63/0428
Inventor 秦波唐文易赵素云陈红
Owner RENMIN UNIVERSITY OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products