Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Graph data anonymization method and device based on deep neural network, and storage medium

A deep neural network, graph data technology, applied in storage media, graph data anonymity method based on deep neural network, and device field, can solve the problems of limited manually specified features, sacrificing the usable value of data, huge graph data, etc.

Pending Publication Date: 2020-07-31
NAT UNIV OF DEFENSE TECH
View PDF3 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the potential feature dimension of graph data is very large, and the manually specified features are limited. Attackers can easily find potential correlation features by using advanced mining methods such as machine learning. In order to achieve a good anonymity effect, a large amount of noise information needs to be added, sacrificing usable value of data
For the anonymity of relational structures of graph-structured data such as social networks, existing anonymity technologies can only specify feature dimensions for privacy protection, and cannot resist multi-dimensional association attacks
Moreover, it is difficult to achieve a good trade-off between data anonymity and usability when existing technologies perform operations such as adding noise, feature generalization, and perturbation to the features in the graph.
That is, when anonymity is high, data availability is often low

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Graph data anonymization method and device based on deep neural network, and storage medium
  • Graph data anonymization method and device based on deep neural network, and storage medium
  • Graph data anonymization method and device based on deep neural network, and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] See figure 1 , a kind of graph data anonymous method based on deep neural network of the present invention, comprises the following steps:

[0034] Step 1: Use the random walk strategy of the node2vec algorithm to sample to obtain the real node sequence in the graph data, and the nodes in the node sequence are represented as vectors in a One-hot manner;

[0035] Step 2: Construct a learning model of graph data features based on a deep neural network, use the real node sequence as input to train the learning model, optimize model parameters, and obtain a trained learning model;

[0036] Step 3: Input the real node sequence into the trained learning model, and output the random walk sequence of the simulated nodes;

[0037] Step 4: Add noise satisfying the mechanism of differential privacy to the obtained random walk sequence of simulated nodes, and then synthesize them to obtain an anonymous graph.

[0038] The present invention uses simulated synthetic graph data to r...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a graph data anonymization method and device based on a deep neural network and a storage medium. According to the method, an anonymous graph with high anonymity and high data availability can be generated for data release, and the method comprises the following steps: 1, sampling by using a random walk strategy of a node2vec algorithm to obtain a real node sequence in graphdata, and representing nodes in the node sequence as vectors in an One-hot manner; 2, constructing a learning model of graph data features based on a deep neural network, training the learning modelby taking a real node sequence as input, and optimizing model parameters to obtain a trained learning model; 3, inputting a real node sequence into the trained learning model, and outputting a randomwalk sequence of the simulated nodes; 4, adding noise meeting a differential privacy mechanism to the obtained random walk sequence of the simulated node, and then performing synthesis to obtain an anonymous graph.

Description

technical field [0001] The invention relates to the field of neural networks in machine learning, in particular to a method, device and storage medium for anonymizing graph data based on deep neural networks. Background technique [0002] Today, with the rapid development of Internet big data research and application, it is often necessary to share a large number of data sets between different companies, research institutions, etc., and even many data sets are provided to researchers in the form of public release to fully mine data in scientific research. , public services and commercial applications. The content of the data involves all aspects of social life, including transaction data, cooperative network data, social network data, location trajectory data, medical data, telephone communication data, commodity purchase data, etc. The most common of these data sets are graph data sets that exist in the form of networks. Nodes and edges contain rich user attributes and ass...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F21/62G06N3/04G06N3/08
CPCG06F21/6254G06N3/08G06N3/045
Inventor 方俊斌贾焰李爱平周斌喻承蒋千越宋怡晨王培刘运璇郑新萍王浩王昌海李晨晨
Owner NAT UNIV OF DEFENSE TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products