Big data desensitization method

A big data desensitization technology, applied in the field of big data, that addresses the problem that data mining and data analysis cannot be carried out once the meaning of the data has been lost.

Pending Publication Date: 2020-10-23
中国农业银行股份有限公司上海市分行

AI Technical Summary

Problems solved by technology

For such data, the data itself has meaning. The traditional random value replacement and special character replacement desensitization methods change the data itself, so its meaning is partially or completely lost, and subsequent data mining and data analysis cannot be performed.

Examples

Embodiment Construction

[0025] The present invention proposes a big data desensitization method that desensitizes specified data in a multi-dimensional fact table. Figure 1 is a flowchart of the big data desensitization method according to an embodiment of the present invention. As shown in the figure, the method includes:

[0026] S101. Initialization step: read the specified data in the multi-dimensional fact table and arrange it into a data matrix. Each column in the data matrix corresponds to one dimension, and this data matrix is the original data matrix.
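
The initialization step can be sketched roughly as follows. This is a minimal illustration and not the patent's implementation: the row layout, the field names (`total_assets`, `rmb_assets`, `deposits`) and the helpers `build_data_matrix` and `column` are assumptions made for the example.

```python
def build_data_matrix(rows, dimensions):
    """Arrange the specified fields of each fact-table row into a matrix.

    Each inner list is one row of the fact table; column j holds the
    values of dimensions[j], so every column corresponds to one dimension.
    """
    return [[float(row[name]) for name in dimensions] for row in rows]


def column(matrix, j):
    """Return column j of the matrix, i.e. all values of one dimension."""
    return [row[j] for row in matrix]


# Hypothetical fact-table rows (made-up values). Only the specified
# numeric fields are read; the "customer" field is ignored here.
fact_rows = [
    {"customer": "A", "total_assets": 120000, "rmb_assets": 90000, "deposits": 30000},
    {"customer": "B", "total_assets": 45000, "rmb_assets": 45000, "deposits": 12000},
    {"customer": "C", "total_assets": 800000, "rmb_assets": 500000, "deposits": 150000},
]
dimensions = ["total_assets", "rmb_assets", "deposits"]

# The original data matrix: one row per customer, one column per dimension.
original_matrix = build_data_matrix(fact_rows, dimensions)
```

Keeping one dimension per column means a later column-wise transformation can be written as a function of a single list of values.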

[0027] The financial industry usually uses fact tables to store customer data. In a fact table, some data identify the customer, such as name, card number, address and mobile phone number; some data record customer asset information, such as total assets, Renminbi assets, foreign currency assets, wealth management products, credit cards, deposits and bonds; and some data record customer behavior information, such as trans...

Abstract

The invention discloses a big data desensitization method for desensitizing specified data in a multi-dimensional fact table. The method comprises an initialization step of reading the specified data in the multi-dimensional fact table and arranging it into a data matrix, where each column in the data matrix corresponds to one dimension and the data matrix is the original data matrix; and a spatial transformation step of transforming the specified data of each dimension column by column, the transformation comprising a stretching transformation, a contraction transformation or a distortion transformation, to obtain a transformed data matrix. After normalization, the difference between each value in the transformed data matrix and the corresponding value in the original data matrix is smaller than 5%. The method uses spatial transformation to desensitize sensitive data, preserves the relative spatial position information of the desensitized data, and keeps the data loss caused by the spatial transformation below 5%. The method can also be applied in a distributed framework to meet the big data processing requirements of a distributed system.
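
The column-wise transformation and the 5% condition can be sketched as below. This is only an illustration under stated assumptions: the abstract does not give the transformation formulas or the normalization, so `stretch` and `distort` are hypothetical examples, min-max scaling is assumed for normalization, and "smaller than 5%" is read as an absolute difference below 0.05 between normalized values.

```python
def min_max_normalize(values):
    """Scale a column to [0, 1]; a constant column maps to all zeros."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0] * len(values)
    return [(v - lo) / (hi - lo) for v in values]


def stretch(values, factor=3.0, offset=1000.0):
    """Hypothetical stretching transformation: linear scale plus shift."""
    return [factor * v + offset for v in values]


def distort(values, strength=0.1):
    """Hypothetical distortion: a mild, order-preserving nonlinear warp.

    With 0 <= strength < 1 the warp t + strength * t * (1 - t) is strictly
    increasing on [0, 1] and moves a normalized value by at most strength / 4.
    """
    lo, hi = min(values), max(values)
    span = (hi - lo) or 1.0
    warped = []
    for v in values:
        t = (v - lo) / span
        warped.append(lo + span * (t + strength * t * (1.0 - t)))
    return warped


def transform_by_column(matrix, transforms):
    """Apply transforms[j] to column j and return the transformed matrix."""
    columns = [list(col) for col in zip(*matrix)]
    new_columns = [f(col) for f, col in zip(transforms, columns)]
    return [list(row) for row in zip(*new_columns)]


def max_normalized_difference(original, transformed):
    """Largest per-cell gap between the two matrices after normalization."""
    worst = 0.0
    for col_o, col_t in zip(zip(*original), zip(*transformed)):
        norm_o = min_max_normalize(col_o)
        norm_t = min_max_normalize(col_t)
        for a, b in zip(norm_o, norm_t):
            worst = max(worst, abs(a - b))
    return worst


# Made-up original data matrix: two dimensions, three rows.
original = [[100.0, 5.0], [200.0, 7.0], [400.0, 6.0]]
transformed = transform_by_column(
    original, [stretch, lambda col: stretch(distort(col))]
)
loss = max_normalized_difference(original, transformed)
```

In this sketch the raw values change, while the ordering and relative spacing within each column are almost unchanged after normalization. A purely linear stretch gives a normalized difference of essentially zero, and the distortion strength bounds the rest.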

Description

Technical field

[0001] The present invention relates to the field of big data and, more specifically, to data security technology for big data.

Background

[0002] Data processing is becoming an important piece of infrastructure. In data processing, data security is particularly important, especially the security of sensitive data, so desensitization of sensitive data is likewise a piece of infrastructure. In the financial field, data desensitization in the prior art mostly uses random value replacement and special character replacement. The former changes data by replacing values at random (letters become random letters, numbers become random numbers); the latter changes data by substituting special characters (such as "*").

[0003] For data that has no specific meaning and only serves as an indicator, such as name, mobile phone number, card number, etc., this desensitization method is suitable. Indicative information such as name, mobile...
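
The two prior-art methods described in the background can be sketched as follows. This is an illustrative reading of that description, not code from the patent: the function names, the seeded generator and the sample phone number are all assumptions made for the example.

```python
import random
import string


def special_char_replace(value, mask="*"):
    """Replace every character of the value with a mask character."""
    return mask * len(value)


def random_value_replace(value, rng):
    """Replace letters with random letters and digits with random digits.

    Other characters (spaces, punctuation) are kept as they are.
    """
    out = []
    for ch in value:
        if ch.isdigit():
            out.append(rng.choice(string.digits))
        elif ch.isalpha():
            out.append(rng.choice(string.ascii_letters))
        else:
            out.append(ch)
    return "".join(out)


rng = random.Random(0)  # seeded only so the sketch is repeatable
phone = "13800001111"   # made-up mobile phone number
masked = special_char_replace(phone)
randomized = random_value_replace(phone, rng)
```

Both outputs keep the length and character type of the input but carry no information about its value. That is harmless for an indicator field such as a phone number; applied to a numeric field such as an asset amount, the result no longer reflects the original magnitude.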

Application Information

IPC(8): G06F21/62, G06Q40/02, G06F7/58, G06F17/16
CPC: G06F21/6254, G06F7/58, G06F17/16, G06Q40/03
Inventors: 臧其事, 赵可欣, 吴晓峰
Owner: 中国农业银行股份有限公司上海市分行