Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Data desensitization method for class information

A technology for category information and data desensitization, applied in the field of privacy protection and security, it can solve the problem of category information losing data analysis value, and achieve the effect of protecting security.

Active Publication Date: 2018-08-17
XIDIAN UNIV
View PDF8 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In the prior art, the desensitization method for category information is commonly used to be fixed replacement, that is, to replace all category information with the same fixed value. In this way, the frequency and percentage of category information are completely changed, so that the category information of the overall data set is lost. The value of data analysis

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data desensitization method for class information

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0026] The data desensitization method for category information according to the present invention comprises the following steps:

[0027] 1) Obtain the category information, and obtain the ASCII value corresponding to the control character according to the ASCII code comparison table, wherein the category information is the character between a-z and A-Z, and its corresponding hexadecimal ASCII values ​​are 61-7a, 41-5a respectively ;

[0028] 2) Randomly generate a certain decimal integer between 1-133, then convert the decimal integer into a number with the same base as the ASCII value in step 1), and then perform a sum operation with the ASCII value in step 1), If the category information "F" is desensitized, the hexadecimal ASCII value of "F" obtained after processing in step 1) is 46, and the number between 47-cb is obtained after the summation in step 2);

[0029] 3) Create a dictionary, where the keys of the dictionary are characters between 0-9 and a-f, of which there...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a data desensitization method for class information. The data desensitization method for the class information includes the following steps that 1) the class information is obtained, and ASCII values corresponding to control characters are obtained according to an ASCII comparison table; 2) decimal integers are randomly generated, the decimal integers are converted into numbers with the same scale with the ASCII values in the step 1), and then the obtained numbers and the ASCII values in the step 1) are summed; 3) a dictionary is established, wherein keys of the dictionary are characters between 0-9 and a-f, and values in the dictionary are 16 randomly-selected non-repetitive English capital letters; 4) every summation operation result serves as corresponding valuessearched by keys in the dictionary established in the step 3), then all the searched values are combined in sequence to form character strings, and finally, the character strings serve as data desensitization results for the class information. By means of the data desensitization method for the class information, the frequency and the percentage of the class information after desensitization canbe reserved, and data analysis values are not reduced.

Description

technical field [0001] The invention belongs to the technical field of privacy protection security, and relates to a data desensitization method for category information. Background technique [0002] With the elementization of data production, the continuous development of data science and data technology, and the in-depth mining and application of data value, a big data revolution is underway, and all walks of life are generating a huge number of data fragments every day. At present, a large amount of sensitive data such as customer category information has been accumulated in the business production system, such as whether a certain disease (Y / N), the degree of cure of a certain disease (H / M / L), etc. are very important Private information, once these private information is stolen and used by criminals, will bring economic and even reputation losses to individuals. Therefore, data owners must desensitize category information when using customer information. At present, t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F21/62G06F17/22
CPCG06F21/6245G06F40/126
Inventor 李辉孟雪
Owner XIDIAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products