Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Sensitive information desensitization method based on pattern matching algorithm

A sensitive information and pattern matching technology, applied in the field of data desensitization, can solve the problems of not being able to obtain the original and real data content, and data miners cannot access the data, so as to achieve the effect of ensuring the original value and accurate desensitization

Inactive Publication Date: 2019-11-22
BEIJING INST OF COMP TECH & APPL
View PDF5 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] (1) Data mining personnel cannot access the original and real data. Even if the mining personnel obtain distorted data, the original and real data content cannot be obtained through analysis and reconstruction

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Sensitive information desensitization method based on pattern matching algorithm
  • Sensitive information desensitization method based on pattern matching algorithm
  • Sensitive information desensitization method based on pattern matching algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] In order to make the purpose, content, and advantages of the present invention clearer, the specific implementation manners of the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments.

[0047] like figure 1 As shown, a kind of sensitive information desensitization based on pattern matching algorithm provided by the present invention comprises the following steps:

[0048] Step 1: Define the text data to be desensitized, extract the text string T in the text data, and subsequently desensitize the sensitive information of the text string T;

[0049]Taking PDF documents as an example, it is necessary to analyze and understand its file format from the aspects of its object, file structure, document structure, content flow, and encoding. Analyze the hierarchical structure of the file to be desensitized, find the document directory dictionary object, obtain the content of other objects according to the ref...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a sensitive information desensitization method based on a pattern matching algorithm, and belongs to the technical field of data desensitization. The invention provides a sensitive information desensitization method based on a pattern matching algorithm. According to the method, sensitive information of original text character strings is accurately matched and positioned by using an improved Sunbay algorithm, and the sensitive information is desensitized by recording the positions of all the sensitive character strings and through the technologies of substitution, shuffling and the like in a data desensitization method. According to the method, sensitive characters in the text string are efficiently and accurately desensitized, and meanwhile, the original value ofdata is ensured.

Description

technical field [0001] The invention belongs to the technical field of data desensitization, and in particular relates to a sensitive information desensitization method based on a pattern matching algorithm. Background technique [0002] Sensitive data generally refers to data containing sensitive attributes that should not be disclosed. The content of sensitive data can generally determine the relevant intelligence information of an individual, such as personal ID number, personal mobile phone number, home address, family member relationship, etc. Data security is an important part of information security. In the process of model testing and experiments, it is often necessary to use some sensitive data for training. In order to protect the privacy of individuals, it is necessary to desensitize the data. [0003] Data desensitization, also known as data bleaching, refers to the transformation and hiding of data through defined desensitization rules for certain sensitive inf...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F21/62
CPCG06F21/6245
Inventor 唐舸轩石波乔序赵磊
Owner BEIJING INST OF COMP TECH & APPL
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products