Data anonymization method and system

A data anonymization technology, applied in the field of data anonymization methods and systems, that addresses the inability of existing methods to handle text-type sensitive attributes and thereby prevents privacy leakage.

Status: Inactive
Publication Date: 2012-07-04
NEC (CHINA) CO LTD
Cites: 0; Cited by: 20
AI Technical Summary

Problems solved by technology

[0019] Although the existing methods introduced above deal effectively with privacy leakage based on sensitive attributes, they handle only numerical or categorical sensitive attributes and cannot handle text-type sensitive attributes.

Embodiment Construction

[0031] Features and exemplary embodiments of various aspects of the invention will be described in detail below. The following description covers numerous specific details in order to provide a thorough understanding of the present invention. It will be apparent, however, to one skilled in the art that the present invention may be practiced without some of these specific details. The following description of the embodiments is only to provide a clearer understanding of the present invention by showing examples of the present invention. The present invention is by no means limited to any specific configuration and algorithm presented below, but covers any modification, replacement and improvement of related elements, components and algorithms without departing from the spirit of the present invention.

[0032] It should be noted that the text-type, numeric-type, and/or category-type sensitive attribute values referred to herein are personal or individual private information ...

Abstract

The invention provides a data anonymization method and system. The method comprises: performing text analysis on text-type attribute values in the data; replacing those text-type attribute values with numeric-type or category-type attribute values according to the text analysis result; and anonymizing the data in which the text-type attribute values have been replaced. After anonymization, data that contained text-type attribute values is protected against privacy leakage based on those values while still retaining its use value.
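The three steps in the abstract (text analysis, replacement, anonymization) can be pictured with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the keyword table `KEYWORD_CATEGORIES`, the helper names, the sample records, and the use of identifier removal plus age bucketing as the anonymization step.

```python
from typing import Any, Dict, List, Set

# Hypothetical keyword-to-category table standing in for "text analysis".
KEYWORD_CATEGORIES: Dict[str, str] = {
    "flu": "respiratory",
    "pneumonia": "respiratory",
    "gastritis": "digestive",
    "ulcer": "digestive",
}


def analyze_text(text: str) -> str:
    """Step 1: map a free-text value to a category by keyword lookup."""
    lowered = text.lower()
    for keyword, category in KEYWORD_CATEGORIES.items():
        if keyword in lowered:
            return category
    return "other"


def replace_text_attribute(records: List[Dict[str, Any]], field: str) -> List[Dict[str, Any]]:
    """Step 2: replace the text-type value in `field` with its category."""
    return [{**r, field: analyze_text(r[field])} for r in records]


def anonymize(records: List[Dict[str, Any]], identifier_fields: Set[str],
              age_field: str = "age", bucket: int = 10) -> List[Dict[str, Any]]:
    """Step 3 (assumed technique): drop identifiers and generalize age into ranges."""
    out = []
    for r in records:
        kept = {k: v for k, v in r.items() if k not in identifier_fields}
        low = (kept[age_field] // bucket) * bucket
        kept[age_field] = f"{low}-{low + bucket - 1}"
        out.append(kept)
    return out


records = [
    {"name": "Alice", "age": 34, "diagnosis": "Patient presents with seasonal flu and cough"},
    {"name": "Bob", "age": 37, "diagnosis": "Chronic gastritis, advised diet change"},
]
result = anonymize(replace_text_attribute(records, "diagnosis"), {"name"})
print(result)
```

Once the free-text diagnosis has become a categorical value, any anonymization technique designed for numeric or categorical attributes could be applied in place of the simple generalization used here.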

Description

Technical field

[0001] The present invention relates to the computer field, and more particularly to a method and system for anonymizing data.

Background art

[0002] In statistics, microdata refers to data that contains personal information, such as the data in a hospital's medical database that records the age, gender, and diagnosis results of each patient. When microdata is published or shared, the protection of personal privacy has to be considered. Microdata usually includes the following three types of attributes: strong identification attributes (Explicit Identifiers), quasi-identification attributes (Quasi-Identifiers, QIs), and sensitive attributes (Sensitive Attributes). For a data record, the value of a strong identification attribute can be used to clearly identify the individual related to the record; for example, "name" and "ID number" are strong identification attributes. For a data record, it usually con...
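As an illustration of the three attribute types named in the background, the sketch below tags the columns of a hypothetical medical table with a role and splits a record accordingly. The field names, the `Role` enum, and in particular the assignment of "age" and "gender" to quasi-identifiers and "diagnosis" to sensitive attributes are assumptions made for this example; the excerpt itself only confirms "name" and "ID number" as strong identification attributes.

```python
from enum import Enum
from typing import Any, Dict


class Role(Enum):
    EXPLICIT_IDENTIFIER = "explicit identifier"
    QUASI_IDENTIFIER = "quasi-identifier"
    SENSITIVE = "sensitive attribute"


# Hypothetical schema: which role each column plays.
SCHEMA: Dict[str, Role] = {
    "name": Role.EXPLICIT_IDENTIFIER,
    "id_number": Role.EXPLICIT_IDENTIFIER,
    "age": Role.QUASI_IDENTIFIER,      # assumed role
    "gender": Role.QUASI_IDENTIFIER,   # assumed role
    "diagnosis": Role.SENSITIVE,       # assumed role
}


def split_by_role(record: Dict[str, Any]) -> Dict[Role, Dict[str, Any]]:
    """Group a record's fields by the role the schema assigns to them."""
    groups: Dict[Role, Dict[str, Any]] = {role: {} for role in Role}
    for field, value in record.items():
        groups[SCHEMA[field]][field] = value
    return groups


parts = split_by_role(
    {"name": "Alice", "id_number": "X123", "age": 34, "gender": "F", "diagnosis": "flu"}
)
print(parts[Role.QUASI_IDENTIFIER])
```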

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F21/24, G06F21/60
Inventors: 赵彧, 李建强, 刘博
Owner: NEC (CHINA) CO LTD