A big data environment-oriented privacy information leakage prevention automatic identification method and system

A technology for automatically identifying private information, applied in the field of digital data protection. It addresses the leakage of private information that arises when data is openly shared, improves the recall rate and judgment accuracy of privacy screening, and is simple to implement.

Pending Publication Date: 2019-05-17
GUIZHOU AEROSPACE INST OF MEASURING & TESTING TECH

AI Technical Summary

Problems solved by technology

[0012] The technical problem to be solved by the present invention is to provide a method and system that automatically identify private information in a big data environment in order to prevent its leakage, thereby solving the leakage problem that open data sharing currently faces and ensuring the security of private information as data circulates.



Examples


Detailed Description of Embodiments

[0026] The present invention will be further described in detail through specific embodiments below, but these embodiments are only for illustration and do not limit the scope of the present invention.

[0027] Referring to Figures 1 to 3, the automatic identification method for preventing leakage of private information in a big data environment according to the present invention comprises three steps: screening keywords and establishing their automatic extraction; using the extracted keywords to filter out content that definitely contains no private information, thereby providing input to the privacy information judgment module; and performing in-depth content analysis on the data that passes this preliminary screening, judging whether it contains private information, and outputting the judgment result.
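The three steps in paragraph [0027] can be pictured as a two-stage filter followed by a judgment. The following is only a minimal sketch under stated assumptions: the keyword list, the 11-digit phone-number rule standing in for "in-depth content analysis", and the names `prefilter`, `judge`, and `identify` are all hypothetical and are not taken from the patent.

```python
import re
from dataclasses import dataclass

# Hypothetical keyword list; the patent text shown here does not enumerate keywords.
KEYWORDS = {"id number", "phone", "address", "passport"}

# Hypothetical stand-in for deep content analysis: an 11-digit number.
PHONE_RE = re.compile(r"\b\d{11}\b")


@dataclass
class Judgment:
    text: str
    is_private: bool
    reason: str


def prefilter(records, keywords=KEYWORDS):
    """Keep only records that mention at least one keyword.

    Records without any keyword are treated as definitely free of
    private information and never reach the judgment stage.
    """
    kept = []
    for record in records:
        lowered = record.lower()
        if any(keyword in lowered for keyword in keywords):
            kept.append(record)
    return kept


def judge(record):
    """Inspect a pre-screened record more closely (assumed rule-based here)."""
    if PHONE_RE.search(record):
        return Judgment(record, True, "matched phone-number pattern")
    return Judgment(record, False, "keyword present but no sensitive value found")


def identify(records):
    """Run the keyword pre-screen, then judge only the records that pass it."""
    return [judge(record) for record in prefilter(records)]


records = [
    "The weather is sunny today.",
    "Contact phone: 13800000000",
    "Please update your phone settings.",
]
results = identify(records)
for result in results:
    print(result.is_private, "-", result.reason)
```

In this sketch the first record never reaches `judge`, which illustrates how the keyword pre-screen reduces the volume of data the judgment stage has to process.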

[0028] In one embodiment, the steps of screening keywords and automatically extracting keywords include: based on practical experience and expert argumentation, perfecting the dic...



Abstract

The invention relates to a method and system for automatically identifying private information to prevent its leakage in a big data environment. The method comprises the steps of: screening keywords and establishing their automatic extraction; using the extracted keywords to filter out content that definitely contains no private information, thereby providing input to a privacy information judgment module; and performing in-depth content analysis on the preliminarily screened data, judging whether it contains private information, and outputting a judgment result. The method is simple to implement. Where the output port for data collection or data circulation faces massive volumes of data, the amount of data the privacy information judgment module must process is greatly reduced, and the judgment accuracy for private information is improved. Text keywords are extracted automatically and private information is analyzed in depth, so the judgment accuracy is high. The keyword list and the classifier are updated in a timely manner, which further improves the recall rate and the judgment accuracy of privacy information screening.
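The abstract's claim that updating the keyword list improves recall can be illustrated with a small calculation. This is a hedged sketch only: the labeled samples, the keyword sets, and the function `screening_metrics` are invented for illustration and do not come from the patent, which does not state how its metrics are computed.

```python
def screening_metrics(samples, keywords):
    """Return (recall, accuracy, kept_fraction) for a keyword pre-screen.

    samples is a list of (text, is_private) pairs; a record is flagged
    when it contains at least one keyword (case-insensitive).
    """
    tp = fp = tn = fn = 0
    for text, is_private in samples:
        flagged = any(keyword in text.lower() for keyword in keywords)
        if flagged and is_private:
            tp += 1
        elif flagged and not is_private:
            fp += 1
        elif not flagged and is_private:
            fn += 1
        else:
            tn += 1
    total = len(samples)
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    accuracy = (tp + tn) / total if total else 0.0
    kept_fraction = (tp + fp) / total if total else 0.0
    return recall, accuracy, kept_fraction


# Hypothetical labeled samples (text, contains private information?).
samples = [
    ("Phone: 13800000000", True),
    ("Home address: 12 Example Road", True),
    ("ID card no. 000000", True),
    ("Bus timetable for line 5", False),
]

old_keywords = {"phone", "address"}
new_keywords = old_keywords | {"id card"}  # assumed keyword-list update

old_recall, old_accuracy, old_kept = screening_metrics(samples, old_keywords)
new_recall, new_accuracy, new_kept = screening_metrics(samples, new_keywords)
print(f"before update: recall={old_recall:.2f} accuracy={old_accuracy:.2f}")
print(f"after update:  recall={new_recall:.2f} accuracy={new_accuracy:.2f}")
```

With these toy samples, the private record that the old list misses is caught once the list is extended, so recall rises; a larger keyword list also lets more records through to the judgment stage, which is the trade-off the pre-screen has to balance.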

Description

Technical field

[0001] The invention relates to a method and system for automatically identifying private information to prevent its leakage in a big data environment.

Background art

[0002] At present, as the opening and sharing of government data accelerates and big data is widely applied in government affairs, transportation, tourism and other fields, data providers such as governments and enterprises face serious problems and challenges from the leakage of private information. The leakage of private information has become a bottleneck that restricts the open sharing of big data and, in turn, the development of the big data industry.

[0003] In order to ensure that user privacy is not leaked during the open sharing of data in government affairs, transportation, tourism and other fields, the state has issued a series of laws and regulations related to information security, such as "Network Security Law", "Confidentiality Law", "Governme...


Application Information

IPC(8): G06F21/62
CPC: Y02D10/00
Inventor: 杨玉龙
Owner: GUIZHOU AEROSPACE INST OF MEASURING & TESTING TECH