A PDF document privacy leakage prevention method and system

A privacy disclosure and document technology, applied in digital data processing, program/content distribution protection, instruments, etc., can solve problems such as endangering user information security, privacy leakage, and undiscovered privacy leakage channels

Active Publication Date: 2019-03-01
INST OF INFORMATION ENG CHINESE ACAD OF SCI
View PDF7 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In the era of global information exchange and the wide application of big data technology, the information leaked from PDF documents is likely to be combined with information leaked from other sources, which will be used by criminals, causing more serious privacy leaks and endangering user information security.
As far as previous research is concerned, there are still possible ways of privacy leakage that have not been discovered, especially there is no easy-to-use system for ordinary users to discover and prevent privacy leakage of PDF documents

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A PDF document privacy leakage prevention method and system
  • A PDF document privacy leakage prevention method and system
  • A PDF document privacy leakage prevention method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] In order to enable those skilled in the art to better understand the technical solutions in the embodiments of the present invention, and to make the purpose, features and advantages of the present invention more obvious and easy to understand, the technical core of the present invention will be further described in detail below in conjunction with the accompanying drawings and examples instruction of.

[0035] The present embodiment combines the method and system for preventing privacy leakage of PDF documents proposed by the present invention in detail as follows:

[0036] The system consists of figure 1 As shown, it is divided into three modules: document sensitive component extraction module, document sensitive component display module, and document privacy information erasure module. The specific description of each module is as follows:

[0037] 1. The document sensitive component extraction module extracts sensitive components in PDF documents that may reveal p...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a PDF document privacy leakage prevention method, which comprises the following steps: extracting metadata of the PDF document and checking whether the PDF document has passwordprotection through the metadata; if the document is not password protected, checking whether the document is file copy protected; if the document is not protected from file copying, an optional description in PDF auxiliary support is extracted from it; Filter out descriptive text and Internet links from optional descriptions, leaving only file paths; presenting a component of the metadata that may contain privacy information and the file path to a user; according to the user's selection, the components of the document that will disclose the privacy information are erased to generate a PDF document which does not contain the privacy information and does not destroy the original structure and content. The invention also provides a PDF document privacy leakage prevention system, which comprises a document sensitive component extraction module, a document sensitive component display module and a document privacy information erasing module.

Description

technical field [0001] The present invention relates to the field of computer network security, in particular to a defense method and system for privacy leakage of PDF documents. Background technique [0002] PDF (Portable Document Format) is a general-purpose document format launched by Adobe, which can integrate rich text, images, tables, links and other information into one file, and can be stable on various devices and operating systems. present content. Thanks to the flexibility and stability of PDF documents, it is widely used in multiple scenarios such as information transmission, knowledge exchange, and data archiving. It has also become a common medium for internal and external information exchange in government, business, education and other fields. [0003] PDF documents carry a large amount of data and information. In addition to the text content, there are also personal information such as the author's name, affiliation, and contact information left in it. Bec...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F21/10
CPCG06F21/10G06F21/1066
Inventor 冯云刘宝旭崔翔刘潮歌刘奇旭
Owner INST OF INFORMATION ENG CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products