Code leakage detection system and method based on natural language processing technology

Natural language processing applied to code leak detection in the field of information security. It addresses the limited coverage of manual inspection under constrained labor costs and reduces the time and money that inspection consumes.

Active Publication Date: 2020-04-10
NANJING FUJITSU NANDA SOFTWARE TECH
3 Cites, 4 Cited by

AI Technical Summary

Problems solved by technology

Manual inspection has limited reach. An enterprise has many historical projects, including many large-scale projects with large amounts of code. Relying on manual inspection alone, constrained by labor costs, covers only a limited range.

Embodiment Construction

[0097] Embodiments of the present invention will be described in further detail below in conjunction with the accompanying drawings.

[0098] The code leakage detection system and method based on natural language processing technology of the present invention mainly comprise the following:

[0099] 1) Construction content: To ensure the information security of the enterprise, it is necessary to check whether internal code and other materials have been leaked to the Internet. At present, this work requires manually extracting keywords, searching for web pages, and judging them, which not only consumes a great deal of labor but also limits the scope of inspection. The present invention uses AI technology (especially NLP technology) to improve the relevant steps and achieve automatic inspection.

[0100] 2) Key technology: use NLP and other techniques to analyze the project's source code and extract representative keywords; use crawler technology to re...
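
The keyword-extraction step can be illustrated with a minimal sketch. The text does not say which NLP technique is used, so TF-IDF against a background corpus of generic code is an assumption here, and the names `STOPWORDS`, `tokenize`, and `top_keywords` are hypothetical. The idea is that terms frequent in the project but rare in generic code are the most representative ones to search for.

```python
import math
import re
from collections import Counter

# Hypothetical stop list: language keywords that say nothing about a specific project.
STOPWORDS = {"def", "return", "import", "from", "class", "self", "else",
             "for", "while", "none", "true", "false", "int", "str"}


def tokenize(source: str) -> list[str]:
    """Split source code into lowercase words, breaking snake_case and camelCase."""
    tokens = []
    for ident in re.findall(r"[A-Za-z_][A-Za-z0-9_]*", source):
        for part in ident.split("_"):
            # Break camelCase / PascalCase identifiers into their words.
            for word in re.findall(r"[A-Z]+(?![a-z])|[A-Z]?[a-z]+", part):
                word = word.lower()
                if len(word) > 2 and word not in STOPWORDS:
                    tokens.append(word)
    return tokens


def top_keywords(project: list[str], background: list[str], k: int = 5) -> list[str]:
    """Rank project terms by TF-IDF (an assumed scoring, not the patent's stated method).

    Term frequency is counted over all project files; inverse document
    frequency is computed over a background corpus of unrelated code.
    """
    term_freq = Counter(t for src in project for t in tokenize(src))
    background_docs = [set(tokenize(src)) for src in background]
    n_docs = len(background_docs)
    scores = {}
    for term, tf in term_freq.items():
        doc_freq = sum(1 for doc in background_docs if term in doc)
        idf = math.log((1 + n_docs) / (1 + doc_freq)) + 1.0  # smoothed IDF
        scores[term] = tf * idf
    # Highest score first; ties broken alphabetically for a stable result.
    return sorted(scores, key=lambda t: (-scores[t], t))[:k]


project_files = [
    "def compute_acme_ledger(acme_ledger, values):\n"
    "    return acme_ledger.total(values)\n"
]
generic_files = [
    "def total(values):\n    return sum(values)\n",
    "def compute(values):\n    return max(values)\n",
]
keywords = top_keywords(project_files, generic_files, k=2)
```

In this toy input, the project-specific words outrank generic ones such as `values`, which also appear in the background files. The resulting keywords would then serve as search queries for the crawling stage.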

Abstract

The invention discloses a code leakage detection system and method based on natural language processing technology. The method treats code as a special language that is stricter and more logical than natural language, analyzes source code with natural language processing technology in place of manual analysis, and extracts the key information that best represents a project. It extracts webpage information using web crawler technology and screens and extracts content from the acquired webpage information using machine learning. Using a strategy that combines a traditional text similarity calculation method with a deep learning algorithm, it compares the similarity of the source code and suspicious online code, displays suspicious webpages according to similarity, and provides a basis for final judgment. This solves the problems that arise when source code is analyzed and key information is extracted by traditional manual means: the extraction relies excessively on human experience, and large-scale code analysis is inefficient, time-consuming, and costly.
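
The similarity-comparison and ranking step might look like the following sketch. The abstract does not name the traditional similarity measure or the deep learning model, so bag-of-tokens cosine similarity is an assumption, the deep learning score is left as an optional pluggable function, and all names (`cosine_similarity`, `rank_pages`, `learned_score`, `weight`, the page identifiers) are hypothetical.

```python
import math
import re
from collections import Counter
from typing import Callable, Optional


def token_counts(code: str) -> Counter:
    """Count identifier-like tokens in a piece of code (case-insensitive)."""
    return Counter(re.findall(r"[A-Za-z_][A-Za-z0-9_]*", code.lower()))


def cosine_similarity(a: str, b: str) -> float:
    """Cosine similarity between the token-count vectors of two code snippets."""
    va, vb = token_counts(a), token_counts(b)
    dot = sum(va[t] * vb[t] for t in va.keys() & vb.keys())
    norm = (math.sqrt(sum(c * c for c in va.values()))
            * math.sqrt(sum(c * c for c in vb.values())))
    return dot / norm if norm else 0.0


def rank_pages(
    source: str,
    pages: dict[str, str],
    learned_score: Optional[Callable[[str, str], float]] = None,
    weight: float = 0.5,
) -> list[tuple[str, float]]:
    """Return (page id, score) pairs, most similar first.

    `learned_score` stands in for a deep learning similarity model; when it
    is given, its output is blended with the cosine score using `weight`.
    """
    ranked = []
    for page_id, snippet in pages.items():
        score = cosine_similarity(source, snippet)
        if learned_score is not None:
            score = (1 - weight) * score + weight * learned_score(source, snippet)
        ranked.append((page_id, score))
    return sorted(ranked, key=lambda item: item[1], reverse=True)


internal_code = "def acme_ledger_total(entries): return sum(e.amount for e in entries)"
crawled = {
    "page-a": "def acme_ledger_total(entries): return sum(e.amount for e in entries)",
    "page-b": "print('hello world')",
}
ranking = rank_pages(internal_code, crawled)
```

Here the verbatim copy on the hypothetical `page-a` ranks first and the unrelated `page-b` ranks last. The ranked list only prioritizes pages for review; the final judgment on whether a page is a leak remains with a person, as the abstract describes.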

Description

Technical field

[0001] The invention belongs to the technical field of information security, and in particular relates to a code leakage detection system and method based on natural language processing technology.

Background art

[0002] The information security of an enterprise bears directly on its normal development. For IT companies, information leakage is a typical information security incident, mainly manifested as the leakage of code, documents, and other project materials to the Internet. In recent years, code leakage incidents have occurred frequently and have had negative impacts on enterprises and individuals, so information security faces great challenges. Leakage of a product's core code may pose a potential threat to product security; leakage of confidential enterprise information may cause direct economic losses to the enterprise; beyond the enterprise, information leakage will also endanger the personal privacy of public ...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F21/56, G06F21/55
CPC: G06F21/563, G06F21/552, G06F21/554
Inventor: 许方超, 钱志强, 周文烨, 陈奕鑫
Owner: NANJING FUJITSU NANDA SOFTWARE TECH