Supercharge Your Innovation With Domain-Expert AI Agents!

Method and system for realizing rapid open source component detection of source code file

A source code file, component detection technology, applied in error detection/correction, software testing/debugging, instruments, etc., can solve problems such as rough detection granularity and unrealistic

Active Publication Date: 2021-11-30
PEKING UNIV
View PDF9 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, most SCA tools are mainly based on the feature file analysis of software projects to identify open source components, and the detection granularity is rough, and there are many errors; and as the scale of development software becomes larger and larger, manual comparison detection is performed at the code level Obviously even more unrealistic

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for realizing rapid open source component detection of source code file
  • Method and system for realizing rapid open source component detection of source code file
  • Method and system for realizing rapid open source component detection of source code file

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0016] Embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although certain embodiments of the present disclosure are shown in the drawings, it should be understood that the disclosure may be embodied in various forms and should not be construed as limited to the embodiments set forth herein; A more thorough and complete understanding of the present disclosure. It should be understood that the drawings and embodiments of the present disclosure are for exemplary purposes only, and are not intended to limit the protection scope of the present disclosure.

[0017] In the description of the embodiments of the present disclosure, the term "comprising" and its similar terms should be interpreted as an open inclusion, that is, "including but not limited to". The term "based on" should be understood as "based at least in part on". The term "one embodiment" or "the embodiment" should be read as "at least one embodi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention provides a method and a system for realizing rapid open source component detection of a source code file. The method comprises the following steps: generating a first Simhash value according to a target source code file, and carrying out matching analysis on the first Simhash value and a second Simhash value in a fingerprint database to obtain a third Simhash value in the first Simhash value; and obtaining source code files (namely auxiliary source code files) of a source code library corresponding to all or part of the third Simhash values, and performing code comparison or NLP text similarity analysis on the target source code file according to the auxiliary source code files. Through quick matching analysis based on the Simhash algorithm, the open source component detection efficiency is undoubtedly and greatly improved by the scheme, and in order to make up for false alarms possibly caused by quick matching of the Simhash algorithm, correction is carried out at relatively low overhead through corresponding measures in the scheme, so that the accuracy of related homologous component detection is ensured.

Description

technical field [0001] The embodiments disclosed in this application mainly relate to the technical field of software component analysis (SCA) security testing, and more specifically, relate to a method and system for realizing rapid open source component detection of source code files. Background technique [0002] For software development entities, code reuse is undoubtedly an element that can have a significant impact on software development performance. Generally speaking, code reuse in a narrow sense often refers to refining repeated code into reusable code. For example, the specific form can be a method, a set of templates, or a framework, etc., so as to avoid repeated coding code as much as possible. , thereby improving the coding efficiency. However, with the open source of third-party libraries and components, code reuse in a broad sense also includes: introducing third-party open source libraries and open source components into the developed software project; amon...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F11/36
CPCG06F11/3604
Inventor 张涛陈钟
Owner PEKING UNIV
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More