Software project and third-party library knowledge graph construction method for software system

A software project, software-oriented technology, applied in software reuse, other database retrieval, unstructured text data retrieval, etc., can solve the problems of effectively obtaining reuse decision and reuse evolution knowledge support, and difficulty for developers.

Pending Publication Date: 2020-06-05
FUDAN UNIV
View PDF8 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, these knowledge are often multi-source and heterogeneous, making it difficult for develop

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Software project and third-party library knowledge graph construction method for software system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0029] The specific implementation of the present invention is described below for the GitHub Java open source project and the Maven third-party library, and its main use process is:

[0030] (1) Basic knowledge extraction of software projects. Extract the list of Java open source projects on GitHub through the GitHub API, use the Python library Scrapy to automatically crawl the web pages of each Java open source project, and then use the Python library Beautiful Soup to parse the web content to extract the basic knowledge of the software project; use the Python library Scrapy to automatically crawl Get the Maven third-party library list and the webpage of each third-party library, and then use the Python library Beautiful Soup to parse the webpage content to extract the basic knowledge of the third-party library software project; extract the business classification of the software project through the LDA topic model;

[0031] (2) Basic knowledge extraction of software project...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention belongs to the technical field of software engineering, and particularly relates to a software project and third-party library knowledge graph construction method for a software ecosystem. The method comprises the steps of obtaining basic knowledge of a software project and a release version of the software project and a version release notification of the software project by crawling and analyzing basic information; obtaining code cloning knowledge between release versions of different software projects through code cloning detection analysis; obtaining defects and defect repairknowledge in a software project release version by crawling and analyzing a defect tracking system of a software project, and analyzing link knowledge of the defects, a third-party library API and code cloning. The software project knowledge graph constructed by the method comprises software projects, software project release versions, relationships between defects and code clones and the like. The software ecosystem-oriented software project and third-party library knowledge graph constructed by the invention can support intelligent applications such as software project maturity evaluation,third-party library update recommendation, conflict detection and the like.

Description

technical field [0001] The invention belongs to the technical field of software engineering, and in particular relates to a software ecosystem-oriented software project and a method for constructing a knowledge map of a third-party library. Background technique [0002] Many software systems in the open source community and within the enterprise belong to related business fields or categories, compete with each other, depend on each other, and have a large number of duplicate codes and similar functions, forming a complex software ecosystem. It is an important means to improve the efficiency of software development and the quality of software products by selecting appropriate items from this software ecosystem and reusing them in the form of code or third-party libraries. When developers decide whether to reuse a software project, they not only need to consider the multi-dimensional knowledge of a software project (for example, the business classification, functional charact...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F16/36G06F16/951G06F8/36
CPCG06F16/367G06F16/951G06F8/36
Inventor 陈碧欢彭鑫赵文耘
Owner FUDAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products