Unlock instant, AI-driven research and patent intelligence for your innovation.

Software evolution modeling method based on dynamic topic model

A topic model and software evolution technology, applied in special data processing applications, instruments, electronic digital data processing, etc., can solve problems such as inappropriate corpus and no consideration of document disorder.

Active Publication Date: 2014-10-08
YANGZHOU UNIV
View PDF2 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the software evolution model it adopts has the following problems: 1) It simply uses LDA to model the documents in the entire corpus at one time, and then calculates a series of indicators through the "document-topic" matrix to characterize the evolution process of software features. Good effect, but it does not take into account the principle of "documents in a single time period are disordered, and documents between time periods are ordered" (such as the Hall Model); 2) cumbersome post-processing such as topic connection is required process (such as the Link Model)
However, the dynamic topic model assumes that the evolution of topics obeys a normal distribution, and this limitation makes it disadvantageous for large changes, that is, it is not suitable for corpora where topics change drastically.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Software evolution modeling method based on dynamic topic model
  • Software evolution modeling method based on dynamic topic model
  • Software evolution modeling method based on dynamic topic model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0026] Such as figure 1 A software evolution modeling method based on a dynamic topic model is shown, including the following steps:

[0027] Step 1) Preprocessing: Tokenize the code corresponding to the commit message in the CVS database first, such as removing punctuation marks (such as "", "?"), removing numbers (such as "1", "23" ), etc., followed by stopping words (stopping), such as prepositions "for", "to"; pronouns "it", "he"; articles "a", "an", "the", etc., and finally stemming Stemming, transforming each word into its prototype, such as "fixed" is the past tense, converted to the prototype "fix"; "moving" is the present continuous tense, converted to the prototype "move", after three steps, Generate preprocessed message documents, and establish time series documents and vector space model documents respectively.

[0028] Step 2-1) Create a time series document: Divide the preprocessed message document according to the time interval T, T can be any time interval, g...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a software evolution modeling method based on a dynamic topic model in the field of software engineering. The method includes the steps that preprocessing is performed, a time sequence document is established, a vector space model document is established, the dynamic topic model is established, a 'document-topic' matrix is converted, and a 'topic-word' matrix is converted. Developers and a project manager are helped to understand the development emphasis in all stages of time in the project development process and the evolution process of development characteristics in the whole time period, the development and maintenance time of the developers is shortened, the software developing efficiency is improved, and the method can be used for software development and maintenance.

Description

technical field [0001] The invention relates to a software modeling method, in particular to a software evolution modeling method, which belongs to the field of software development engineering. Background technique [0002] As software projects are developed, their complexity increases and so does their maintenance and understanding. The attention of developers and project managers needs to shift between different topics and tasks many times, and they often need to answer questions such as: How has this software project changed over time? Which topic or task is the focus of the current work and how much energy is spent on it? What is the proportion of various software development features (such as fixing bugs, adding functions, modifying functions, improving functions, etc.) in different time periods? What is the focus of the same development feature in different time stages? To answer such a series of questions, you need to go to the software history library (such as so...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F9/44G06F17/30
Inventor 孙小兵胡嘉俊
Owner YANGZHOU UNIV
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More