Unlock instant, AI-driven research and patent intelligence for your innovation.

Automatic software traceability recovery method for enhancing data preprocessing process

A technology to enhance data and restore methods, applied in the computer field, can solve problems such as unbalanced data samples and inability to directly apply enterprise projects

Pending Publication Date: 2021-09-10
NANJING UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In the face of the huge number of products and the potential tracking relationship between products in enterprise projects, the problem of data sample imbalance caused by the positive and negative labeling strategy in the binary classification model will become more and more serious. At the same time, how to effectively maintain the relationship between products in enterprises There is no long-term stable strategy for the traceability of the tracking relationship, and temporary measures are taken. The above problems make the automatic traceability recovery method of open source projects unable to be directly applied to enterprise projects.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Automatic software traceability recovery method for enhancing data preprocessing process
  • Automatic software traceability recovery method for enhancing data preprocessing process
  • Automatic software traceability recovery method for enhancing data preprocessing process

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0055] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0056] In this embodiment, the tracking relationship between recovery requirements and code submission, defect and code submission is mainly described, so as to provide support for the successful implementation of software process simulation modeling in enterprises. Software process simulation modeling has attracted widespread attention because it can support software companies to achieve quantitative management and improve software process maturity. However, ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an automatic software traceability recovery method for enhancing a data preprocessing process, which comprises the following steps: selecting a product of which a tracking relationship is to be recovered, extracting related fields of the product to carry out data cleaning and carrying out feature engineering to obtain a sample data set; dividing the sample data set into a marked data set and a missing tracking data set by using a label marking method; segmenting the marked data set into a marked training set and a test set by using a four-fold time sequence verification method; combining the mark training set and the missing tracking data set by using a semi-supervised unbalanced learning framework to generate a new training set; using a plurality of resampling modes, balancing a training set, training a dichotomy model, evaluating the performance of the dichotomy model, and recovering the tracking relation between products. Starting from an enhanced data preprocessing process, the problems of many project products, poor data quality, unbalanced sample data and the like are solved through multiple enhancement measures, and the F1 value, the accuracy rate and the recall rate are greatly improved.

Description

technical field [0001] The invention relates to the technical field of computer technology, in particular to an automatic software traceability restoration method for enhancing the data preprocessing process. Background technique [0002] Software traceability is the ability to relate any uniquely identifiable software artifact to other artifacts, maintain and use the resulting network to answer questions about the software product and its development process. Software traceability technology is dedicated to creating or maintaining the traceability relationship between different artifacts, which helps to improve the quality of process-oriented data. However, software traceability is a difficult and error-prone task. The main difficulty comes from how to fill the logical abstraction gap between the requirements written in natural language and the code written in programming language. At the same time, in the face of the huge number of products and the number of potential tr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
CPCG06F18/24323G06F18/214Y02D10/00
Inventor 陈静张贺董黎明匡宏宇荣国平邵栋
Owner NANJING UNIV