A method of constructing a training dataset of bad smell of code by combining code evolution information
A technology for training data sets and codes, applied in code compilation, program code conversion, neural learning methods, etc., can solve the problems of low reliability of data sets, inability to generate large-scale data sets, etc., to avoid overfitting and improve Predictive ability, effect of removing unwanted noise
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0030] For a given version of a software project, the entities marked as smelly by the automatic code smell detection tool can be divided into two categories: one type is detected by the code smell automatic detection tool as no bad smell in a subsequent version , is called the changed bad smell entity; the other type is the entity that is still detected as bad smell in a subsequent version, called the unchanged bad smell entity. After research, it is found that in the training data set that uses the code smell automatic detection tool to mark whether the entity has a bad smell, combined with the historical information of software evolution, only the changed bad smell entities are regarded as real bad smell entities, and they are compared with the baseline version Train and build a model of a supervised machine learning algorithm with entities that have not been recognized as bad smells by the automatic code smell detection tool in a subsequent version, instead of relying only ...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


