A Method for Automatically Generating Code Change Log Based on Nearest Neighbor Algorithm
Nearest neighbor technology is applied to the automatic generation of code change logs (program documentation, software maintenance/management). It addresses the problems of complex model structure, lack of interpretability, and high model training and running time, yielding a simple, easy-to-understand model structure and a reduced actual running time.
Inactive Publication Date: 2021-05-04
ZHEJIANG UNIV
AI Technical Summary
Problems solved by technology
[0004] (1) The model structure is complex and lacks interpretability;
[0005] (2) Model training and running time are expensive;
[0006] (3) Sensitive to noise and not robust.
Examples
Embodiment
[0045] An original code change log (commit message) is extracted from a Git repository and denoted OCM (original commit message). The corresponding diff information is obtained with the git diff command and denoted OF (original diff). OF is the input to NNGen (the nearest-neighbor-based algorithm for generating repository commit messages), and the output is denoted MG (message generated). The performance parameters of this method are shown in Table 1.
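As a rough illustration of how an (OCM, OF) pair might be collected, the following Python sketch shells out to git. The helper names `run_git` and `extract_pair` are hypothetical and do not come from the patent; the sketch also assumes the commit has a parent, since `git diff <commit>^ <commit>` fails on a root commit.

```python
import subprocess
from typing import Callable, List, Tuple


def run_git(args: List[str], repo: str = ".") -> str:
    """Run a git subcommand inside `repo` and return its standard output."""
    result = subprocess.run(
        ["git", "-C", repo, *args],
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


def extract_pair(
    commit: str, git: Callable[[List[str]], str] = run_git
) -> Tuple[str, str]:
    """Return (OCM, OF) for one commit.

    OCM is the raw commit message; OF is the diff against the first parent.
    The `git` parameter is injectable so the function can be exercised
    without a real repository.
    """
    ocm = git(["log", "-1", "--format=%B", commit]).strip()
    of = git(["diff", f"{commit}^", commit])
    return ocm, of
```

Called as `extract_pair("HEAD")` inside a repository, this would return the latest commit message together with its diff; OF could then be passed to a generator, and MG compared against OCM.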
Abstract
The invention discloses a method for automatically generating code change logs based on the nearest neighbor algorithm, in the field of automatic code change log generation. The method comprises: preprocessing the input data, preprocessing the training set data, obtaining a set of word frequency vector pairs through a bag-of-words model, calculating candidate intermediate results through the KNN algorithm, calculating BLEU‑4 values, and finally obtaining the output result. The method has a simple model structure and strong interpretability, requires no model training, greatly reduces actual running time compared with NMT, and is insensitive to noise and therefore robust.
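The steps listed in the abstract can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented implementation: whitespace tokenization, cosine similarity as the nearest-neighbor measure, k = 5, and an add-one-smoothed sentence-level BLEU-4 are all choices made here, and the function names are hypothetical.

```python
import math
from collections import Counter
from typing import List, Sequence, Tuple


def tokenize(text: str) -> List[str]:
    """Assumed preprocessing: lowercase and split on whitespace."""
    return text.lower().split()


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words frequency vectors."""
    dot = sum(count * b[token] for token, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def ngrams(tokens: List[str], n: int) -> Counter:
    """Count the n-grams of a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def bleu4(candidate: List[str], reference: List[str]) -> float:
    """Sentence-level BLEU-4 with add-one smoothing (an assumption)."""
    if not candidate:
        return 0.0
    log_precision = 0.0
    for n in range(1, 5):
        cand, ref = ngrams(candidate, n), ngrams(reference, n)
        overlap = sum(min(count, ref[gram]) for gram, count in cand.items())
        total = sum(cand.values())
        log_precision += 0.25 * math.log((overlap + 1) / (total + 1))
    # Brevity penalty: 1 when the candidate is at least as long as the reference.
    brevity = min(1.0, math.exp(1 - len(reference) / len(candidate)))
    return brevity * math.exp(log_precision)


def nngen(new_diff: str, training: Sequence[Tuple[str, str]], k: int = 5) -> str:
    """Return the commit message of the training diff closest to `new_diff`.

    `training` holds (diff, commit message) pairs. No model is trained:
    the top-k diffs by cosine similarity are re-ranked by BLEU-4 against
    the new diff, and the message of the best one is reused.
    """
    if not training:
        raise ValueError("training set must not be empty")
    query = tokenize(new_diff)
    query_vec = Counter(query)
    scored = [
        (cosine(query_vec, Counter(tokenize(diff))), index)
        for index, (diff, _) in enumerate(training)
    ]
    top_k = sorted(scored, key=lambda pair: -pair[0])[:k]
    _, best = max(top_k, key=lambda pair: bleu4(tokenize(training[pair[1]][0]), query))
    return training[best][1]
```

Because the output is always a message that already exists in the training set, each prediction can be traced back to the specific training diff that produced it, which is one way to read the interpretability claim.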
Description
Technical field
[0001] The invention belongs to the field of automatic generation of code change logs, and in particular relates to a method for automatically generating code change logs based on the nearest neighbor algorithm.
Background art
[0002] Techniques for generating code change logs have been studied extensively. DeltaDoc takes source code changes as input, obtains path predicates through symbolic execution, and then uses a set of predefined rules and transformations to generate commit messages. ChangeScribe first extracts the necessary information by analyzing the corresponding source code changes and the abstract syntax tree, then populates a predefined template with the extracted information to form the commit log. NMT was the first approach to apply deep neural networks to this field, using neural machine translation algorithms to automatically generate concise code change logs from source code.
[0003] However, the following problems generally exist in the current...
Application Information
Patent Type & Authority: Patents (China)
IPC(8): G06F8/73
CPC: G06F8/73
Inventor: 蔡亮, 张洋, 鄢萌, 刘忠鑫, 夏鑫, 李善平, 王新宇
Owner: ZHEJIANG UNIV