Code classification method based on neural network linguistic model
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Applications(China)
- Current Assignee / Owner
- UNIV OF ELECTRONICS SCI & TECH OF CHINA
- Publication Date
- 2017-09-29
Smart Images

Figure 1 
Figure 2 
Figure 3
Abstract
Description
technical field
[0001] The invention relates to a code classification method, in particular to a code classification method based on a neural network language model, which can classify codes according to functions. Background technique
[0002] Hindle et al. used statistical methods to compare programming languages with natural languages and found that they had very similar statistical properties. These features are very difficult for humans to capture, but they demonstrate that learning-based methods can be applied to the field of code analysis. Code analysis methods based on machine learning have been studied for a long time, relying on a large number of artificial features when solving problems such as code error detection and code duplication analysis. For a specific problem, these features require a large amount of labeled data. Moreover, the data representation of this method is a one hot representation, that is, an N-dimensional vector is used to encode the N wo...