A Method for Learning Function-Hierarchical Embedding Representations in Source Code in Hyperbolic Spaces

A technology of embedding representation and source code, applied in the field of network research, can solve problems such as lack of hierarchical information and space, and achieve efficient and accurate data dimensionality reduction

Active Publication Date: 2022-03-08
CENT SOUTH UNIV
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Aiming at the deficiencies of existing research methods, the present invention provides a method for learning the hierarchical embedding representation of functions in source codes in hyperbolic space, which solves the problems of lack of hierarchical information and insufficient space in existing research methods

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Method for Learning Function-Hierarchical Embedding Representations in Source Code in Hyperbolic Spaces
  • A Method for Learning Function-Hierarchical Embedding Representations in Source Code in Hyperbolic Spaces
  • A Method for Learning Function-Hierarchical Embedding Representations in Source Code in Hyperbolic Spaces

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0061] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some of the embodiments of the present invention, not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0062] The present invention is mainly based on the function call graph we constructed, which is a graph data structure with hierarchical information. Combining the Ridge curvature that can contain structural information as a weight, we can learn an effective function embedding representation in a hyperbolic space, which is the source The field of code research pioneered a new approach. At the same time, it improves the time and space efficiency ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for learning the function level embedding representation in the source code in the hyperbolic space, the method has the following steps: S1, data collection and construction stage: S2, Ridge curvature analysis stage: S3, Poincaré model learning Function embedding representation stage: the present invention applies the latest data mining technology, dimension reduction technology, deep learning processing technology, and visualization technology to hyperbolic space function embedding learning, which can effectively accommodate massive data, just like source code data; efficient and Accurately achieve data dimensionality reduction, and then visualize and analyze hierarchical information.

Description

technical field [0001] The invention relates to the technical field of network research, in particular to a function-level embedding representation method for learning source codes in a hyperbolic space. Background technique [0002] The successful application of distributed representations in natural language processing and the naturalness assumption have inspired researchers to apply distributed representations to source code mining. Most existing work on learning distributed representations from source code typically treats programs as sequences of symbols or bags of words. Some of them first parse the program to obtain the corresponding Abstract Syntax Tree (AST), which is suitable for tree-based neural networks or expand the AST to a graph for further processing. According to these underlying neural network models, existing works can generally be classified into the following categories: feed-forward neural networks, recurrent neural networks, convolutional neural netw...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F8/75G06K9/62
Inventor 刘燕鲁鸣鸣何小贤毕文杰刘海英
Owner CENT SOUTH UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products