A multi-dimensional graph tensor fusion representation and embedding method for a code
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Patents(China)
- Current Assignee / Owner
- HUAZHONG UNIV OF SCI & TECH
- Filing Date
- 2023-05-23
- Publication Date
- 2026-06-26
AI Technical Summary
Existing code embedding methods based on text and symbols are inaccurate, cannot effectively capture the structured semantic information of the source code, and require a large training corpus, resulting in low efficiency. GINN ignores remote information and node order, leading to context loss.
A multidimensional graph tensor fusion representation method is adopted. By generating heterogeneous code graph structures such as AST, DDG, CFG and NCS of source code files, and combining graph convolution and tensor loop calculation, a high-dimensional code graph tensor is generated to learn the internal features of the code and apply it to tasks such as malicious code identification and vulnerability detection.
It improves the accuracy and efficiency of code semantic embedding, better captures the contextual information and long-term dependencies of the code, enhances the classification ability of neural networks, and improves the detection accuracy of downstream tasks.
Smart Images

Figure CN116720185B_ABST