Check patentability & draft patents in minutes with Patsnap Eureka AI!

Programming language type identification method and device based on software gene

A programming language and type recognition technology, applied in the field of data processing, can solve the problems of low recognition accuracy and efficiency, and achieve the effect of improving recognition accuracy and efficiency

Active Publication Date: 2022-02-18
上海戎磐网络科技有限公司
View PDF12 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] In view of this, the object of the present invention is to provide a method and device for identifying the programming language type based on software genes, so as to alleviate the technical problems of low accuracy and efficiency in the prior art for identifying the programming language type of binary executable files

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Programming language type identification method and device based on software gene
  • Programming language type identification method and device based on software gene
  • Programming language type identification method and device based on software gene

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0027] According to an embodiment of the present invention, an embodiment of a method for identifying a programming language type based on software genes is provided. It should be noted that the steps shown in the flow chart of the accompanying drawings can be implemented in a computer system such as a set of computer-executable instructions and, although a logical order is shown in the flowcharts, in some cases the steps shown or described may be performed in an order different from that shown or described herein.

[0028] figure 1 is a flow chart of a software gene-based programming language type identification method according to an embodiment of the present invention, such as figure 1 As shown, the method includes the following steps:

[0029] Step S102, obtaining executable files in multiple programming languages;

[0030] Step S104, using gene sequencing technology to extract the software genome of the executable file from the executable file;

[0031] Step S106, cons...

Embodiment 2

[0068] The embodiment of the present invention also provides a software gene-based programming language type identification device, the binary executable file programming language type identification device is used to implement the binary executable file programming language type identification method provided in the above content of the embodiment of the present invention , the following is a specific introduction of the binary executable file programming language type identification device provided by the embodiment of the present invention.

[0069] Such as figure 2 as shown, figure 2 It is a schematic diagram of the above-mentioned binary executable file programming language type recognition device, and the binary executable file programming language type recognition device includes: an acquisition unit 10, an extraction unit 20, a construction unit 30 and an identification unit 40, wherein the acquisition unit uses For obtaining executable files of multiple programming...

Embodiment 3

[0074] An embodiment of the present invention also provides an electronic device, including a memory and a processor, the memory is used to store a program that supports the processor to execute the method described in the first embodiment above, and the processor is configured to execute the program stored in memory.

[0075] see image 3 , the embodiment of the present invention also provides an electronic device 100, including: a processor 50, a memory 51, a bus 52 and a communication interface 53, the processor 50, the communication interface 53 and the memory 51 are connected through the bus 52; Executable modules, such as computer programs, stored in the execution memory 51 .

[0076]Wherein, the memory 51 may include a high-speed random access memory (RAM, Random Access Memory), and may also include a non-volatile memory (non-volatile memory), such as at least one disk memory. The communication connection between the system network element and at least one other netwo...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a programming language type identification method and device based on software genes, and relates to the technical field of data processing. The method comprises the steps: acquiring executable files of multiple programming languages; extracting a software genome of the executable file from the executable file by using a software gene sequencing technology; constructing a discriminator based on the software genome of the executable file; after obtaining a to-be-identified executable file, converting the to-be-identified executable file into a to-be-identified genome, inputting the to-be-identified genome into the discriminator, and determining a programming language type of the to-be-identified executable file. The technical problem that in the prior art, the recognition accuracy and efficiency of the binary executable file programming language type are low is solved.

Description

technical field [0001] The invention relates to the technical field of data processing, in particular to a software gene-based programming language type identification method and device. Background technique [0002] The file programming language type identification based on Yara rules requires reverse analysis of the executable files of the samples to be checked, and most of the reverse analysis needs to be done manually, with a large workload and a lot of time consumption; it is necessary to establish a large and complicated Yara rule base, once the rules The lack of missing, the probability of missed judgment and misjudgment will be higher, and the efficiency will be low. [0003] The file programming language type identification based on the suffix name, although it can quickly identify the file programming language type by identifying the suffixes such as ".c", ".cpp", ".py", ".java", ".go", etc., but it is very Unreliable, because it is extremely easy to change the su...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F8/30
CPCG06F8/31Y02D10/00
Inventor 刘旭胡逸漪章丽娟陈鹏王禹翔陈振兴
Owner 上海戎磐网络科技有限公司
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More