Method, device and system for writing assembly codes and readable storage medium

A code and assembly technology, applied in the computer field, can solve problems such as basic blocks and conditional judgment blocks that are not suitable for data dependence, not considering the impact of hardware instruction optimization performance, and poor portability. The effect of saving calculation time and reducing the number of instructions

Inactive Publication Date: 2018-07-20
ZHENGZHOU YUNHAI INFORMATION TECH CO LTD
View PDF11 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, in the prior art, due to the poor portability of bottom-level optimization and the difficulty of optimization, bottom-level optimization is often ignored
At present, the commonly used low-level optimization method is the SLP (Superword Parallel) vectorization method. This method uses the memory access of adjacent addresses as the seed of packaging, and then expands the package heuristically through the definition-use chain and use-definition chain, and finally Using the dependency scheduling package, although making full use of the server's vectorized instructions, it does not take into account the optimization of the hardware instruction pipeline and the impact of memory access on performance, and it is also not suitable for basic blocks and conditional judgment blocks with data dependencies.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method, device and system for writing assembly codes and readable storage medium
  • Method, device and system for writing assembly codes and readable storage medium
  • Method, device and system for writing assembly codes and readable storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0040] The following will clearly and completely describe the technical solutions in the embodiments of the present invention in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some of the embodiments of the present invention, not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0041] The object of the present invention is to provide a method, device, system and readable storage medium for compiling assembly code, which can obtain source code with good initial performance, so that the program has good performance before it is optimized to improve calculation speed and save calculation time .

[0042] In order to enable those skilled in the art to better understand the technical solution of the present invention, the p...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for writing assembly codes. Firstly, by selecting an instruction or an instruction set with the minimum instruction time consumption from alternative instructions, thefunction which needs to be implemented by a program can be achieved with the minimum consumption of CPU cycles; then according to a preset instruction scheduling rule, the target instruction is rearranged so that normal execution of each instruction pipeline can be ensured, and the calculation speed is increased; finally, vectorization is performed on statements without data dependency in the program so that the instruction execution speed can be further increased; at least four pieces of consecutive data are read at a time when the program performs memory access so that the cache hit rate can be increased, and the calculation speed is further increased. According to the provided method for writing the assembly codes, source codes with good initial performance can be obtained, so that theprogram has good performance before being optimized, correspondingly the calculation speed is increased, and the calculation time is shortened. In addition, the invention also discloses a device andsystem for writing the assembly codes and a computer readable storage medium, and the device, the system and the computer readable storage medium all have the effects above.

Description

technical field [0001] The invention relates to the field of computers, in particular to a method, device, system and readable storage medium for compiling assembly codes. Background technique [0002] With the explosive growth of data in various application fields, the time spent on high-performance application computing in this field has also increased exponentially. In order to reduce the calculation time of high-performance applications and improve the timeliness of calculation results, each application can be optimized in two aspects, namely coarse-grained multi-process optimization and underlying optimization that is highly dependent on hardware. [0003] However, in the prior art, since the portability effect of the bottom-layer optimization is poor and the optimization is difficult, the bottom-layer optimization is often ignored. At present, the commonly used low-level optimization method is the SLP (Superword Parallel) vectorization method. This method uses the mem...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F9/38
CPCG06F9/3802G06F9/382G06F9/3867
Inventor 曹代
Owner ZHENGZHOU YUNHAI INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products