Floating point peak value calculation throughput test algorithm based on INTEL AVX instruction set

A floating-point computing and instruction-set technology, applied to hardware fault testing methods, detection of faulty computer hardware, computation, etc.

Active Publication Date: 2018-07-10
青岛雷神科技股份有限公司

AI Technical Summary

Problems solved by technology

[0004] Neither scalar nor superscalar processors change one thing: a single calculation instruction still performs a single numerical calculation. Because simply adding execution units has a large impact on power consumption and heat generation, CPU computing performance has developed along two lines: multi-threaded concurrency and SIMD parallelism. Multi-threading does not change execution at the instruction level; it only increases the number of instructions executed per unit time. SIMD, by contrast, changes the earlier basic idea of instruction execution and introduces the concept of vector computing.
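As a rough illustration of the vector idea in [0004], the arithmetic below counts how many floating-point elements fit in one vector register and how many instructions an element-wise operation would then need. The function names are hypothetical and not taken from the patent, and the register widths are simply the 128-, 256- and 512-bit widths the abstract lists.

```python
# Illustrative arithmetic only; these helper names are assumptions, not part of the patent.

def lanes(register_bits: int, element_bits: int) -> int:
    """Number of floating-point elements that fit in one vector register."""
    if register_bits % element_bits != 0:
        raise ValueError("register width must be a multiple of element width")
    return register_bits // element_bits


def instructions_needed(n_elements: int, register_bits: int, element_bits: int) -> int:
    """Instructions required to apply one element-wise operation to n_elements values."""
    per_instruction = lanes(register_bits, element_bits)
    return -(-n_elements // per_instruction)  # ceiling division


if __name__ == "__main__":
    for width in (128, 256, 512):
        print(width, "bit:", lanes(width, 32), "x float32,", lanes(width, 64), "x float64")
```

Treating a scalar operation as a "register" exactly one element wide, 1000 single-precision additions take 1000 instructions, while the same work on 256-bit registers takes 125.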

Examples

Embodiment Construction

[0150] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0151] Referring to Figure 2, the present invention provides a technical solution: a floating-point peak calculation throughput test algorithm based on the INTEL AVX instruction set, comprising the following steps. To write this type of program, the requirements must first be clarified: the goal is to test the peak SIMD floating-point performance of the CPU, while also considering the throughput of the adder, multiplier and FMA fus...
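The relationship between add, multiply and FMA throughput and a peak floating-point figure can be sketched with the usual back-of-the-envelope formula. This is not the patent's method: the number of execution units per core, the clock frequency and the core count are hypothetical inputs that the reader must supply for a given processor, and counting an FMA as two floating-point operations per lane is the common convention assumed here.

```python
# Hypothetical helper names; units, ghz and cores are assumptions supplied by the caller.

FLOPS_PER_LANE = {"add": 1, "mul": 1, "fma": 2}  # an FMA counts as one multiply plus one add


def flops_per_instruction(op: str, register_bits: int, element_bits: int = 32) -> int:
    """Floating-point operations performed by one vector instruction."""
    return (register_bits // element_bits) * FLOPS_PER_LANE[op]


def peak_gflops(op: str, register_bits: int, element_bits: int,
                units: int, ghz: float, cores: int = 1) -> float:
    """Theoretical peak, assuming every unit issues one instruction per cycle."""
    return flops_per_instruction(op, register_bits, element_bits) * units * ghz * cores


if __name__ == "__main__":
    # Example inputs only: 2 FMA units per core, 3.0 GHz, 4 cores.
    print(peak_gflops("fma", 256, 32, units=2, ghz=3.0, cores=4))
```

A measured result can then be compared against this figure to see how close a test kernel comes to the theoretical ceiling.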

Abstract

The invention discloses a floating-point peak calculation throughput test algorithm based on the INTEL AVX instruction set. The method comprises the following steps: step 1: AVX 128-bit ADD/SUB; step 2: AVX 128-bit MUL; step 3: AVX 128-bit FMA; step 4: AVX 128-bit ADD+MUL; step 5: AVX 256-bit ADD/SUB; step 6: AVX 256-bit MUL; step 7: AVX 256-bit FMA; step 8: AVX 256-bit ADD+MUL; step 9: AVX 512-bit ADD/SUB; step 10: AVX 512-bit MUL; step 11: AVX 512-bit ADD+MUL; step 12: AVX 512-bit FMA; step 13: automatic iteration and nanosecond timing of the calculation process; step 14: for different processor configurations; step 15: single-threaded testing plus multi-threaded calling; step 16: displaying the test results. The invention makes it possible to evaluate, in the shortest possible test time, the vector computing power of current mainstream x86 processors, helping researchers quickly characterize a processor architecture's SIMD throughput and find the focus and peak point of its basic computing power.
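Steps 13 and 15 (automatic iteration with nanosecond timing, then single-threaded and multi-threaded runs) can be sketched as a small harness. This is a minimal sketch under stated assumptions, not the patented implementation: `kernel` is a pure-Python stand-in for what would be an AVX add, multiply or FMA loop in native code, and the names `calibrate` and `run` are hypothetical. Under CPython's global interpreter lock the threaded run shows only the calling pattern and will not scale the way native threads would.

```python
import time
from concurrent.futures import ThreadPoolExecutor


def kernel(iterations: int) -> float:
    """Stand-in workload: one multiply and one add (2 flops) per iteration."""
    acc = 1.0
    for _ in range(iterations):
        acc = acc * 1.0000001 + 0.5
    return acc


def timed_ns(fn, iterations: int) -> int:
    """Elapsed wall time of fn(iterations) in nanoseconds."""
    start = time.perf_counter_ns()
    fn(iterations)
    return time.perf_counter_ns() - start


def calibrate(fn, min_ns: int = 50_000_000, start_iterations: int = 1_000):
    """Double the iteration count until a single run lasts at least min_ns."""
    iterations = start_iterations
    while True:
        elapsed = timed_ns(fn, iterations)
        if elapsed >= min_ns:
            return iterations, elapsed
        iterations *= 2


def run(fn, iterations: int, threads: int = 1, flops_per_iteration: int = 2) -> float:
    """Run fn on `threads` workers; return flops per nanosecond (numerically GFLOPS)."""
    start = time.perf_counter_ns()
    with ThreadPoolExecutor(max_workers=threads) as pool:
        list(pool.map(fn, [iterations] * threads))
    elapsed = time.perf_counter_ns() - start
    return threads * iterations * flops_per_iteration / elapsed


if __name__ == "__main__":
    n, _ = calibrate(kernel)
    print("single-threaded:", run(kernel, n, threads=1))
    print("multi-threaded: ", run(kernel, n, threads=4))
```

Calibrating the iteration count first keeps each measurement long enough for timer resolution and start-up overhead to be negligible, while still keeping the overall test short.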

Description

Technical field

[0001] The invention relates to the field of floating-point algorithms, in particular to a floating-point peak calculation throughput test algorithm based on the INTEL AVX instruction set.

Background technique

[0002] In the past, mainstream CPUs increased their computing power per unit time by increasing the number of instructions executed per second. Usually, one instruction was executed per clock cycle to complete one calculation. As calculations grew more complex, however, CPUs introduced pipelined execution: the execution of a complete instruction is divided into multiple stages, with the stages, separated by registers, completed in pipeline fashion. Each clock cycle still completes one operation, so as long as the pipeline runs stably, approximately one instruction completes per clock cycle. A processor that completes one instruction per clock cycle in this way is called a scalar processor.

[0003] However, as t...
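The claim in [0002] that a stable pipeline completes approximately one instruction per clock cycle can be checked with the standard idealized cycle count. The sketch assumes a pipeline with no stalls in which every stage takes exactly one cycle; the function names are hypothetical.

```python
# Idealized model: no stalls or hazards, every stage takes one clock cycle.

def unpipelined_cycles(n_instructions: int, n_stages: int) -> int:
    """Each instruction runs all of its stages before the next one starts."""
    return n_instructions * n_stages


def pipelined_cycles(n_instructions: int, n_stages: int) -> int:
    """The first instruction fills the pipeline; each later one finishes one cycle after."""
    if n_instructions == 0:
        return 0
    return n_stages + n_instructions - 1


def instructions_per_cycle(n_instructions: int, n_stages: int) -> float:
    """Average completion rate over the whole run."""
    return n_instructions / pipelined_cycles(n_instructions, n_stages)


if __name__ == "__main__":
    print(unpipelined_cycles(1000, 5), pipelined_cycles(1000, 5))
    print(instructions_per_cycle(1000, 5))
```

For 1000 instructions on a 5-stage pipeline this gives 1004 cycles instead of 5000, so the average rate approaches, but never exceeds, one instruction per cycle.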

Claims

Application Information

IPC(8): G06F11/22, G06F9/30, G06F9/38
CPC: G06F9/3001, G06F9/3887, G06F11/2236, G06F11/2273
Inventor: 李艳兵, 陆凯林, 蒋钟文
Owner: 青岛雷神科技股份有限公司