Vectorized Montgomery modular multiplier design method based on NEON engine

A design method for a modular multiplier, in the field of program control design, instrumentation, and computation. It addresses the lack of NEON applications for public key encryption systems and improves the operation speed, the overall performance, and the signature verification rate.

Active Publication Date: 2015-09-30
成都三零嘉微电子有限公司

AI Technical Summary

Problems solved by technology

[0003] At present, there are many NEON solutions for multimedia applications, such as the OpenMax multimedia library, and there are also NEON applications for mathematical operations, such as the MathNeon mathematics library. However, there are almost no NEON applications for public key encryption systems.




Embodiment Construction

[0043] The technical scheme of the present invention is described in further detail below with reference to the accompanying drawings. As shown in Figures 2 and 3, a design method for a vectorized Montgomery modular multiplier based on the NEON engine comprises the following steps:

[0044] S1: Data input: load the operands A, B, M and n0 into registers, where $B_i$ is loaded at the beginning of each round of the data iterative calculation step. The integers are $Z=(Z_{s-1},\ldots,Z_0)_r$, $M=(M_{s-1},\ldots,M_0)_r$, $A=(A_{s-1},\ldots,A_0)_r$, $B_i$ and $T$, where $r=2^w$ and $n0 = M' \bmod r$; n is the length of the modulus in bits; w is the word width in bits; s is the number of iterations; n0 is a 32-bit constant in the present embodiment;
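The notation in S1 can be illustrated with a minimal Python sketch. It rests on assumptions the text above does not state: that $s=\lceil n/w\rceil$, and that $M'$ follows the conventional Montgomery definition $M' = -M^{-1}$, so that $n0 \cdot M_0 \equiv -1 \pmod r$. The helper names `split_words` and `montgomery_n0` and the sample modulus are hypothetical, chosen only for illustration.

```python
# Sketch only: word decomposition in radix r = 2**w and the constant n0.
# Assumes the conventional definition n0 = -M^{-1} mod r (not stated in the source).
# Requires Python 3.8+ for pow(x, -1, mod).

def split_words(x, s, w):
    """Return [X_0, ..., X_{s-1}], the s radix-2**w words of x, least significant first."""
    mask = (1 << w) - 1
    return [(x >> (w * j)) & mask for j in range(s)]

def montgomery_n0(m, w):
    """Return n0 with (n0 * m) % 2**w == 2**w - 1; m must be odd."""
    r = 1 << w
    return (-pow(m % r, -1, r)) % r

w = 32                      # word width in bits, as in the embodiment
m = (1 << 127) - 1          # sample odd modulus (an assumption for this example)
n = m.bit_length()          # modulus length in bits
s = (n + w - 1) // w        # assumed word count: ceil(n / w)
M = split_words(m, s, w)    # M = (M_{s-1}, ..., M_0)_r, stored low word first
n0 = montgomery_n0(m, w)
```

Only the lowest word $M_0$ influences n0, which is why it fits in a single 32-bit constant when w = 32.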

[0045] S2: Data initialization: the three temporary vector arrays, namely the result Z, the temporary data C and the carry data CT, are initialized to 0, where $Z=(Z_{s-1},\ldots,Z_0)_r$,...



Abstract

The invention discloses a design method for a vectorized Montgomery modular multiplier based on the NEON engine. The method comprises the following steps: 1, data are input; 2, the data are initialized; 3, data iterative computation is carried out; 4, Z and CT are accumulated to update the value of Z according to the iteration results; 5, overflow judgment and processing are carried out on the result Z. The carry data are cached separately rather than accumulated in every iteration, and they are accumulated once, after the last iteration has finished. As a result, the computation of the high-order data and the computation of the low-order data do not depend on each other, and vectorized data-parallel processing can be carried out by the NEON engine. In particular, the operation speed of Montgomery modular multiplication (MMM) on ARM Cortex-A series chips can be improved, which improves the overall performance of a public key encryption system.
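The five steps in the abstract can be sketched in Python as follows. This is an illustration of the general deferred-carry idea, not the patented procedure itself, whose detailed steps are truncated in this listing. The assumptions are: $n0 = -M^{-1} \bmod r$, $s=\lceil n/w\rceil$, inputs satisfying $0 \le a, b < m$ with m odd, and the mapping of the names Z, C and CT onto the variables below. The function name `mont_mul_deferred_carry` is hypothetical.

```python
# Sketch only: word-serial Montgomery multiplication with carries cached in CT.
# Returns a * b * R^{-1} mod m with R = 2**(w*s). Requires Python 3.8+.

def mont_mul_deferred_carry(a, b, m, w=32):
    assert m % 2 == 1 and 0 <= a < m and 0 <= b < m
    r = 1 << w
    mask = r - 1
    s = (m.bit_length() + w - 1) // w            # assumed: s = ceil(n / w)

    # Step 1: data input (words stored least significant first).
    A = [(a >> (w * j)) & mask for j in range(s)]
    B = [(b >> (w * j)) & mask for j in range(s)]
    M = [(m >> (w * j)) & mask for j in range(s)]
    n0 = (-pow(M[0], -1, r)) % r                 # assumed definition of n0

    # Step 2: initialization of the result words and the carry cache.
    Z = [0] * s
    CT = [0] * s

    # Step 3: s rounds; B_i is loaded at the start of each round.
    for i in range(s):
        b_i = B[i]
        # The quotient digit depends only on the lowest word.
        q = ((Z[0] + CT[0] + A[0] * b_i) * n0) & mask
        # Each lane j is independent of the others: this is the SIMD-friendly part.
        C = [Z[j] + A[j] * b_i + q * M[j] for j in range(s)]
        lo = [c & mask for c in C]               # low words
        hi = [c >> w for c in C]                 # carries, cached instead of propagated
        c0 = (lo[0] + CT[0]) >> w                # lowest word is 0 mod r; keep its carry
        # Divide by r: low words shift down one position, carries stay in place.
        Z = lo[1:] + [0]
        CT = [hi[j] + (CT[j + 1] if j + 1 < s else 0) for j in range(s)]
        CT[0] += c0

    # Step 4: accumulate Z and CT once, after the last round.
    z = sum((Z[j] + CT[j]) << (w * j) for j in range(s))

    # Step 5: overflow check; z < 2m holds here, so one subtraction suffices.
    return z - m if z >= m else z
```

Because the entries of CT grow by at most about 2r per round, they stay within a 64-bit lane for w = 32 and moderate s, which is what allows the carry propagation to be postponed to a single pass at the end. A result can be checked against the direct formula `a * b * pow(1 << (w * s), -1, m) % m`.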

Description

Technical field

[0001] The invention relates to a design method for a Vectorized Montgomery Modular Multiplier (VMMM) based on the NEON engine.

Background art

[0002] The "NEON media processing engine" integrated into Cortex-A series processors of the ARM v7 architecture is a Single Instruction Multiple Data (SIMD) instruction set extension, similar to the MMX / SSE instruction set extensions of the Intel architecture. The NEON engine can effectively accelerate multimedia applications and is widely used in handheld terminals and other lightweight electronic devices based on Cortex-A series processors.

[0003] At present, there are many NEON solutions for multimedia applications, such as the OpenMax multimedia library, and NEON applications for mathematical operations, such as the MathNeon mathematics library. However, there are almost no NEON applications for public key en...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F9/302
Inventor: 吴汶泰
Owner: 成都三零嘉微电子有限公司