Method for optimizing the DCT fast algorithm based on parallel processing in AVS

A parallel-processing fast-algorithm technique, applied in the field of audio and video coding and decoding, that reduces the amount of computation and improves computation speed.

Status: Inactive
Publication date: 2008-05-28
CENT ACADEME OF SVA GROUP

AI Technical Summary

Problems solved by technology

Here the element at position w00 is the DC component, and the other elements of the CoeffMatrix matrix represent AC components of different frequencies according to their positions. This change converts the matrix multiplications into additions, subtractions, and shifts, which reduces the amount of computation. The computation speed, however, still needs to improve before the encoder can encode images in real time.
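As a minimal illustration of how a multiplication by a small integer constant can be replaced by shifts and additions, consider the sketch below. The constants 10 and 9 and the function names `times10` and `times9` are assumptions chosen for illustration; they are not taken from the patent text.

```python
def times10(x: int) -> int:
    """Compute 10*x as 8*x + 2*x using two shifts and one addition."""
    return (x << 3) + (x << 1)


def times9(x: int) -> int:
    """Compute 9*x as 8*x + x using one shift and one addition."""
    return (x << 3) + x


if __name__ == "__main__":
    # Compare against ordinary multiplication for a few sample values.
    for x in (-7, 0, 3, 255):
        assert times10(x) == 10 * x
        assert times9(x) == 9 * x
    print("shift/add results match ordinary multiplication")
```

On hardware without a fast multiplier, decompositions of this kind trade each multiplication for a few cheaper operations, which is the kind of saving the paragraph above describes.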

Examples

Embodiment

[0035] The invention provides a method for optimizing the DCT fast algorithm based on parallel processing in the AVS standard, comprising the following steps:

[0036] Step 1. Data alignment:

[0037] Step 1.1: align the data to a whole-byte boundary within one cycle; 128-bit registers require 16-byte alignment;

[0038] Step 1.2: load the aligned data of the 8×8 data block, one item at a time, into the registers of the corresponding instruction set, such as MMX registers (64-bit) or SSE2 registers (128-bit);

[0039] Step 2. Temporary data storage, used when a register is needed but the register bank is full:

[0040] Step 2.1: set aside a temporary data storage space;

[0041] Step 2.2: store the data held in the register into the temporary data storage space;

[0042] Step 2.3: retrieve the data from the temporary data storage space;

[0043] Step 3. Instruction pairing: execute two different, non-conflicting instructions in the same cycle;

[0044] Step ...
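The alignment requirement in Step 1 can be sketched as follows. This is only an illustration under stated assumptions: the helper names `align_up` and `pack_row` are hypothetical, and the choice of signed 16-bit coefficients (so that one row of the 8×8 block fills exactly 128 bits) is an assumption that the text above does not state.

```python
import struct

ALIGNMENT = 16  # bytes; a 128-bit register holds 16 bytes


def align_up(address: int, alignment: int = ALIGNMENT) -> int:
    """Round address up to the next multiple of alignment (a power of two)."""
    return (address + alignment - 1) & ~(alignment - 1)


def pack_row(row) -> bytes:
    """Pack eight signed 16-bit values into 16 bytes (assumed coefficient width)."""
    return struct.pack("<8h", *row)


if __name__ == "__main__":
    # An address that is already a multiple of 16 is left unchanged.
    assert align_up(32) == 32
    # Any other address is rounded up to the next 16-byte boundary.
    assert align_up(17) == 32
    # One row of eight 16-bit values occupies exactly one 128-bit register.
    assert len(pack_row([1, -2, 3, -4, 5, -6, 7, -8])) == 16
    print("alignment and row size checks passed")
```

Under that assumption, an 8×8 block occupies eight 128-bit registers, one per row, which is why a buffer starting on a 16-byte boundary keeps every row aligned.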

Abstract

The invention provides a method for optimizing the DCT fast algorithm based on parallel processing in the AVS standard. Building on the DCT butterfly fast algorithm, the method applies parallel optimization through data alignment, temporary data storage, instruction pairing, data prefetching, data expansion or reduction, coefficient merging and multiplication, and so on. This improves the computation speed and reduces CPU occupancy time, so that the encoder can encode images in real time.
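The butterfly idea that the method builds on can be sketched as follows. The sketch uses the textbook floating-point 8-point DCT-II cosine basis without normalization, which is an assumption: it is not the integer transform defined by AVS, and the function names are hypothetical. It shows only the first butterfly stage, in which the symmetry of the basis lets sums and differences of mirrored inputs halve the number of multiplications.

```python
import math

N = 8


def dct_matrix():
    """Unnormalized 8-point DCT-II basis: C[k][n] = cos((2n+1)*k*pi/16)."""
    return [[math.cos((2 * n + 1) * k * math.pi / (2 * N)) for n in range(N)]
            for k in range(N)]


def dct_direct(x):
    """Reference: full matrix-vector product (64 multiplications)."""
    c = dct_matrix()
    return [sum(c[k][n] * x[n] for n in range(N)) for k in range(N)]


def dct_butterfly(x):
    """One butterfly stage followed by half-size products (32 multiplications)."""
    c = dct_matrix()
    half = N // 2
    # Even rows are symmetric, so they only need the sums of mirrored inputs.
    sums = [x[n] + x[N - 1 - n] for n in range(half)]
    # Odd rows are antisymmetric, so they only need the differences.
    diffs = [x[n] - x[N - 1 - n] for n in range(half)]
    out = []
    for k in range(N):
        src = sums if k % 2 == 0 else diffs
        out.append(sum(c[k][n] * src[n] for n in range(half)))
    return out


if __name__ == "__main__":
    x = [1, 2, 3, 4, 5, 6, 7, 8]
    direct = dct_direct(x)
    fast = dct_butterfly(x)
    assert all(abs(a - b) < 1e-9 for a, b in zip(direct, fast))
    print("butterfly stage matches the direct transform")
```

The four sums and four differences are independent of one another, which is what makes this stage a natural fit for processing several values per instruction.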

Description

Technical field

[0001] The invention relates to the technical field of audio and video coding and decoding in signal processing, and in particular to a method for optimizing the DCT fast algorithm based on parallel processing in the AVS video coding standard.

Background

[0002] Among the digital audio and video coding standards that have emerged in recent years, the representative ones are the international standard H.264/MPEG-4 AVC and the AVS standard, developed independently in China. The AVS standard adopts a series of techniques to achieve high-efficiency video coding, including intra-frame prediction, inter-frame prediction, the DCT (discrete cosine transform), quantization, and entropy coding. Inter-frame prediction uses block-based motion vectors to eliminate redundancy between images, intra-frame prediction uses spatial prediction modes to eliminate redundancy within an image, and the prediction residuals are then transformed and quantized to eliminate visual redundancy in...

Application Information

IPC(8): H04N7/26, H04N19/42, H04N19/436, H04N19/625, H04N19/70
Inventors: 陈勇, 李国平
Owner: CENT ACADEME OF SVA GROUP