Accelerated implementation method of DCT algorithm and DWT algorithm based on CUDA architecture for image compression

An image compression implementation technology, applied in the field of image processing, that addresses the problem of low compression rate and achieves the effect of improving the compression rate.

Active Publication Date: 2021-02-02
BEIJING INST OF AEROSPACE CONTROL DEVICES +1

AI Technical Summary

Problems solved by technology

[0004] The purpose of the present invention is to solve the problem of low compression rate in existing image processing methods by proposing an accelerated implementation method of the DCT algorithm and the DWT algorithm, based on the CUDA architecture, for image compression.


Examples


Specific embodiment 1

[0028] The accelerated implementation method of the CUDA-architecture-based DCT algorithm for image compression in this embodiment is implemented by the following steps:

[0029] Step 1: analyze the software system and hardware system of the CUDA platform, and build the CUDA platform based on VS2010 under the Windows operating system. Here, CUDA is the abbreviation of Compute Unified Device Architecture, and VS2010 refers to Microsoft Visual Studio 2010, a Windows application development environment released by Microsoft;

[0030] Step 2: first, run the serial DCT algorithm on the CPU, to serve as a baseline for comparison with the method of the present invention. Then map the DCT algorithm to kernel functions of the two-level CUDA execution model to obtain the improved DCT algorithm, and run the improved DCT algorithm on the GPU. The DCT algorithm refers to the discrete cosine ...
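For reference, the block transform that step 2 parallelizes can be written out explicitly. The block below is the commonly used orthonormal two-dimensional DCT-II of an N x N block f(x, y) together with its inverse (IDCT). The block size N and the scaling factors are assumptions here, since the visible text does not state which convention the invention uses.

```latex
F(u,v) = \alpha(u)\,\alpha(v) \sum_{x=0}^{N-1} \sum_{y=0}^{N-1}
         f(x,y)\,
         \cos\frac{(2x+1)u\pi}{2N}\,
         \cos\frac{(2y+1)v\pi}{2N},
\qquad
\alpha(k) =
\begin{cases}
  \sqrt{1/N}, & k = 0 \\
  \sqrt{2/N}, & k > 0
\end{cases}

f(x,y) = \sum_{u=0}^{N-1} \sum_{v=0}^{N-1}
         \alpha(u)\,\alpha(v)\, F(u,v)\,
         \cos\frac{(2x+1)u\pi}{2N}\,
         \cos\frac{(2y+1)v\pi}{2N}
```

Each coefficient F(u, v) depends only on the pixels of its own block, so blocks, and the coefficients within a block, can be computed independently of one another. That independence is what makes a mapping onto a two-level grid of thread blocks and threads plausible.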

Specific embodiment 2

[0033] This embodiment differs from specific embodiment 1 in that the specific process of building the CUDA platform based on VS2010 under the Windows operating system, as described in step 1, is:

[0034] 1. The Windows operating system is Windows 7 32-bit Ultimate, the program development environment is built on Visual Studio 2010, and the CUDA version is CUDA 4.0,

[0035] 2. Prepare the following software packages:

[0036] Microsoft Visual Studio 2010, referred to as VS2010,

[0037] Graphics card driver: devdriver_4.0_winvista-win7_32_275.33_notebook.exe,

[0038] CUDA Toolkit v4.0: cudatoolkit_4.0.17_win_32.msi,

[0039] CUDA SDK v4.0: gpucomputingsdk_4.0.19_win_32.exe,

[0040] Visual Assist X, a plug-in assistant for VS2010,

[0041] Parallel Nsight v2.0: Parallel_Nsight_Win32_2.0.11166.msi,

[00...

Specific embodiment 3

[0058] This embodiment differs from specific embodiment 1 or 2 in that the process of running the serial DCT algorithm on the CPU, as described in step 2, is as follows. The relevant information of the CPU processor is:

[0059]

[0060]

[0061] As shown in Figure 2, the serial operation process of the DCT algorithm is: input the image; allocate storage space; convert the byte type to the float type; subtract 128 from each pixel value; calculate each coefficient of the first block, and repeat this step block by block until the last block has been processed; apply the IDCT algorithm; add 128 to each pixel value; convert the float type back to the byte type; generate the image; calculate the PSNR and the running time; release the storage space; end.
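The serial flow above can be sketched in a few lines of plain Python. This is a minimal illustration, not the implementation of the invention: the 8 x 8 block size, the orthonormal DCT scaling, the absence of a quantization step, and all function names (`dct_block`, `idct_block`, `roundtrip`, `psnr`) are assumptions introduced for the example, and the image is assumed to have dimensions that are multiples of the block size.

```python
import math
import time

N = 8  # block size: an assumption, the source text does not state it

def alpha(k):
    # Orthonormal DCT-II scaling factor (assumed convention).
    return math.sqrt(1.0 / N) if k == 0 else math.sqrt(2.0 / N)

# Basis matrix C[u][x] = alpha(u) * cos((2x + 1) * u * pi / (2N)).
C = [[alpha(u) * math.cos((2 * x + 1) * u * math.pi / (2 * N))
      for x in range(N)] for u in range(N)]

def dct_block(block):
    # Forward 2D DCT of one N x N block: F[u][v].
    return [[sum(C[u][x] * C[v][y] * block[x][y]
                 for x in range(N) for y in range(N))
             for v in range(N)] for u in range(N)]

def idct_block(coeff):
    # Inverse 2D DCT of one N x N coefficient block: f[x][y].
    return [[sum(C[u][x] * C[v][y] * coeff[u][v]
                 for u in range(N) for v in range(N))
             for y in range(N)] for x in range(N)]

def roundtrip(image):
    # image: list of rows of integers in 0..255.
    h, w = len(image), len(image[0])
    # Byte -> float conversion and level shift (each pixel value - 128).
    shifted = [[float(p) - 128.0 for p in row] for row in image]
    out = [[0.0] * w for _ in range(h)]
    # Process the blocks one after another, from the first to the last.
    for by in range(0, h, N):
        for bx in range(0, w, N):
            block = [row[bx:bx + N] for row in shifted[by:by + N]]
            rec = idct_block(dct_block(block))
            for i in range(N):
                for j in range(N):
                    out[by + i][bx + j] = rec[i][j]
    # Undo the level shift (+ 128) and convert float -> byte with clamping.
    return [[min(255, max(0, int(round(v + 128.0)))) for v in row]
            for row in out]

def psnr(a, b):
    # Peak signal-to-noise ratio in dB for 8-bit images.
    n = len(a) * len(a[0])
    mse = sum((p - q) ** 2
              for ra, rb in zip(a, b) for p, q in zip(ra, rb)) / n
    return math.inf if mse == 0 else 10.0 * math.log10(255.0 ** 2 / mse)

# Usage: a synthetic 16 x 16 test image (hypothetical data).
image = [[(r * 16 + c * 3) % 256 for c in range(16)] for r in range(16)]
start = time.perf_counter()
restored = roundtrip(image)
elapsed = time.perf_counter() - start
print("PSNR (dB):", psnr(image, restored), "time (s):", elapsed)
```

Because this sketch applies no quantization between the DCT and the IDCT, the round trip is lossless up to floating-point rounding; a real compression pipeline would quantize the coefficients, which is where the PSNR measurement becomes informative.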


Abstract

The invention relates to an accelerated implementation method of a DCT algorithm and a DWT algorithm based on the CUDA architecture for image compression, and belongs to the field of image compression. Existing image processing methods suffer from a low compression rate. In the method, the software system and hardware system of the CUDA platform are analyzed, and the CUDA platform is built based on VS2010 under the Windows operating system; the DCT algorithm and the DWT algorithm are each mapped to kernel functions of the two-level CUDA execution model to obtain the improved DCT algorithm and DWT algorithm, which are each run on the GPU; the improved DCT algorithm is run on the CUDA platform. The invention is applicable to implementing the DCT algorithm and the DWT algorithm on the CUDA platform. Executing the computation in parallel can achieve a speedup of several tens of times relative to the CPU, thereby effectively improving the compression rate of digital image compression.

Description

Technical field

[0001] The invention relates to an image processing method, in particular to an accelerated implementation method of a DCT algorithm and a DWT algorithm based on the CUDA architecture for image compression.

Background

[0002] Society has entered the information age, and digital images have become an important means of obtaining and exchanging information because they carry a large amount of information and are easy to process. The application fields of image processing touch every aspect of human life and work, including aerospace, biomedical engineering, communication engineering, industrial engineering, military and public security, and culture and art. However, digital images contain a large amount of data, which makes transmission and storage difficult. It is therefore necessary to study image compression technology to reduce the amount of image data, especially with the rapid development of...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): H04N19/42, H04N19/625, H04N19/436
CPC: H04N19/42, H04N19/436, H04N19/625
Inventor: 滑艺, 陈浩, 牛文龙
Owner: BEIJING INST OF AEROSPACE CONTROL DEVICES