GPU (Graphics Processing Unit) acceleration method used for hierarchical searching motion estimation

A technology of layered search and motion estimation, applied in the field of video processing, can solve problems such as inability to meet actual needs, limited search range, etc., to reduce communication, avoid thread idling, and improve processing speed.

Active Publication Date: 2014-09-24
PEKING UNIV SHENZHEN GRADUATE SCHOOL
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] After searching the existing technical literature, it is found that among the existing methods of using GPU to accelerate the motion estimation algorithm, only the method of accelerating the motion estimation algorithm for full search or global elimination search
And due to the limit of the number of threads that can be accommodated by the thread block, these acceleration methods are greatly limited in the search range and cannot meet the actual needs

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • GPU (Graphics Processing Unit) acceleration method used for hierarchical searching motion estimation
  • GPU (Graphics Processing Unit) acceleration method used for hierarchical searching motion estimation
  • GPU (Graphics Processing Unit) acceleration method used for hierarchical searching motion estimation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] The present invention will be described in detail below in conjunction with the accompanying drawings and embodiments. This embodiment is only an embodiment of the present invention but not all embodiments.

[0035] Layered search motion estimation is performed on a video sequence with a resolution of 1920×1080. The search image block size on the original image layer is 8×8, and the search area should cover an area centered on the current block with a width and a height of 256 pixels. A thread block of the used GPU can have 512 threads, and the warp size in the thread block is 16 threads.

[0036] In this application scenario, the following embodiment can be adopted to divide three image layers: the first image layer is the original image layer, the size of the search image block is 8×8, and the search area is an 8×8 area centered on the current block; The second image layer is a 960×540 resolution image downsampled from the original image, the search image block size ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a hierarchical searching motion estimation method used by utilizing GPU (Graphics Processing Unit) parallel computing capability acceleration, and the method comprises the following steps: generating images of different image layers in a hierarchical searching algorithm; self-adaptively carrying out threading distribution; carrying out SAD computation on each search image block at each search point in parallel; and utilizing a CPU (Central Processing Unit) to cooperatively look for the smallest SAD in parallel. The self-adaptive threading distribution scheme provided by the invention can satisfy the requirement of the resolution ratios of different searched images and the size of a searching area, the image downsapling is processed by the GPU in parallel to obtain better acceleration speed and reduce data communication between the CPU and the GPU, and the GPU can be effectively prevented from idling with a smallest SAD value lookup method cooperatively carried out by the GPU / CPU.

Description

technical field [0001] The invention relates to a motion estimation method in the field of video processing, in particular to a layered search motion estimation acceleration method using a GPU to assist a CPU. technical background [0002] Motion estimation is an important technology in the field of video processing, and plays an important role in applications such as hybrid video coding based on block matching, motion detection, and object tracking. The motion estimation algorithm calculates the SAD values ​​of all or part of the possible search points within the search range for each block, and obtains the best or near-best motion vector by finding the smallest SAD value. The full search motion estimation algorithm calculates the SAD of each possible point within the search range, and finally obtains the best motion vector. The full search motion estimation algorithm has a high computational load. Therefore, there are many fast search algorithms, such as three-step searc...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): H04N19/30H04N19/563H04N19/53
Inventor 王振宇王荣刚董胜富高文
Owner PEKING UNIV SHENZHEN GRADUATE SCHOOL
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products